Gene Athe_2303 details

Gene Information

Locus tag: Athe_2303
Symbol:
ID: 7407722
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 2436246
End bp: 2439302
Gene Length: 3057 bp
Protein Length: 1018 aa
Translation table: 11
GC content: 38%
IMG OID: 643716667
Product: S-layer domain protein
Protein accession: YP_002574146
Protein GI: 222530264
COG category:
COG ID:
TIGRFAM ID:
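The start, end, gene length, and protein length listed above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record above.
start_bp = 2436246
end_bp = 2439302

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; assume the last codon is a stop and adds no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 3057 0 1018
```

Under those assumptions the result agrees with the listed Gene Length (3057 bp) and Protein Length (1018 aa).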


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAT TCAAAAGACT CATCGCTATC GTAACAGTTC TCTTATTTGC TCTTTCAATA 
ATTGCGCCTG TGTTTGCACA GGACGAAGCT ACTACAGAAG AGACAGCTGG TTCTGTGTAT
GACCAGGCAG CAAAGATTCT CCAAGACAAA GGTATCTTGA AAGGTAATGA GCAAGGCGAC
CTGATGCTCG ACAAGCAACT TACAAGAGCA GAAATCTTGG CAATGATTAT CAGAGCAACA
GGTCAAGAAG ATGTGGTAAA AGACTATGTT TATGCTGAGC AGTCATTCAC AGATGTTCCA
CAGGATCATT GGGCATTTGC ATATGTTGAG GCAGGTAAAG ACCTCGGCAT TGTAAATGGT
TACCCAGATG GAACATTCAA GCCAGACAAA CCTGTCAAGT TTGAAGAGCT CTGCAAGATG
CTTGTTGCTG CAAAAGGTGA AAGCCCAGCA GCAGGAAAAT GGCCTCTCAA CTATGTAAGA
AAAGCTCTTG AACTTGGGTT CTTCAATGGT ATTGAAGATG AAGTTGGAAT TGGTGATGTT
GTAATCAGAG GTCAAGCAGC TGTAGCATTC GCAAATGCAT TCTTCCCACC AGAAAAGACT
ATAGTTGTTA AAGATGTTAA AGCAGTTGCA AATGACACAA TTGAAGTTTA TGTTGATGCA
TATCTTGGAA ATGAGCCTGC AACACTTGAA GATGGCGATG TAATTCCTCT TGACTTTGAG
ATCAAGGATG CATCTGATGC ATCAAAGACA ATTGCGGTAA CATTAATTGA TTCTCAGGCA
TCTGACTTTG GAGCAGGAAA GCTTGTGCTC AAGACAGCAG CGCAAACAGA AGGTGCTACT
TATAAACTGT ATTACAAGGG TAATGATACA GGCAAAACTT TTGTAGCAGT ACCAGTACAG
CTTCAAGTTG CAAAAGTTGA AGTACCAAAC TTGAAGCAAG TTGTTGTAAC GTTCAATAGA
GATGTAAAGG ACGTTTATGC TGTTGATAAA AATAACTATG AAATTAAAGT AGGAGATGCA
ACAAAGTCAA TTGGTGCAGT GCGACTGAGC GAAGATAAGA AGAAGGTTAC ACTAATTTTG
AAAGACGATT TAGCAAACCA GGACAAAGTT AAAGTTACAC TCAAAACAGG TTTGGGTCTT
AAGGAGGCAT ATACAACTGA AATTGGCCCT GTACAAGATG GAACAGCTCC GGCTATAGTA
AAGGTTACAG CTGAGAACCC GCAGAAGCTT AGAGTTGAGT TCAGTGAACC AGTTAAAAAC
TATGCAAATC CTGCAAACTA TACATTGAAC GGAATGTACC TCGTAAAAGA GGTTAAGGGA
TTCAATGAAG ACAGTTCTAT AGACCCAAAT GTAATTGTTC TGAATCTTTA CATTCCTCTG
AATGTTGGTA ATAACACATT GAGTGTTACA GGTGTTACAG ATATAGCTGG ATTGCTTGTA
GTAAATCCAT CAATGAGCTT TAGTGTTGCA GAAGACAAGT CACCTATTGA GTTGAAGAAT
GTCACAGCAA CACTTAGCCA GGTTAAGCTT GAATTTAGCA AAGCAATACA GAGCATTGTA
AGTGTATCGT TGTCAAATGG TATTATAGCA GGTACATCAG TAGATGGTAA CGTAGTAACA
ATATCTTCTG GTGATGAGTT GGCAACTTGT ATACCAGTAT CTGGTGCTAA GGTTACAATA
GAAGTTAAGG ATTATACTGG GCAGACAGCA AAATTTGAAA AATACGTTGT TCCAACAATT
GACACAGAAA GACCTACTGT TAAATCAGTT ACAATAGCTG ATTCTACTAC AGTAAAGGTA
ACATTTAGTG AAGATGTACG TGTTCCTAAT CCTTCTGACA ACAAGATTAT TGTGAAAGAT
AAAGATGGTA ACCAAAAAGT TATCAGTGTA ATTACCTGGG ATACAGACAG TAGTGGTAAT
CAGATTAAGA ATACATTGAA AGTTGTATTG TCTTCACAAC TGCCAGCAGG TGTTGTAACA
GTTCAAGTAA GTGGCATTGA AGACCTCACA CCACTTAAGA ATGCAAGCTT GCCGCAGACA
GTTACAGCTA CACTTTCTGA TACAAGTGCA CCGGATGTAG TTGCTCCAAT AGTATACAAT
GATGTTTCTG GCGACACAAC AGAACTCTAT ATAACATTTA ATAAGACATT GAATGCAGCA
AGCGCTAATA TAGCTACAAA TTATAAGTAT CTGGATAGCA GTTATGTACT TAAGGATTTC
AGTGGTGCAA CTGCAAGCGT GTTGGCAAAT GGTAAAACTG TGAAACTTGT AATTAAAGAT
TCTGAATTTG CTAATATGAC ATACTTGCAG ATAATTGGTG TTGCAGATAC AAATGGTAAT
AAGGCAACAT TGGCAATTCA GAAAGATGCT ACTAAATTTG TATCTTCATC CACTGCAATT
GTTACTATTG ACTCAACTAA CGGTGTTCAA GCTGTTTCAA CAACACAGTT AAAAGTATTC
TTAACAGGCA ATATTAATGA ATATACACTT TATGCTGGGG ATTTTGAGGT CAAAGCAGGG
TCAAATACAA TAGGAGTTCT TTATGCGACA TGGGATGCAG GTAGCAAAGC TGTTGTTTTG
AACCTTGCAA CAGCAATTGG TGCTGATGCT AAGAAAGATG GAAATGGTGT GACTGTAACT
ATCAAGGCTA ACTCAATTAC AAAAGACCTG CTTGGAAGAT CAATTAATAG TGGCAGTGCA
GTAGGACCAG TGACTGCAAG TGATAAGATT GCACCAACAA TTACTGCAGT TGAAGCTGTA
TACAATGCAA CATATAATGT TACAGAGGTT ACAGTTAAAT TCAGTGAAGC AATTGTTGTA
GATTCAACAT ATGTTCTTGA CCAATTCAAG GTGTATATCG GAGGAGCAAT AACAAATCCT
GATGCAGGCG TTGATGTTAA GACCGACAAT ATAGTCTTCA AGTTCAGTGG TGACAAGAGA
TACAGCACAA TTAAGGTAGA CTATGTACCA GCATATGACA CAAGTAAAAG AGTAAAAGAT
TCATTAACAA ATAATAATGA ACTTGCACAA ACAAGTGTGA GTGGAACATG GAAATAA
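The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before its length or GC content can be compared with the values in the record. The following is a minimal sketch of one way to do that; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the sample is only the first line of the listing, not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used here only as a short sample.
sample = "ATGAAAAAAT TCAAAAGACT CATCGCTATC GTAACAGTTC TCTTATTTGC TCTTTCAATA"
seq = clean_sequence(sample)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full listing instead of the sample should give a length and GC fraction that can be checked against the Gene Length and GC content fields of the record.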
 
Protein sequence
MKKFKRLIAI VTVLLFALSI IAPVFAQDEA TTEETAGSVY DQAAKILQDK GILKGNEQGD 
LMLDKQLTRA EILAMIIRAT GQEDVVKDYV YAEQSFTDVP QDHWAFAYVE AGKDLGIVNG
YPDGTFKPDK PVKFEELCKM LVAAKGESPA AGKWPLNYVR KALELGFFNG IEDEVGIGDV
VIRGQAAVAF ANAFFPPEKT IVVKDVKAVA NDTIEVYVDA YLGNEPATLE DGDVIPLDFE
IKDASDASKT IAVTLIDSQA SDFGAGKLVL KTAAQTEGAT YKLYYKGNDT GKTFVAVPVQ
LQVAKVEVPN LKQVVVTFNR DVKDVYAVDK NNYEIKVGDA TKSIGAVRLS EDKKKVTLIL
KDDLANQDKV KVTLKTGLGL KEAYTTEIGP VQDGTAPAIV KVTAENPQKL RVEFSEPVKN
YANPANYTLN GMYLVKEVKG FNEDSSIDPN VIVLNLYIPL NVGNNTLSVT GVTDIAGLLV
VNPSMSFSVA EDKSPIELKN VTATLSQVKL EFSKAIQSIV SVSLSNGIIA GTSVDGNVVT
ISSGDELATC IPVSGAKVTI EVKDYTGQTA KFEKYVVPTI DTERPTVKSV TIADSTTVKV
TFSEDVRVPN PSDNKIIVKD KDGNQKVISV ITWDTDSSGN QIKNTLKVVL SSQLPAGVVT
VQVSGIEDLT PLKNASLPQT VTATLSDTSA PDVVAPIVYN DVSGDTTELY ITFNKTLNAA
SANIATNYKY LDSSYVLKDF SGATASVLAN GKTVKLVIKD SEFANMTYLQ IIGVADTNGN
KATLAIQKDA TKFVSSSTAI VTIDSTNGVQ AVSTTQLKVF LTGNINEYTL YAGDFEVKAG
SNTIGVLYAT WDAGSKAVVL NLATAIGADA KKDGNGVTVT IKANSITKDL LGRSINSGSA
VGPVTASDKI APTITAVEAV YNATYNVTEV TVKFSEAIVV DSTYVLDQFK VYIGGAITNP
DAGVDVKTDN IVFKFSGDKR YSTIKVDYVP AYDTSKRVKD SLTNNNELAQ TSVSGTWK