Gene Athe_0143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0143 
Symbol 
ID7408505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp180621 
End bp182699 
Gene Length2079 bp 
Protein Length692 aa 
Translation table11 
GC content37% 
IMG OID643714547 
Productalpha amylase catalytic sub domain protein 
Protein accessionYP_002572070 
Protein GI222528188 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0052779 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGCTT TAAAAAAGCT TGTTGAGATT CTGAATAATA AAGCAAAAGA GTGGGATGGA 
AAAAACGATT TCAGAGTACC AAAACTTTGG GATAGTTTCG GCTATGATGG GTTTGAGAAA
AAAGAAAACA ATGATGGCAC AATCTCAGTA AACCCTTATA AGTTTGTTGC ACAGGCGATA
GAAAAAGCAA TTTTGCCTTG TATGAAAAAT AATACTGATT ATCTTCAACC ACTTTCAGAA
ATACTTTCTA AAGATAATAG ACCTTCAGAA AAAAGTTATA GCAGCTGGAT TGAAAAGAGC
AGTGTATATG GTATGCAAAT TCGTACATTT TCAGCCTGGG ATCATGATGG TTCAAAAGAA
CTTGAACTTG AAAATAGCTT CGGTCTAAAA GATACAGGTA CATTTGTTAA AACAATTGCA
CTTTTACCAT ACATAAAAAG AATGGGCTTT GATGCTATTT ATACACTTCC AATTACCAAA
AACAGCACAA AGTATAAAAA AGGTGAGATG GGTTCACCTT ATGCAGTCAA GAACTTTTTT
GAACTTGACC CAGTGCTTTT TGACCCTATG GCTGATGAAC TTTCAATAGA TGAACAATTT
AAAGCTTTGA TTGAAGCGTG CCATATTCTT GACATAAGAT TTATAATTGA TATTATTCCA
CGCACATCTG CGAGAGATTC TGATTTCATA CTGGAACATC CTGATTGGTT TTACTGGATT
AAAGTAGAGG ATTTGAAAGA CTACGGTCCG CCAAAGCTGA CTTTGATAAA AGAGTTTACA
AAGGCAAATG AAGAAAACAT TGAGCTTATA TACAAAGACC CGGCTGTGCA AAAGCATATT
AAAAAGTTTG TACCATCACC TGACAGGTTT GACCCTCAAA AGTGGCAGAG TATAAAAGAA
AGGTGCGAAA AAGACAAAAA CCTTGACTTT TTCGAGCTAA TTGAAAAAGA GATTGGGATA
ACTACTGCAC CAGCTTTTTC AGATTGTCTT AATGACCCGC AGCCACCTTG GACAGATGTG
ACATATTTGA GGCTTTATCT TGACCATCCT GTACAGTCTG CAAAGTATGT AGATGAAAGC
CAGCCGCCGT ATATTCTTTT TGATGTTATA AGAGGAAATA TCTTCAAAGG TAAAAAGCCA
AACAAAAAAC TTTGGGAAAA AATCGCAAAC ATCATTGTAT ACTACCAGAA GAACTTTGGC
ATAGATGGTG CAAGGATTGA CATGGGTCAT GCTCTTCCAA AAGAACTGGA AGACATGATA
ATTTCAAATG CCAGAAAAGT AGACACCGAA TTTTGCTTTA TAGCAGAAGA GCTTTCGATG
AATGCGCACA GAAAAGCAAA AGAGTCAGGG TATGACATGA TAATAGGTGA TCTTTGGGCA
AGAGAACCAC GATACTATCA AGGAACGCTG AAAAAGGTTT TGGATATACT TTTGCAGCTT
GAAGTCCCTG TTTTTGCGGC ATGTGAAATC CCAGACAGTC CGCGCGCAGC TTCAAGGCTT
GGAAAAAGAG AATTTTCACG GTTTGCGACA GTGCTCAATT TCTTTTTACC AAACAGTGTG
CCTTTTGTCA CGTGCGGGCA GGAAGTTTAT GAAGTTCAGC CAATGAACTT AGGTCTTGAC
CCGCAACCAG AAGGCAAATT TATGCTTTCT AAATCAGACC CTCTTTATGG GAAACTTGCG
TTTTTTGACA AGTACGCTCT TCACTGGACA AACGAAGGTG CAGATGAAAT GATAGCCTTA
ATTGAAGGAG TAGCTAAGAT AAGAAAAGAA TATTTGGACT TTATGAGCAA AGAAAACTTT
TTCAAGATTC CTCACAACAG CAAATTTGTC TTGGCATTTG GATACAAACT TTTTTCTGAA
CATGGGAAGC AGTACCTGAT TGTAGTTGCA AACGCTGATA TTCTGCGAAA AAGAAAAGTC
AGTTTGAATT TGATGAAAGC AGGTTTATTT GATGTGGGGA AAGGAAGACA AATTGAGTGT
ATCTACTCTC TAAAAGGAAA TAAAAATCAT ATGTACGATT TTCCACACAT TACAGTTGAT
ATGGAAACTC TTGATATAAA AATTTTAAAG GTAAAATAA
 
Protein sequence
MSALKKLVEI LNNKAKEWDG KNDFRVPKLW DSFGYDGFEK KENNDGTISV NPYKFVAQAI 
EKAILPCMKN NTDYLQPLSE ILSKDNRPSE KSYSSWIEKS SVYGMQIRTF SAWDHDGSKE
LELENSFGLK DTGTFVKTIA LLPYIKRMGF DAIYTLPITK NSTKYKKGEM GSPYAVKNFF
ELDPVLFDPM ADELSIDEQF KALIEACHIL DIRFIIDIIP RTSARDSDFI LEHPDWFYWI
KVEDLKDYGP PKLTLIKEFT KANEENIELI YKDPAVQKHI KKFVPSPDRF DPQKWQSIKE
RCEKDKNLDF FELIEKEIGI TTAPAFSDCL NDPQPPWTDV TYLRLYLDHP VQSAKYVDES
QPPYILFDVI RGNIFKGKKP NKKLWEKIAN IIVYYQKNFG IDGARIDMGH ALPKELEDMI
ISNARKVDTE FCFIAEELSM NAHRKAKESG YDMIIGDLWA REPRYYQGTL KKVLDILLQL
EVPVFAACEI PDSPRAASRL GKREFSRFAT VLNFFLPNSV PFVTCGQEVY EVQPMNLGLD
PQPEGKFMLS KSDPLYGKLA FFDKYALHWT NEGADEMIAL IEGVAKIRKE YLDFMSKENF
FKIPHNSKFV LAFGYKLFSE HGKQYLIVVA NADILRKRKV SLNLMKAGLF DVGKGRQIEC
IYSLKGNKNH MYDFPHITVD METLDIKILK VK