Gene Athe_0227 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0227 
Symbol 
ID7407218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp272944 
End bp275385 
Gene Length2442 bp 
Protein Length813 aa 
Translation table11 
GC content39% 
IMG OID643714627 
Productglycoside hydrolase family 2 sugar binding 
Protein accessionYP_002572150 
Protein GI222528268 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGAATAA ATCTTGACGG AAAATGGAAG TTCAGAGAAG TTGGTCAGAA TGAGTACTAC 
GAGGCAAGTG TGCCAGGATG TGTCCAGCTT GATTTGATTA ACCTTGGAAA GCTTCCCGAC
CCTTTTTATG CCACAAACGA GGTTTTGTTT TATGACTTAG AAGAAAAAGA TTTTGAGTAT
GTAAAAGAGT TTGATGTTGA TAATGTCGAT TTTCAGGTCA AAAAGCTTGT GTTCGAAGGA
ATTGATACGG TATCTGAGAT TTATTTAAAT GACCATTATT TGGGTAAGAC CGACAACATG
TTTTTGAAGT ATGAATTTGA TGTGAGTCTT GCACTTAAAA AAGGGAAAAA CATTTTAAAG
GTGGTCCTTC TTTCACCAAT AAAAGAAGCA GAAAGGCTCA AAAGCATTTA CCAGTCAAGC
TACAGCTATC CGCAAAGAAG CTGGATAAGA AAGTCGCAGT ATTCATACGG CTGGGACTGG
GGGCCAAGAA TTCTCCAGAT AGGAATCTGG AAGAGCGTGT ATTTAGAGCT TCACAATGGG
CTTGAAATTC AAGATGAGTT TGTAAAAGTG GAAAGTATCT CAGATGAGCT TGCCATTGTA
AGAGTGTTTG CAAAGATAAA CTGTTTTGAA AAACCATCAG AAGTAGAGAT AGAGTTATTT
GATGGTAGCT TTTCTGTGAA AGTTTTTCCA GAGGTTTATA AATCAAAAGA TGGGTATTTT
ATTGATGAGA GGATTGAGAT AGAAAACCCC AAACTTTGGT GGCCAAACGG GTATGGGGAG
CCTTCTTTAT ATGAGTTTAA AATAACTGCA AAGACTTCAA ATGAAGCTCA AGAAAAGAAG
GTCACAACAG GCCTCAGGAC TGTAAGAGTA ATAAAGGAAA AGGATGAATA TGGCGAAAGT
TTTATCTTTG AGATAAATGG CAAGAAGATT TTTGCAAAAG GAGCAAACTG GATACCTGCA
GATTCCATCC TGCCAAGGCT GAAAGAAGAT GACTATAAAG AATTAATTAA AATGGCAAAA
GATGCGAACA TGAACATGCT CAGGGTCTGG GGCGGCGGTA TTTATGAGTA TGACTGGTTT
TACGACGAAT GTGACAAAAA CGGTATAATG GTGTGGCAGG ATTTCATGTT TGCATGCGCA
ATCTACCCTG ATGAGTTTGA CTTTTTCGTT GAGAATTTCA TAAAGGAGGC AGAGTACCAG
ATAAAGAGGC TCAGAAACCA TCCGTGCATT GTGCTGTGGT GTGGGAATAA CGAAAACAAC
TGGGGCTTTA GAGATTGGTG GCACATCGGC GATCCAGAGT TTTTGGGCAA CAGGATATAC
AAAAAGGTGC TTCCTGAAAT TCTGGCAAAA CTTGACCCAA CAAGACCGTA CCACATCTCA
AGCCCGTATG GAGGTGAGCA TCCAAACAGC GAAAAAGCTG GCGACAAACA TACATGGGAT
ATCTGGGCTG GCTGGAAAGA CTACATCTAT TATAAGCACG ACAATGCAAG GTTTGTCTCT
GAGTTTGGTT TTCAGGCGGC AGCACACCTT GATACAATGA AAAAGTACAT TCCTCTTAAA
GACCAAACAA TCTTCTCAAA AACTTTGAGA ATGCACGAAA AGCAGGAAGA AGGTTTGGAA
AGGCTTATAA GGTATATGGC AGGCTCAATT GGTCTACCAA AAGACTTTGA CTCTTTTGTG
TATTTGTCAC AGTTTGTTCA AAAAGAGGCT ATTAAGCTTG CGGTTGAGCA TTACAGGAAG
AATAAGTTCG CAACAGCAGG GGCTCTTTAC TGGCAGCTTA ATGACTGCTG GCCTGTCATC
TCATGGTCGT CAATTGACTA TCTGAAAAGA AGAAAAGCAC TTTACTATGA GTCAAAAAGG
ATATTTGCAA AGTTTTTGCC AGTGGTTGAG TATGAGGATG GAAAACTAAA GGTATATATT
GTTAGTGATG AGCTAGAGCC AAAACAAGGT AAGCTCAATA TTACAATCTG GAACTTTGAT
GGGCAAAAGT TATACGAGAA AAACCTAACT GTGGAAATTC CTGAAAATAG TGTGGTTGAG
GCATTTTCTG AAAAAGTAGA AAACTTGAAT ATTTTAAAGG GCGAGTGGTT GTATATACCC
AAACATGTTG AAACGGCTGT AATTGGGGAT AAGATAGACA GAGGACTTTT GGAAAGCATA
GTTTTTGTGA GCCTTTTTGT CGATAGAGTA GAGTACGAAA ATTACTTTGT ATTTGAAAAG
CCAATAAACC TTGAACTAAA ACCCAGTCAG TTTGAGTACA GAATAGAAGA TGACCATATT
ATAATAAAAC CCAAAACTCC TGCAATTTGC CTTATAATTG AAGCTGACAG GGATGTAGAG
AACAACTTCA TTTTTGCGAG ACCTGAAAAA GAGTATAAGA TTAATCTAAA TGGAGGACAG
GTCAGAAAGG TTTGTGATTT GTTAGATTTG ATTGAGCGAT GA
 
Protein sequence
MRINLDGKWK FREVGQNEYY EASVPGCVQL DLINLGKLPD PFYATNEVLF YDLEEKDFEY 
VKEFDVDNVD FQVKKLVFEG IDTVSEIYLN DHYLGKTDNM FLKYEFDVSL ALKKGKNILK
VVLLSPIKEA ERLKSIYQSS YSYPQRSWIR KSQYSYGWDW GPRILQIGIW KSVYLELHNG
LEIQDEFVKV ESISDELAIV RVFAKINCFE KPSEVEIELF DGSFSVKVFP EVYKSKDGYF
IDERIEIENP KLWWPNGYGE PSLYEFKITA KTSNEAQEKK VTTGLRTVRV IKEKDEYGES
FIFEINGKKI FAKGANWIPA DSILPRLKED DYKELIKMAK DANMNMLRVW GGGIYEYDWF
YDECDKNGIM VWQDFMFACA IYPDEFDFFV ENFIKEAEYQ IKRLRNHPCI VLWCGNNENN
WGFRDWWHIG DPEFLGNRIY KKVLPEILAK LDPTRPYHIS SPYGGEHPNS EKAGDKHTWD
IWAGWKDYIY YKHDNARFVS EFGFQAAAHL DTMKKYIPLK DQTIFSKTLR MHEKQEEGLE
RLIRYMAGSI GLPKDFDSFV YLSQFVQKEA IKLAVEHYRK NKFATAGALY WQLNDCWPVI
SWSSIDYLKR RKALYYESKR IFAKFLPVVE YEDGKLKVYI VSDELEPKQG KLNITIWNFD
GQKLYEKNLT VEIPENSVVE AFSEKVENLN ILKGEWLYIP KHVETAVIGD KIDRGLLESI
VFVSLFVDRV EYENYFVFEK PINLELKPSQ FEYRIEDDHI IIKPKTPAIC LIIEADRDVE
NNFIFARPEK EYKINLNGGQ VRKVCDLLDL IER