Gene Athe_2370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2370 
Symbol 
ID7407789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2517530 
End bp2520610 
Gene Length3081 bp 
Protein Length1026 aa 
Translation table11 
GC content37% 
IMG OID643716734 
Productglycoside hydrolase family 2 TIM barrel 
Protein accessionYP_002574213 
Protein GI222530331 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0768675 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTTGACA AATCTTTTTA TCAAAATCCA AAGATTCAGC ATGTGAACAT GGCACAGCCA 
AGGTCATCAT TTGTTCCTTA TGCCAGTGTA GAAAATGCTT TGCTTGGTGA TTGGGAACTT
TCAGAATACT TGAGATTACT TAATGGAAAC TGGTATTTTA AGTTATTTGA TATGCCCTGT
AAAGTTGACC AAGAGATTAT AAAATCAGAT GCTAAATTTA CAGGATTTGA TAAAATTATT
GTTCCAAGTA ATTTTCAGCT TTTTGGTTAC GACAAACCAA TATACACTAA TACACGGTAT
CCCATCCCAG TTGACCCACC ATATGTACCT GATATTAATC CTACTGGGGT GTATAAGAAA
GAGATTTTTA TTTCAAAAGA GGATAAAGAA AAGAGAATAT TTTTAGTCTT TGAGGGTGTG
GATTGTGCTT TTTATTTATA TGTAAATCAG GAGTTTGCTG GATTTTCAAA AGTAAGCCAC
ATGATGCATG AGTTTGACAT CACAGACCTA TTGACTGAAG GTAAAAATAT TATAACAGTT
GCAGTTTTAA AATATGCAGA CAGTACATAT TTAGAAGACC AAGACAAGTG GCGAATGAGT
GGAATATTTA GAGATGTATA TTTACTTGTG CGCCCCAAGA TATATCTCAA AGACATTTAT
CTCAAGCCTG ACCTTAACGA TAACCTAACT AAAGGCAGTT TGGTAGCAGA AATCGAAATC
TCAAATTCAA ATGAAGAAAA ATCAGAATTT TATGTAGATG TACAAATATA CGCAGACAAA
GAACTACTAA AATCTGCCTC AAAACTTGCA AGCATTTTAC CAAAAACCAC TGAACAGTTC
AGATTTGAGT TCCAAATTGA AAAACCATTG CTTTGGAGTG CTGAAACGCC AAATCTCTAC
AAATTTATTG TGATTATAAA GGATACATCA GGTAAGATTT TGGAGGTCAT CCCTCAAAAC
TTTGGGTTTA GGAAAATTGA AATTAAAGAT GGCGTATTTT ACCTAAATAA TGTACCCATT
AAATTAAAAG GTGTCAACAG ACACGACATG CACCCAAGAG TTGGATTTGC AGTGACAAGA
AAGATGATGC AAGAGGACAT AACTTTGATG AAACAGCACA ATATAAACTG TGTAAGAACT
TCACACTATC CAAATCATCC TTATTTTTTG GAGCTGTGTG ACAAATTTGG GATTTACGTA
ATTGATGAAG CTGATTTAGA AACACATGGA TTTGGAGCAG TTGGTGACTG GGGACTTTTA
GCAAAAGACC CTGTGTGGGA AGATGCGTTT CTCGAAAGGG CAAAGATGAT GGTAATGAGG
GACAAAAACC ACCCATCAAT CATTATGTGG TCGCTTGGAA ATGAATCTGG CTATGGTCCA
AACCACGATA AAATGGCTCA GTGGATAAGG TCATACGATA ATAGCCGTCC TATTCATTAC
GAAAGTGCCC GTGATGCAGA GGTTGTGGAT ATTGTAAGTG TAATGTACCC TCCTGTAGAA
AAACTTGAAG AAGAGGGCAA AAAACAAGAA AAAAGGCCAT TTTTTATGTG TGAGTATGCA
CATGCAATGG GAAATGGTCC TGGAAACCTC AAAGAGTATT GGGATGTGAT ATATAAATAC
CCAAGGCTTT TAGGTGGATG TGTGTGGGAG TGGGCAGACC ACGGAATTTT GACAAAGACA
CCTGATGGGA AAGAGTACTA TGCCTACGGC GGGGATTTTG GAGATGAGCC AAATGATGGT
AACTTCTGTA TAGATGGACT TCTTTTCCCC GACAGAACTC CGTCGCCCGG GATGATTGAG
CTGAAAAAAG TTTATGAGCC AGTTATGATT GAACTTTTAG ATAAAAAAAG CGGAATTTTC
AAGGTAACAA ACAGGTATGA TTTTATATCT TTGAATCACA TTGAAGTTGA ATGGGAACTT
TTGTTAGGCG GCAGGGTTGT GAAAGAGGGC TTTGTTGATG TGAGCGATGT ACTGCCTCAC
TCTTCAAAAG AGGTCAAGAT TGATGAAGTC AAAGAAGTTT TAGAAAGTTG CAAGGAAGAG
CTTTTTATTA CCTTCACGGC AAAGCTCAAA AACTCAATGC CTTGGGCAAA AAGAGGTTTT
GTTATAACAA AATCTCAGAT TGCAATTAAA GAAGAAACAT CTCAAGATGC TGTGCAAAAA
ATTGAGAAGA TAAATGCCAT TTTATCAAAG CAAGACAGGT TTGAGGTTTG CAAGTTACAT
GACAAATTGG TAGTTTTCGC AGGTAACACA GAAGTAGAGT TTTGCCGATG GACAGGTGAT
TTGGTAAGCT TGAAACATAG TGACCTTGAG CTTATAAAAT CCTCACCAAG ATTTAATATT
TGGAGGGCTC CAACAGATAA TGATGTGCAT ATCAAAAACG AATGGATAAA AGCTGGATTT
GACAAGCTTC AAAGAAGAAT TGTAAATGTC AGCTTTGAAG AACACAGCCA GTACTTCAAA
GTACAAACAA CTTCGGTATA TGGCGCATAT TCAGTAAAAC CTGGATTTGA AGTAACCACA
AGCTACAAGG TTTTCAAATC GGGAATTGTA GAGACAAATG TGTATGCGCA AGCTCTCAGA
CAGCTTCCGC CACTTCCAAA GATAGGACTG CAATTTATGA TGCCAAAAGA GTTTGAGTAT
GTCAAATATT ATGGACGTGG ACCTCATGAA AACTATCCTG ATATAAAACA GAGCGCAACT
GTAGAAATAT ATGACATGGC TATAAAAGAC ATGTACGTTC CATATATAAT GCCCCAGGAA
TATGGAAACA GGTGCGACGT TAGATGGGCT TTTGTATACA ACATCTATGG AATAGGACTT
TGTATTAGAG GCATCCCTAC ATTTAACTTC AGTGCAAGAG AATACACTGA TGATGTGCTC
ACAAAAGCAA AACACTCATA TGAGCTGACA AAAGCAGATG GAATTGTTGT GAATGTTGAC
TTTAAGATTG GTGGTATCGG AAGCCAGAGC TGCGGTCCCG GCCCACTTGA GAAGTACTTA
GTCAAGGATG ACAAATTTGA ATTTTGCTTT TATATGATAC CTGTTGATAG CAATAGTTTA
GACGTTGAAA AGCTATGGTA A
 
Protein sequence
MLDKSFYQNP KIQHVNMAQP RSSFVPYASV ENALLGDWEL SEYLRLLNGN WYFKLFDMPC 
KVDQEIIKSD AKFTGFDKII VPSNFQLFGY DKPIYTNTRY PIPVDPPYVP DINPTGVYKK
EIFISKEDKE KRIFLVFEGV DCAFYLYVNQ EFAGFSKVSH MMHEFDITDL LTEGKNIITV
AVLKYADSTY LEDQDKWRMS GIFRDVYLLV RPKIYLKDIY LKPDLNDNLT KGSLVAEIEI
SNSNEEKSEF YVDVQIYADK ELLKSASKLA SILPKTTEQF RFEFQIEKPL LWSAETPNLY
KFIVIIKDTS GKILEVIPQN FGFRKIEIKD GVFYLNNVPI KLKGVNRHDM HPRVGFAVTR
KMMQEDITLM KQHNINCVRT SHYPNHPYFL ELCDKFGIYV IDEADLETHG FGAVGDWGLL
AKDPVWEDAF LERAKMMVMR DKNHPSIIMW SLGNESGYGP NHDKMAQWIR SYDNSRPIHY
ESARDAEVVD IVSVMYPPVE KLEEEGKKQE KRPFFMCEYA HAMGNGPGNL KEYWDVIYKY
PRLLGGCVWE WADHGILTKT PDGKEYYAYG GDFGDEPNDG NFCIDGLLFP DRTPSPGMIE
LKKVYEPVMI ELLDKKSGIF KVTNRYDFIS LNHIEVEWEL LLGGRVVKEG FVDVSDVLPH
SSKEVKIDEV KEVLESCKEE LFITFTAKLK NSMPWAKRGF VITKSQIAIK EETSQDAVQK
IEKINAILSK QDRFEVCKLH DKLVVFAGNT EVEFCRWTGD LVSLKHSDLE LIKSSPRFNI
WRAPTDNDVH IKNEWIKAGF DKLQRRIVNV SFEEHSQYFK VQTTSVYGAY SVKPGFEVTT
SYKVFKSGIV ETNVYAQALR QLPPLPKIGL QFMMPKEFEY VKYYGRGPHE NYPDIKQSAT
VEIYDMAIKD MYVPYIMPQE YGNRCDVRWA FVYNIYGIGL CIRGIPTFNF SAREYTDDVL
TKAKHSYELT KADGIVVNVD FKIGGIGSQS CGPGPLEKYL VKDDKFEFCF YMIPVDSNSL
DVEKLW