Gene Athe_0460 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0460 
Symbol 
ID7407538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp524264 
End bp526699 
Gene Length2436 bp 
Protein Length811 aa 
Translation table11 
GC content36% 
IMG OID643714848 
Productglycosyltransferase 36 
Protein accessionYP_002572365 
Protein GI222528483 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3459] Cellobiose phosphorylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTTTG GCTATTTTGA CGATGCTAAA AGAGAATATG TAATCACAAC ACCGCTTACT 
CCATATCCTT GGATAAACTA TCTTGGAATG AAAGATTTTC TATCTCTGAT TTCAAACCAC
GCTGGAGGCT ACTGTTTTTA TAAAGATGCA AGGCTTCGAA GAATAACAAG ATTTCGTTAT
AACAACGTTC CGCTTGATAT GGGTGGAAGA TATTTTTACA TAAAAGATGG GGATGATGTC
TGGTCACCCT CATGGATGCC AACAAGAAAA GACCTTGAAT TTTACCAGTG CAGGCATGGT
CTTGGTTATA CAATTATCAC GGGCAGAAGA AATGGTATTG AGGTTGAACA AAGGTTTTTT
GTTCCTGTTG ATGAAAACTG TGAGATACAT CATTTGAGAA TTACTAACAA GACAAATACT
AAAAAAGAAA TTAAACTTTT CTCTTTAATC GAATTTTGTC TGTGGAACGC ACTTGACGAT
ATGACAAACT TCCAGAGAAA CTTTTCAACA GGTGAGGTTG AAATTGAAGG TTCTGTAATC
TACCATAAGA CTGAATACAG AGAAAGAAGA AATCATTTTT CTTTTTACTC AGTAAATGTT
CCTATTTCTG GGTTTGATAC AGACAGAGAT ACTTTTCTTG GGCTTTACAG AGGATTTGAA
AACCCTGCAG CTGTTGAATT AGGAAAAAGT TTTAATTCAG AAGCTCATGG CTGGTCACCA
ATTGCATCTC ACATGATTGA AATTAGCCTT TTGCCAGAAG AAACAAAAGA GCTTGTATTT
GTTCTTGGCT ATGTTGAAAA TGAACCTGAA AAGAAGTGGT TTAAAAAAGG CGTTATAAAT
AAAGAAAAGG CGTATAAGAT GATTGAAAAG TTCTCAAAAC CAGAAGATGT AAATGCTGCA
TTTGAAAAAT TGAAAGAATT CTGGGATGGG CTTTTGGACA AGTTCAATGT ATCAACAGGA
ATTGACAAAG TAGATAGAAT GGTAAATATA TGGAATCAAT ATCAATGCAT GGTTACATTT
AACCTCTCCA GAAGTGCATC ATATTTTGAA TCCGGAATTG GTAGAGGAAT GGGATTTAGA
GATTCAAACC AGGATATTCT TGGTTTTGTT CACCAGATCC CAGAGCGTGC AAGAGAAAGA
ATCTTAGACT TAGCTGCAAC CCAGTTAGAA GATGGTGGGG CATACCACCA ATATCAGCCA
CTTACAAAGA GAGGGAACAA TGAAATTGGT GGAAACTTCA ATGATGACCC TTTGTGGCTG
ATACTTTCAA CAGTTCACTA TATAAAAGAA ACTGGAGACT GGTCAATCCT TGACGAAGTA
GTACCATTTG AAAATAACCC TGAAAAAGTG GGAACACTGT TTGAACATCT GAAAAGAGCA
TTTTATCATG TAGTCAACAA CTTAGGCCCA CATGGACTTC CTCTTATCGG CAGGGCTGAC
TGGAATGACT GTTTGAACCT CAATGCTTTT TCGACAAACC CTGATGAATC GTTCCAGACA
TGTGACAACA AAGATGGCAA AACTGCTGAA TCGGTTATGA TTGCAGGGAT GTTTGTATAT
GTTGGAAAGG AATTTGTAAA GATTTGTGAA AGGTTGGGTA AAGAAGATAT TGCAAAAGAT
GCACAATACC ACATTGAAAA GATGAAAGAG GCAATTTTAA ATTATGGTTA CGATGGCGAG
TGGTTTTTAA GAGCATATGA CTACTTTGGA AACAAAGTTG GAAGCAAAGA AAATGATGAG
GGTAAGATAT TTATCGAAAC ACAAGGTTTT TGTGTTATGG CACAAATAGG ACTTGATGAT
GGAAAAGCTA TCTCTGCACT TGATTCTGTT AAAAAATATC TTGACACAGA ACATGGAATT
GTACTTGTTC AGCCGGCATT TACTGAGTAT AAAATTCATT TAGGAGAGAT TACAAGCTAT
CCACCCGGCT ATAAGGAAAA TGCAGCAGTT TTTTGTCACA ACAACCCATG GATTATGATT
GCAGAATGTA TAGTTGGAAG AGGAGACAGA GCATTTGAAT ACTGGTCAAA GATTGCTCCA
TCGTACAGAG AAGATATAAG TGAAATTCAT AAGCTTGAAC CATATGTATA CTGCCAGATG
ATTGCTGGAA AAGATGCATA CAAACCGGGA GAGGCAAAAA ATTCATGGCT GACAGGTTCT
GCCGCATGGA ATTTTGTTGC AATGACACAG TGGATTTTAG GAATAAGACC TGACTTTGAT
GGACTTTTAA TAGATCCTTG TATACCAAAA GAATGGAATG GATTTACTGT AAAGAGAGTG
TTCAGAAATG CAGTTTACAA TATAAAAGTA AAAAACCCTG ATGGTGTTTC AAAAGGTATA
AAAAAAGTTG TGGTTGATGG TAAAGAAATG TCTTCTAATT TAATACCAGC TTTCTCAGAT
GGCAAAGAGC ATTTTGTTGA AGTGATAATG GGATAG
 
Protein sequence
MKFGYFDDAK REYVITTPLT PYPWINYLGM KDFLSLISNH AGGYCFYKDA RLRRITRFRY 
NNVPLDMGGR YFYIKDGDDV WSPSWMPTRK DLEFYQCRHG LGYTIITGRR NGIEVEQRFF
VPVDENCEIH HLRITNKTNT KKEIKLFSLI EFCLWNALDD MTNFQRNFST GEVEIEGSVI
YHKTEYRERR NHFSFYSVNV PISGFDTDRD TFLGLYRGFE NPAAVELGKS FNSEAHGWSP
IASHMIEISL LPEETKELVF VLGYVENEPE KKWFKKGVIN KEKAYKMIEK FSKPEDVNAA
FEKLKEFWDG LLDKFNVSTG IDKVDRMVNI WNQYQCMVTF NLSRSASYFE SGIGRGMGFR
DSNQDILGFV HQIPERARER ILDLAATQLE DGGAYHQYQP LTKRGNNEIG GNFNDDPLWL
ILSTVHYIKE TGDWSILDEV VPFENNPEKV GTLFEHLKRA FYHVVNNLGP HGLPLIGRAD
WNDCLNLNAF STNPDESFQT CDNKDGKTAE SVMIAGMFVY VGKEFVKICE RLGKEDIAKD
AQYHIEKMKE AILNYGYDGE WFLRAYDYFG NKVGSKENDE GKIFIETQGF CVMAQIGLDD
GKAISALDSV KKYLDTEHGI VLVQPAFTEY KIHLGEITSY PPGYKENAAV FCHNNPWIMI
AECIVGRGDR AFEYWSKIAP SYREDISEIH KLEPYVYCQM IAGKDAYKPG EAKNSWLTGS
AAWNFVAMTQ WILGIRPDFD GLLIDPCIPK EWNGFTVKRV FRNAVYNIKV KNPDGVSKGI
KKVVVDGKEM SSNLIPAFSD GKEHFVEVIM G