Gene Athe_1104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1104 
Symbol 
ID7409661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1196600 
End bp1197916 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content38% 
IMG OID643715470 
Productglycoside hydrolase family 4 
Protein accessionYP_002572978 
Protein GI222529096 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCAAAA TAGCTATCAT AGGTGCAGGA AGTGGAGTTT TCACAAGGAA CTTGGTAAGA 
GACATTTTGT CATATCCAGA GCTAAGAAAC TCTACAATAG CGCTTATGGA CATTGACAGT
ATAAGGCTTG AATTTATGAG AAAAGCTCTG CAAAAGCTCA TTGAACAGGA AAAGTATCCT
ACTAAACTTG AAGCTACAAC TGATAGAAAA GAGGCTTTGA AAGGTGCAAA ATATGTTATT
GTCACAATAC AGATTGGAGG TTTAAAACCT TTCGAGTATG ACATTTACAT TCCTCTAAAA
TATGGTGTAA AACAGGCAGT TGGTGATACA ATAGGTCCGG GTGGAGTTTT CAGGGCTTTG
CGAACAATAC TGGTTTTACT TGACATTGCA AAGGACATGG AAGAGCTGTG TCCTGACGCA
CTTTTGCTCA ATTATGTAAA TCCAATGGCA ATGAATTGCT GGGCGTTAAA TAAGGCTACG
AATATAAAAA ATGTAGGGCT TTGTCACAGT GTTCAAGGAA CTGCTGAATT TTTAGCAAAA
ATTATTGGGG CAAAAATGGA AGAAATTTCA TACTTATGCG CAGGTATAAA CCATATGGCA
TGGTTTTTAA AATTTGAGTG GAATGGGAAA GATGCATATC CTCTTATAAG GGAAAAAGCA
AGTGACCCCG AAATCTATAC ACAGGATGTT ACAAAATTCG AAATACTAAA ACATTTTGGA
TATTATGTAA CAGAGTCAAG TTTTCACATG TCTGAATATG TTCCCTATTT TAGAAAGAGC
GACGATTGGA TAAATAAAAT CCATAGAACA CATTCATGGC ACAAAGAACA TTACAATGGT
ATGTATCTGC ACTGCTGTTT AGATGCTGCG AAAACTTTGC TTGAAGACCT GAGGAAAATG
GCAGAGGCAG ACTACATCGA CCCCAAGAGA AGTAACGAAT ACTGTGCAAC TATCATCCAT
TCCATAGAGA CAAATACTCC AGCTGTGATA AATGGTAATG TTGAAAACAA AGGTTTAATA
ACAAATCTAC CTGAAGGATG TTGCGTTGAA GTACCATGTT TGGTTGACAA AAATGGTATT
CAGCCAACTT ATGTTGGAAA TCTACCACCA CAGCTTGCAG CTTTGAACAG AACAAACATA
AACGTTCAAG AGCTAACTGT TCTTGCTGCT TTGACAGGTG ATAGGGAAGC AGTTTATCAT
GCAATTATGA TGGACCCTCT CACAAGTGCT GTTTTGGATT TAGACCAGAT ACGCCAGATG
GTAGATGAGA TGTTTGAAGC TGAAAAAGAA TGGCTGCCAG AAAAGTTTTA TAGGTAA
 
Protein sequence
MLKIAIIGAG SGVFTRNLVR DILSYPELRN STIALMDIDS IRLEFMRKAL QKLIEQEKYP 
TKLEATTDRK EALKGAKYVI VTIQIGGLKP FEYDIYIPLK YGVKQAVGDT IGPGGVFRAL
RTILVLLDIA KDMEELCPDA LLLNYVNPMA MNCWALNKAT NIKNVGLCHS VQGTAEFLAK
IIGAKMEEIS YLCAGINHMA WFLKFEWNGK DAYPLIREKA SDPEIYTQDV TKFEILKHFG
YYVTESSFHM SEYVPYFRKS DDWINKIHRT HSWHKEHYNG MYLHCCLDAA KTLLEDLRKM
AEADYIDPKR SNEYCATIIH SIETNTPAVI NGNVENKGLI TNLPEGCCVE VPCLVDKNGI
QPTYVGNLPP QLAALNRTNI NVQELTVLAA LTGDREAVYH AIMMDPLTSA VLDLDQIRQM
VDEMFEAEKE WLPEKFYR