Gene Athe_0398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0398 
Symbol 
ID7409333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp456257 
End bp458524 
Gene Length2268 bp 
Protein Length755 aa 
Translation table11 
GC content37% 
IMG OID643714787 
ProductKojibiose phosphorylase 
Protein accessionYP_002572305 
Protein GI222528423 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1554] Trehalose and maltose hydrolases (possible phosphorylases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00143976 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTTT CAGAAAAAAA CTGGCTGATT GAACAGGAAA GTTTTGGAGT TTCACATAAA 
CATGAAACCT GTTTTGCTCT TACAAATGGG TATATAGGAA TAAGGGGAAT CAATGAAGAG
GTTTTTTGTG ATGAGATACC AGGAACTTTC ATAGCAGGTG TATTTGACAA AGACACAGCT
CAGGTTACAG AACTTGTGAA TTTGCCAAAT CCAATAGGTC TTAGGATATA TATAAATAGA
GAATTTCTAA ATCCTTTAAA ATGTGAAGTG CTTGAGTTTA GAAGAATTTT AGACTTAAAA
CAGGGACTGC TTTTCCGAAA ATTGAGACTA AAAGATGAAA AGGGTAGAGT TACATCAATA
GAAGGATTCC GATTTGTCAG CATGAAAAAT AAAAATCTTA TTGTTCAAAA GTACAATGTG
GTTTGCGAAA ACTACTCAGC GGTTTTAAAT GTAGAAAGTT TTATCGATGC CAATACCATG
AACTCCAAGG ATATTCCAAA CGATAGAGTA AAACATTATG AAGTAGAAGA TAAAAAGGAT
TGCAAAAGCT GTATATTCCT TAGTATTACA ACAAAGGATA AAAGATACAA AGTGGGAATA
GCAAGTTCTA CAGAGGTTTT ATTAGATAGC CAGAAATGTT ATTTTAATAG ATTTGTAAAA
GATTTAGGAA GCGTTATTAC CGAAAACCTT GAGGTTGAGG CGAAAGAAGG AAAAAGTTAT
GAGATTGTAA AGCTGAGTGT ATTGGTGTCC TCGAGAGAAA ACGTTGAGGA TATTTTTAAA
AGTTCTATAA GCAAGCTTGA AAGAGCAAAA GAATTAGGTG TTGAGAGGCT GCTTTCTGAG
CATATAGAAG AGTATGACAA GCTTTGGGAT GCTGCCAAGG TTGAGGTAAT TGGTGATGAG
GTTGCAGATA GAAGTCTTAA ATTTAATGTT TTCCATCTAC TTAGTATGGC AAATCCAGAA
GATGAACATG TGAGCCTTGG TGCAAAAGGC CTTCATGGTG AAGGCTACAA AGGACATGTT
TTTTGGGATA CAGAGATATT TATGCTCCCG TTTTACATTT ACACAAATCC AAAAGCTGCA
AGGTCAATGC TGATGTACAG GTACAATCTT TTGGACGCTG CAAGGGAGAA CGCAAAGAAA
AATGGATACA AAGGTGCGCA GTTCCCCTGG GAGTCTGCAG ACACCGGTCA AGAGGAGACG
CCAAAGTGGG GGTATGATTA TCTTGGCAAA CCTGTTCGGA TATGGACAGG GGATATAGAA
TATCATATCT CATCAGATAT AGCCTTTGCA GTTTTAGAGT ATGTGCGTGC AACAGATGAT
ATAGAGTTTC TTTTAAACTA TGGTGTGGAA ATTGTGATTG AAACAGCAAG GTTTTGGGCT
TCTATTTGTA AATACAATGA AGAAAAGGAT AGATATGAAA TAAATGATGT GATAGGTCCG
GATGAGTTCC ATGAACATTG CAACAACAAT GCTTACACCA ATTATCTTGC AAAGTGGAAC
TTGGAGAAGG CCTTTGAACT TTTCACACGC TTAGAGGAAA ATTACCCCAG CCATTTTGAA
AGGTTAGTAA AAAAAATAAA CTTATCAGAA GATGAACCTT TGAACTGGCT AAAAGTTGCA
TCAAAGATTT ATATTCCATA CCATCCTGAA ACAAAGCTAA TTGAACAGTT TGAAGGATAT
TTTAGTCTTA AAGATTTTGT TATTAAAGAA TACGACAGCA ACAATATGCC AGTCTGGCCA
GAAGGTGTTG AGCTTGACAA GCTAAATAGC TATCAGCTCA TTAAACAGGC AGACGTTGTG
ATGCTTTTGT ATTTGCTTGG CGACCAGTTT GATGAAGAGG TTATGAAAAT AAACTATGAT
TACTATGAAA AAAGGACAAT GCACAAATCG TCTTTGAGCC CGAGTATCTA TGCTTTGATG
GGAGTAAGAG TGGGTGAGAC AAAAAGAGCA TATATAAATT TTATGCGTAC CGCTTTGACA
GACATTGAAG ACAATCAAGG CAACACAGCT TTGGGGATAC ACGCTGCGTC TTTGGGCGGC
ACATGGCAAG CTTTGATATT TGGTTTTGGA GGTTTAAAAG TGGAAAAAGA TGATGTTCTA
TCTGTCAATC CGTGGCTTCC TGAAAAATGG GAAGCTTTGA AATTTAGCAT CTGGTGGAAA
GGAAACTTGC TGGATTTTGT CATAACCCAG GAAAATGTTG AAATCAGAAA AAGAGTGAAC
AAGAGCAAAG TGAAAATCAT GATAAAAGAT AAAGAAATGG TCTTATAG
 
Protein sequence
MKLSEKNWLI EQESFGVSHK HETCFALTNG YIGIRGINEE VFCDEIPGTF IAGVFDKDTA 
QVTELVNLPN PIGLRIYINR EFLNPLKCEV LEFRRILDLK QGLLFRKLRL KDEKGRVTSI
EGFRFVSMKN KNLIVQKYNV VCENYSAVLN VESFIDANTM NSKDIPNDRV KHYEVEDKKD
CKSCIFLSIT TKDKRYKVGI ASSTEVLLDS QKCYFNRFVK DLGSVITENL EVEAKEGKSY
EIVKLSVLVS SRENVEDIFK SSISKLERAK ELGVERLLSE HIEEYDKLWD AAKVEVIGDE
VADRSLKFNV FHLLSMANPE DEHVSLGAKG LHGEGYKGHV FWDTEIFMLP FYIYTNPKAA
RSMLMYRYNL LDAARENAKK NGYKGAQFPW ESADTGQEET PKWGYDYLGK PVRIWTGDIE
YHISSDIAFA VLEYVRATDD IEFLLNYGVE IVIETARFWA SICKYNEEKD RYEINDVIGP
DEFHEHCNNN AYTNYLAKWN LEKAFELFTR LEENYPSHFE RLVKKINLSE DEPLNWLKVA
SKIYIPYHPE TKLIEQFEGY FSLKDFVIKE YDSNNMPVWP EGVELDKLNS YQLIKQADVV
MLLYLLGDQF DEEVMKINYD YYEKRTMHKS SLSPSIYALM GVRVGETKRA YINFMRTALT
DIEDNQGNTA LGIHAASLGG TWQALIFGFG GLKVEKDDVL SVNPWLPEKW EALKFSIWWK
GNLLDFVITQ ENVEIRKRVN KSKVKIMIKD KEMVL