Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0398 |
Symbol | |
ID | 7409333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 456257 |
End bp | 458524 |
Gene Length | 2268 bp |
Protein Length | 755 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643714787 |
Product | Kojibiose phosphorylase |
Protein accession | YP_002572305 |
Protein GI | 222528423 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1554] Trehalose and maltose hydrolases (possible phosphorylases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00143976 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTTT CAGAAAAAAA CTGGCTGATT GAACAGGAAA GTTTTGGAGT TTCACATAAA CATGAAACCT GTTTTGCTCT TACAAATGGG TATATAGGAA TAAGGGGAAT CAATGAAGAG GTTTTTTGTG ATGAGATACC AGGAACTTTC ATAGCAGGTG TATTTGACAA AGACACAGCT CAGGTTACAG AACTTGTGAA TTTGCCAAAT CCAATAGGTC TTAGGATATA TATAAATAGA GAATTTCTAA ATCCTTTAAA ATGTGAAGTG CTTGAGTTTA GAAGAATTTT AGACTTAAAA CAGGGACTGC TTTTCCGAAA ATTGAGACTA AAAGATGAAA AGGGTAGAGT TACATCAATA GAAGGATTCC GATTTGTCAG CATGAAAAAT AAAAATCTTA TTGTTCAAAA GTACAATGTG GTTTGCGAAA ACTACTCAGC GGTTTTAAAT GTAGAAAGTT TTATCGATGC CAATACCATG AACTCCAAGG ATATTCCAAA CGATAGAGTA AAACATTATG AAGTAGAAGA TAAAAAGGAT TGCAAAAGCT GTATATTCCT TAGTATTACA ACAAAGGATA AAAGATACAA AGTGGGAATA GCAAGTTCTA CAGAGGTTTT ATTAGATAGC CAGAAATGTT ATTTTAATAG ATTTGTAAAA GATTTAGGAA GCGTTATTAC CGAAAACCTT GAGGTTGAGG CGAAAGAAGG AAAAAGTTAT GAGATTGTAA AGCTGAGTGT ATTGGTGTCC TCGAGAGAAA ACGTTGAGGA TATTTTTAAA AGTTCTATAA GCAAGCTTGA AAGAGCAAAA GAATTAGGTG TTGAGAGGCT GCTTTCTGAG CATATAGAAG AGTATGACAA GCTTTGGGAT GCTGCCAAGG TTGAGGTAAT TGGTGATGAG GTTGCAGATA GAAGTCTTAA ATTTAATGTT TTCCATCTAC TTAGTATGGC AAATCCAGAA GATGAACATG TGAGCCTTGG TGCAAAAGGC CTTCATGGTG AAGGCTACAA AGGACATGTT TTTTGGGATA CAGAGATATT TATGCTCCCG TTTTACATTT ACACAAATCC AAAAGCTGCA AGGTCAATGC TGATGTACAG GTACAATCTT TTGGACGCTG CAAGGGAGAA CGCAAAGAAA AATGGATACA AAGGTGCGCA GTTCCCCTGG GAGTCTGCAG ACACCGGTCA AGAGGAGACG CCAAAGTGGG GGTATGATTA TCTTGGCAAA CCTGTTCGGA TATGGACAGG GGATATAGAA TATCATATCT CATCAGATAT AGCCTTTGCA GTTTTAGAGT ATGTGCGTGC AACAGATGAT ATAGAGTTTC TTTTAAACTA TGGTGTGGAA ATTGTGATTG AAACAGCAAG GTTTTGGGCT TCTATTTGTA AATACAATGA AGAAAAGGAT AGATATGAAA TAAATGATGT GATAGGTCCG GATGAGTTCC ATGAACATTG CAACAACAAT GCTTACACCA ATTATCTTGC AAAGTGGAAC TTGGAGAAGG CCTTTGAACT TTTCACACGC TTAGAGGAAA ATTACCCCAG CCATTTTGAA AGGTTAGTAA AAAAAATAAA CTTATCAGAA GATGAACCTT TGAACTGGCT AAAAGTTGCA TCAAAGATTT ATATTCCATA CCATCCTGAA ACAAAGCTAA TTGAACAGTT TGAAGGATAT TTTAGTCTTA AAGATTTTGT TATTAAAGAA TACGACAGCA ACAATATGCC AGTCTGGCCA GAAGGTGTTG AGCTTGACAA GCTAAATAGC TATCAGCTCA TTAAACAGGC AGACGTTGTG ATGCTTTTGT ATTTGCTTGG CGACCAGTTT GATGAAGAGG TTATGAAAAT AAACTATGAT TACTATGAAA AAAGGACAAT GCACAAATCG TCTTTGAGCC CGAGTATCTA TGCTTTGATG GGAGTAAGAG TGGGTGAGAC AAAAAGAGCA TATATAAATT TTATGCGTAC CGCTTTGACA GACATTGAAG ACAATCAAGG CAACACAGCT TTGGGGATAC ACGCTGCGTC TTTGGGCGGC ACATGGCAAG CTTTGATATT TGGTTTTGGA GGTTTAAAAG TGGAAAAAGA TGATGTTCTA TCTGTCAATC CGTGGCTTCC TGAAAAATGG GAAGCTTTGA AATTTAGCAT CTGGTGGAAA GGAAACTTGC TGGATTTTGT CATAACCCAG GAAAATGTTG AAATCAGAAA AAGAGTGAAC AAGAGCAAAG TGAAAATCAT GATAAAAGAT AAAGAAATGG TCTTATAG
|
Protein sequence | MKLSEKNWLI EQESFGVSHK HETCFALTNG YIGIRGINEE VFCDEIPGTF IAGVFDKDTA QVTELVNLPN PIGLRIYINR EFLNPLKCEV LEFRRILDLK QGLLFRKLRL KDEKGRVTSI EGFRFVSMKN KNLIVQKYNV VCENYSAVLN VESFIDANTM NSKDIPNDRV KHYEVEDKKD CKSCIFLSIT TKDKRYKVGI ASSTEVLLDS QKCYFNRFVK DLGSVITENL EVEAKEGKSY EIVKLSVLVS SRENVEDIFK SSISKLERAK ELGVERLLSE HIEEYDKLWD AAKVEVIGDE VADRSLKFNV FHLLSMANPE DEHVSLGAKG LHGEGYKGHV FWDTEIFMLP FYIYTNPKAA RSMLMYRYNL LDAARENAKK NGYKGAQFPW ESADTGQEET PKWGYDYLGK PVRIWTGDIE YHISSDIAFA VLEYVRATDD IEFLLNYGVE IVIETARFWA SICKYNEEKD RYEINDVIGP DEFHEHCNNN AYTNYLAKWN LEKAFELFTR LEENYPSHFE RLVKKINLSE DEPLNWLKVA SKIYIPYHPE TKLIEQFEGY FSLKDFVIKE YDSNNMPVWP EGVELDKLNS YQLIKQADVV MLLYLLGDQF DEEVMKINYD YYEKRTMHKS SLSPSIYALM GVRVGETKRA YINFMRTALT DIEDNQGNTA LGIHAASLGG TWQALIFGFG GLKVEKDDVL SVNPWLPEKW EALKFSIWWK GNLLDFVITQ ENVEIRKRVN KSKVKIMIKD KEMVL
|
| |