Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2229 |
Symbol | |
ID | 7407648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2362287 |
End bp | 2363957 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643716595 |
Product | ribulokinase |
Protein accession | YP_002574074 |
Protein GI | 222530192 |
COG category | [C] Energy production and conversion |
COG ID | [COG1069] Ribulose kinase |
TIGRFAM ID | [TIGR01234] L-ribulokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00201608 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAAGT TCAGTATAGG AATTGATTTT GGAACACAGT CAGGAAGGGC AGTGCTTGTT AATGTTGAGA CAGGTGAAGA GGTTGCAACA AGTGTGAAAG AGTACACTCA TGGAGTTATG GACGAGAGCT TGCCAGATGG TACAAAACTT CCTCATGACT GGGCACTTCA ACATCCACAG GATTATATAG AAGTCTTGGC AACAACTGTT CCAGATGTCT TGAAAAAAGC AGGAGTTTCC AAAGACGATG TGATTGGAAT AGGAATTGAC TTTACAGCCT GTACAATGCT TCCTATCAAA AAGGATGGGA CACCGCTTTG TGAGCTTCCC AAGTTCAAAT CAAATCCTCA TGCCTATGTT AAGCTGTGGA AACACCATGC TGCTCAAAAG TATGCAAATA GGCTAAATAG AATAGCTCAA GAAAGAGGAG AGAAGTTTTT ACAAAGATAT GGCGGAAAGA TTTCATCCGA ATGGCTATTC CCCAAAATCA TGCAGATTTT AGAAGAAGCA CCAGAAGTCT ATGAAGAGGC AGACAAATTT ATAGAAGCAG CTGACTGGAT TGTGTTTAAG ATGACAGGGG TTGAGAAGAG AAACTCATGT ACAGCAGGAT ATAAAGCTAT CTGGAGCAAG AGGGAAGGGT ATCCTTCAAA AGAGTTTTTC AAAGCACTTC ATCCAAGGCT TGAAAATGTT GTTGATGAAA AGCTATCACG CGAGATATAC CCGATTGGGC AAAAAGCTGG TGAGCTCACA GAAGAGATGG CAAAGCTTAT GGGATTAAAT CCTGGGACAG CTGTTGCAAT TGCAAATGTT GATGCTCATG TGTCAGTTCC TGCTGTTGGG ATTACTGATA TTGGCAAGAT GCTGATGATA ATTGGAACAT CTACATGCCA CATGCTTTTG TGGAATGAAG AGAAGATGGT ACCAGGAATT TGCGGATATG TTGAAGATGG AATTTTACCT GGTTTTTATG GGTATGAGGC AGGACAGAGC TGTGTTGGAG ATCATTTTGA GTGGTTTGTT GAAAACTGCG TGCCAGCTCA ATATCACGAT GAGGCAAAAC AAAAAGGACT CAACATATAT CAGCTACTCA AAGAAAAAGC AAAGGCTTTA AAGCCTGGCC AGAGCGGTCT TTTGGCGCTT GACTGGTGGA ATGGCAACAG GTCAATTCTG GTTGATGCAG ACCTCACTGG TATGATGCTT GGCATGACCT TGACTACAAA GCCAGAGGAG ATGTACAGAG CACTGATTGA GGCAACGGCA TATGGCACAA AGATAATAAT TGATAACTTC AATGAGCACG GTGTTGAAGT AAGAGAGCTT TATGCGTGTG GAGGAATTGC TGAAAAAGAT GAGCTTTTGA TGCAGATTTA TGCTGATGTG ACAGGGCTTG AAATAAAGGT TTCTGCATCA CCTCAAACAC CGGCACTTGG TTCTGCAATG TTTGGTGCTG TTGCTGCAGG AAAAGAAAGA GGTGGTTATG ATAGCATATT TGAAGCTGCA AAGAAAATGG CAAAACTTAA AGACTATTCT TATAAACCAA ATCCGCAAAA CCATGAGATT TACAAAAAGC TATACAGAGA ATACAGAATA CTTCATGACT ATTTTGGAAG AGGAGCAAAT GATGTAATGA AGAGATTAAA AGAGATAAAA GATGAAGTTT CGAAAATATA A
|
Protein sequence | MAKFSIGIDF GTQSGRAVLV NVETGEEVAT SVKEYTHGVM DESLPDGTKL PHDWALQHPQ DYIEVLATTV PDVLKKAGVS KDDVIGIGID FTACTMLPIK KDGTPLCELP KFKSNPHAYV KLWKHHAAQK YANRLNRIAQ ERGEKFLQRY GGKISSEWLF PKIMQILEEA PEVYEEADKF IEAADWIVFK MTGVEKRNSC TAGYKAIWSK REGYPSKEFF KALHPRLENV VDEKLSREIY PIGQKAGELT EEMAKLMGLN PGTAVAIANV DAHVSVPAVG ITDIGKMLMI IGTSTCHMLL WNEEKMVPGI CGYVEDGILP GFYGYEAGQS CVGDHFEWFV ENCVPAQYHD EAKQKGLNIY QLLKEKAKAL KPGQSGLLAL DWWNGNRSIL VDADLTGMML GMTLTTKPEE MYRALIEATA YGTKIIIDNF NEHGVEVREL YACGGIAEKD ELLMQIYADV TGLEIKVSAS PQTPALGSAM FGAVAAGKER GGYDSIFEAA KKMAKLKDYS YKPNPQNHEI YKKLYREYRI LHDYFGRGAN DVMKRLKEIK DEVSKI
|
| |