Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbur_0703 |
Symbol | |
ID | 3998163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcoides burtonii DSM 6242 |
Kingdom | Archaea |
Replicon accession | NC_007955 |
Strand | - |
Start bp | 724470 |
End bp | 725759 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637958502 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_565418 |
Protein GI | 91772726 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATTAA TGGAAGATGC AAGGAAGGGA CTGATCACCC CTGAGATTAG AAGTGTTGCC GAACTTGAAG GGATAGATGT AGAGCTTGTA AGGTCCTGTG TTGCAAAGGG ACTTGTGACC ATCCCGAAGA ACATAAAAGG TAAATCCCGT GCATTCGGCA TTGGGAAATA TATGAGCGTT AAGGTAAACG CCAATATCGG AACCTCAAGG GATTTTGTGA ATATCGATGA AGAGGTCGAT AAAGCAAAAG CAGCGGTAAA ATACGGAGCA GATACCATTA TGGACCTGTC CACCGGAGGA GACCTCGACC TTATACGTTC AAGGATAATG GATGCAGTGG ATGTTCCTAT TGGAACCGTG CCAATATACC AGGCAGCAGC ATCTCATAAG ACTGTGGTCG ATATGACATC TGATGACATG TTCAATGCTG TTCGCAAACA TGCTGAAGAT GGGGTTGATT TTGTCACTAT CCATTCAGGT GTCAACTACA ATGCATTGGA AAGGCTCAAA AAGGGAGACC GGATCACAAA CGTTGTCAGT CGTGGCGGCT CTTTCACCAT TGCATGGATG ATACACAATG AACAGGAAAA TCCATTCTAT TCCGAATATG ACTACCTTGT GGAGATCGCT AAGGAATACG ACCTGACAAT AAGTCTTGGC GATGGTATGA GACCGGGCTG TATCCATGAC GCATCCGATG CACCAAAGTT CATGGAGTTC ATAACTCTCG GAGAACTTGT CAGCAAAGCA AGGGAATCAG ACGTGCAGAC CTTTGTGGAA GGTCCTGGAC ATGTACCTTT GAACGAAGTC GAACTTAGCG TAAAGGCCAT GAAGGAATTG TGCCACGGTG CACCCCTCTA CTTGCTTGGC CCTCTTGTGA CCGATATTGC ACCAGGATAT GACCATATAA CAGGAGCTAT CGGAGGCACA CTGGCAGGAA TGTACGGAGC GGATTTCCTT TGCATGACAA CACCGGCAGA ACATCTTGCA TTGCCAACCG TAGATGACAT ACGCGAAGGT GCTATTGTCA CAAATATCGC TGCACACGCT GCCGATCTTA CTCGCGAGGG ACAGAAAGAA AAGGCACGTG AGCTGGATGA CAGGATGGCA CATGCCCGAG CTGAACTCGA TTGGGAGACA CAATTCGAAG TTGCCATTGA CAGTGAAAAG GCACGCAAGA TCAGGGAAAG CCGAAACACC GGTACTGATG CGTGTTCCAT GTGCGGTGAA CTCTGTGCCA TGAAGATAGT CAAAAGTGCA CTTGAAGAGA GCAGGCAGAA AGACAATTGA
|
Protein sequence | MTLMEDARKG LITPEIRSVA ELEGIDVELV RSCVAKGLVT IPKNIKGKSR AFGIGKYMSV KVNANIGTSR DFVNIDEEVD KAKAAVKYGA DTIMDLSTGG DLDLIRSRIM DAVDVPIGTV PIYQAAASHK TVVDMTSDDM FNAVRKHAED GVDFVTIHSG VNYNALERLK KGDRITNVVS RGGSFTIAWM IHNEQENPFY SEYDYLVEIA KEYDLTISLG DGMRPGCIHD ASDAPKFMEF ITLGELVSKA RESDVQTFVE GPGHVPLNEV ELSVKAMKEL CHGAPLYLLG PLVTDIAPGY DHITGAIGGT LAGMYGADFL CMTTPAEHLA LPTVDDIREG AIVTNIAAHA ADLTREGQKE KARELDDRMA HARAELDWET QFEVAIDSEK ARKIRESRNT GTDACSMCGE LCAMKIVKSA LEESRQKDN
|
| |