Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_2999 |
Symbol | |
ID | 7093494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 3312191 |
End bp | 3314020 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643466310 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_002363272 |
Protein GI | 217979125 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.548936 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATCC ACGACAAGCC GAAGAGCCTT CGTCCGGAAA GCGTGACCCA AGGGCCCATG GCCGGCTCGC GCAAGATCTA TGTGCATCCG GACGGTCGCT CCGACATCGC CGTGCCGCTG CGTGAGATCA CCCTCAATCC GTCCGCAAAC GAGCCGCCGG TGCGCGTCTA CGACGCCTCG GGACCCTATA CGGAGTCCGA GCCCGCGATC GACCTTTCCG CCGGCTTGCC GCGCATCCGG GACCCTTGGC TGGCGAGGCG CGCGGGGCTC GACTTCTACT CCGGCCGCCC GGTTCAGCCG GAAGACAATG GTTCGGTGTC GCCGGACCGG CTCGCCCCGC CCTGCCCCGC CAACACCTCG CCGCGCAAGG GCCGGGACGG CGCGCTGGTG ACGCAATATG AATTCGCCCG CGCAGGCGTG ATCACCGAAG AAATGATCTA TGTCGCGGCC CGCGAAAACC TTGGCCGAGA GGCGGTGCTG GAGGGCGCCG AAGCGAAAAT CGCCGACGGC GAAAGTTTCG GCGCGTCGAT CCCGGCCTTC ATCACGCCGG AATTCGTGCG CGACGAGATC GCCCGCGGCC GCGCCATCAT CCCCGCCAAC ATCAACCATC CCGAACTCGA GCCGATGATC ATCGGCCGCA ATTTTCTCGT GAAGGTCAAC GCCAACATCG GCAATTCGGC GGTGACGTCG TCCGTCGCCG AAGAGGTCGA AAAGATGGTG TGGGCGATCC GCTGGGGCGC CGATACGGTG ATGGATCTCT CGACGGGACG CAACATCCAC AACATCCGCG ACTGGATCAT CCGCAACTCG CCAGCGCCGA TCGGCACCGT GCCGATCTAC CAGGCGCTGG AGAAGGTCAA TGGCGATCCG ACCAAACTGA CATTCGAATT GTTCCGCGAC ACGCTGATCG AACAGGCCGA ACAGGGCGTC GATTATTTCA CGATCCACGC CGGCGTGCGG CTCGGCTATA TTCCGCTGAC GGCGAAGCGG ACCACGGGCA TCGTCTCGCG CGGCGGCTCG ATCATGGCCC GCTGGTGCCT TGCCCATCAC AAGGAAAGCT TCCTCTACGA GCGCTTCGAC GAGCTCTGCG ACGTCATGCG CGCCTATGAC GTCTCCTTCT CGCTCGGCGA CGGCTTGCGC CCGGGCTCGA TCGCCGACGC CAACGACCGC GCCCAATTCG CCGAATTGGA GACGCTCGGC GAGCTGACCA AAATCGCCTG GGACAAGGGC TGCCAGGTGA TGATCGAAGG GCCCGGCCAT GTGCCGATGC ACAAGATCAA GATCAACATG GAAAAGCAGC TGAAGGAATG TCACGAGGCG CCGTTCTACA CGCTCGGGCC GCTGACAACG GACATTGCGC CAGGCTATGA CCACATCACC TCCGGCATCG GCGCTGCGAT GATCGGCTGG TTCGGCTGCG CGATGTTGTG TTATGTGACG CCGAAAGAAC ATCTCGGCCT GCCGGACCGC GACGACGTCA AGACAGGCGT CATTACCTAC CGCATCGCCG CCCATGCCGG CGACCTCGCC AAGGGCCATC CGGCGGCGCA GCTGCGCGAC GACGCGCTGT CGCGCGCGAG ATTCGATTTC CGCTGGGAGG ATCAGTTCAA TCTCGGCCTC GATCCCGACA CGGCGCGGCA GTTCCACGAC GAGACGCTGC CGAAGGATGC GCATAAGACT GCGCATTTCT GCTCGATGTG CGGTCCGCAA TTCTGTTCGA TGAAGATCAC GCAGGATATT CGCGACGAGG TCGCCGCGAT GGAACGCGAG AAGGGCATGG CCGACAAGAG CGCCGAATTC CTGGACGGCG GCGGCAAGCT TTACGTTTAA
|
Protein sequence | MNIHDKPKSL RPESVTQGPM AGSRKIYVHP DGRSDIAVPL REITLNPSAN EPPVRVYDAS GPYTESEPAI DLSAGLPRIR DPWLARRAGL DFYSGRPVQP EDNGSVSPDR LAPPCPANTS PRKGRDGALV TQYEFARAGV ITEEMIYVAA RENLGREAVL EGAEAKIADG ESFGASIPAF ITPEFVRDEI ARGRAIIPAN INHPELEPMI IGRNFLVKVN ANIGNSAVTS SVAEEVEKMV WAIRWGADTV MDLSTGRNIH NIRDWIIRNS PAPIGTVPIY QALEKVNGDP TKLTFELFRD TLIEQAEQGV DYFTIHAGVR LGYIPLTAKR TTGIVSRGGS IMARWCLAHH KESFLYERFD ELCDVMRAYD VSFSLGDGLR PGSIADANDR AQFAELETLG ELTKIAWDKG CQVMIEGPGH VPMHKIKINM EKQLKECHEA PFYTLGPLTT DIAPGYDHIT SGIGAAMIGW FGCAMLCYVT PKEHLGLPDR DDVKTGVITY RIAAHAGDLA KGHPAAQLRD DALSRARFDF RWEDQFNLGL DPDTARQFHD ETLPKDAHKT AHFCSMCGPQ FCSMKITQDI RDEVAAMERE KGMADKSAEF LDGGGKLYV
|
| |