Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_3206 |
Symbol | |
ID | 6134030 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 3549433 |
End bp | 3551316 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641643394 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001770046 |
Protein GI | 170741391 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.184602 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCCC CCGTCATTCC CCCCAAAGGC CTGCCGCAGA GCGTGACCAC GGGACCGATC GCCGGCTCCG TCAAGGTCTA TGCCAGTCCC GAGGGGCGGC CTGACATCCG GGTGCCGCTG CGCGAGATCG CCCTCAGCGA TCCGGCCGAG GCGCCGGTGC GCGTCTACGA TCCCTCGGGC CCCTACACGG AGAGCGACGC CAAGGTCGAC CTCGCGGCCG GCCTGCCCCA GCTGCGCGAT CCCTGGATCG CCGGCCGCGG CTACGCCGCG GTGACGCCCC GCGCGGTCAA GCCGGAGGAT AACGGCTTCG CGGCCGAGGA CAGGCTGGTC GCCCCCTGCC CGGCCGCGCG CACGATCCGC AGGGCCGCTC CCGGCCAGAT GGTGACCCAG TACGAGTTCG CCCGGGCCGG GATCGTCACC GAGGAGATGA TCTACGTCGC CCACCGGGAG AATCTCGGCC GCCGGGCGAT GCTGGACCAG GCCGAGGCCA AGCTCGCCGA CGGCGAGAGC TTCGGCGCGG CGATTCCGCC CTTCATCACC CCGGAATTCG TGCGCGACGA GATCGCCCGC GGCCGCGCCA TCATCCCGGC CAACATCAAC CACACCGAAC TCGAGCCGAT GGCCATCGGC CGCAACTTCC TGGTCAAGAT CAACGCCAAT ATCGGCAACT CGGCCGTGAC CTCCTCGGCG GCCGAGGAGG TCGAGAAGAT GGTCTGGGCG ACCCGCTGGG GCGCCGACAC CGTCATGGAC CTGTCGACGG GCCGCAACAT CCACAACATC CGCGGCTGGA TCCTGCGCAA CGCGCCGGTG CCGATCGGGA CCGTGCCGAT CTACCAGGCC CTGGAGAAGG TCGGGGGCGA CCCGCTGAAA CTCGACTGGG AGGTGTTCAG GGACACGCTC ATCGAGCAGG CCGAGCAGGG GGTCGACTAC TTCACCATCC ATGCGGGCGT GCGGCTCGCC CACGTGCCGC TGACCGCGCG GCGGGTCACC GGCATCGTCT CGCGCGGCGG CTCGATCATG GCCCGCTGGT GCCTCGCCGG CCACCGGGAA TCGTTCCTCT ACGAGCGGTT CGACGAGATC TGCGAGATCA TGCGCGCCTA CGACGTCTCC TTCTCGCTCG GGGACGGCCT GCGGCCCGGC TCGATCGCCG ACGCCAACGA CGCGGCGCAA TTCGCCGAGC TCGAGACCCT CGGCGAACTC ACCAAGGTCG CCTGGGAGAA GGGCTGTCAG GTCATGATCG AGGGCCCCGG CCACGTGCCG ATGCACAAGA TCAAGGTCAA CATGGAGAAG CAGCTGCGCG AGTGCGGCGA GGCGCCGTTC TACACGCTCG GGCCGCTGAC CACCGACGTG GCGCCCGGCT ACGACCACAT CACCTCCGGC ATCGGCGCGG CGATGATCGG CTGGTACGGC ACCGCGATGC TCTGCTACGT GACCCCCAAG GAGCATCTCG GCCTGCCGAA CCGCGACGAC GTGAAGACCG GCGTCATCAC CTACAAGATC GCCGCCCACG CCGCCGACCT CGCCAAGGGC CACCCGGCCG CGCAGCTGCG CGACGACGCC CTGTCCCGGG CCCGGTTCGA CTTCCGCTGG GAGGACCAGT TCAACCTCTC CCTCGACCCG GACACGGCCC GCGCCTACCA CGACGAGACG CTGCCCAAGG ACGCCCACAA GGTCGCGCAT TTCTGCTCGA TGTGCGGCCC GAAATTCTGC TCGATGAAGA TCACGCAGGA CCTGCGGGCG GACGTGCTGG CCATGGAGGC GGCCGGCACC GTGGTGGGCG CCGCCCCCGC GATGAGCGAG GCGGACCGGG CGGCCGGCAT GGCGGCCAAG TCGGCCGAGT TCCTGGCCGA GGGCGGCAAG CTCTACGTCG ACGCCGCGGA GTGA
|
Protein sequence | MNAPVIPPKG LPQSVTTGPI AGSVKVYASP EGRPDIRVPL REIALSDPAE APVRVYDPSG PYTESDAKVD LAAGLPQLRD PWIAGRGYAA VTPRAVKPED NGFAAEDRLV APCPAARTIR RAAPGQMVTQ YEFARAGIVT EEMIYVAHRE NLGRRAMLDQ AEAKLADGES FGAAIPPFIT PEFVRDEIAR GRAIIPANIN HTELEPMAIG RNFLVKINAN IGNSAVTSSA AEEVEKMVWA TRWGADTVMD LSTGRNIHNI RGWILRNAPV PIGTVPIYQA LEKVGGDPLK LDWEVFRDTL IEQAEQGVDY FTIHAGVRLA HVPLTARRVT GIVSRGGSIM ARWCLAGHRE SFLYERFDEI CEIMRAYDVS FSLGDGLRPG SIADANDAAQ FAELETLGEL TKVAWEKGCQ VMIEGPGHVP MHKIKVNMEK QLRECGEAPF YTLGPLTTDV APGYDHITSG IGAAMIGWYG TAMLCYVTPK EHLGLPNRDD VKTGVITYKI AAHAADLAKG HPAAQLRDDA LSRARFDFRW EDQFNLSLDP DTARAYHDET LPKDAHKVAH FCSMCGPKFC SMKITQDLRA DVLAMEAAGT VVGAAPAMSE ADRAAGMAAK SAEFLAEGGK LYVDAAE
|
| |