Gene M446_3206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3206 
Symbol 
ID6134030 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3549433 
End bp3551316 
Gene Length1884 bp 
Protein Length627 aa 
Translation table11 
GC content70% 
IMG OID641643394 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001770046 
Protein GI170741391 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.184602 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCCC CCGTCATTCC CCCCAAAGGC CTGCCGCAGA GCGTGACCAC GGGACCGATC 
GCCGGCTCCG TCAAGGTCTA TGCCAGTCCC GAGGGGCGGC CTGACATCCG GGTGCCGCTG
CGCGAGATCG CCCTCAGCGA TCCGGCCGAG GCGCCGGTGC GCGTCTACGA TCCCTCGGGC
CCCTACACGG AGAGCGACGC CAAGGTCGAC CTCGCGGCCG GCCTGCCCCA GCTGCGCGAT
CCCTGGATCG CCGGCCGCGG CTACGCCGCG GTGACGCCCC GCGCGGTCAA GCCGGAGGAT
AACGGCTTCG CGGCCGAGGA CAGGCTGGTC GCCCCCTGCC CGGCCGCGCG CACGATCCGC
AGGGCCGCTC CCGGCCAGAT GGTGACCCAG TACGAGTTCG CCCGGGCCGG GATCGTCACC
GAGGAGATGA TCTACGTCGC CCACCGGGAG AATCTCGGCC GCCGGGCGAT GCTGGACCAG
GCCGAGGCCA AGCTCGCCGA CGGCGAGAGC TTCGGCGCGG CGATTCCGCC CTTCATCACC
CCGGAATTCG TGCGCGACGA GATCGCCCGC GGCCGCGCCA TCATCCCGGC CAACATCAAC
CACACCGAAC TCGAGCCGAT GGCCATCGGC CGCAACTTCC TGGTCAAGAT CAACGCCAAT
ATCGGCAACT CGGCCGTGAC CTCCTCGGCG GCCGAGGAGG TCGAGAAGAT GGTCTGGGCG
ACCCGCTGGG GCGCCGACAC CGTCATGGAC CTGTCGACGG GCCGCAACAT CCACAACATC
CGCGGCTGGA TCCTGCGCAA CGCGCCGGTG CCGATCGGGA CCGTGCCGAT CTACCAGGCC
CTGGAGAAGG TCGGGGGCGA CCCGCTGAAA CTCGACTGGG AGGTGTTCAG GGACACGCTC
ATCGAGCAGG CCGAGCAGGG GGTCGACTAC TTCACCATCC ATGCGGGCGT GCGGCTCGCC
CACGTGCCGC TGACCGCGCG GCGGGTCACC GGCATCGTCT CGCGCGGCGG CTCGATCATG
GCCCGCTGGT GCCTCGCCGG CCACCGGGAA TCGTTCCTCT ACGAGCGGTT CGACGAGATC
TGCGAGATCA TGCGCGCCTA CGACGTCTCC TTCTCGCTCG GGGACGGCCT GCGGCCCGGC
TCGATCGCCG ACGCCAACGA CGCGGCGCAA TTCGCCGAGC TCGAGACCCT CGGCGAACTC
ACCAAGGTCG CCTGGGAGAA GGGCTGTCAG GTCATGATCG AGGGCCCCGG CCACGTGCCG
ATGCACAAGA TCAAGGTCAA CATGGAGAAG CAGCTGCGCG AGTGCGGCGA GGCGCCGTTC
TACACGCTCG GGCCGCTGAC CACCGACGTG GCGCCCGGCT ACGACCACAT CACCTCCGGC
ATCGGCGCGG CGATGATCGG CTGGTACGGC ACCGCGATGC TCTGCTACGT GACCCCCAAG
GAGCATCTCG GCCTGCCGAA CCGCGACGAC GTGAAGACCG GCGTCATCAC CTACAAGATC
GCCGCCCACG CCGCCGACCT CGCCAAGGGC CACCCGGCCG CGCAGCTGCG CGACGACGCC
CTGTCCCGGG CCCGGTTCGA CTTCCGCTGG GAGGACCAGT TCAACCTCTC CCTCGACCCG
GACACGGCCC GCGCCTACCA CGACGAGACG CTGCCCAAGG ACGCCCACAA GGTCGCGCAT
TTCTGCTCGA TGTGCGGCCC GAAATTCTGC TCGATGAAGA TCACGCAGGA CCTGCGGGCG
GACGTGCTGG CCATGGAGGC GGCCGGCACC GTGGTGGGCG CCGCCCCCGC GATGAGCGAG
GCGGACCGGG CGGCCGGCAT GGCGGCCAAG TCGGCCGAGT TCCTGGCCGA GGGCGGCAAG
CTCTACGTCG ACGCCGCGGA GTGA
 
Protein sequence
MNAPVIPPKG LPQSVTTGPI AGSVKVYASP EGRPDIRVPL REIALSDPAE APVRVYDPSG 
PYTESDAKVD LAAGLPQLRD PWIAGRGYAA VTPRAVKPED NGFAAEDRLV APCPAARTIR
RAAPGQMVTQ YEFARAGIVT EEMIYVAHRE NLGRRAMLDQ AEAKLADGES FGAAIPPFIT
PEFVRDEIAR GRAIIPANIN HTELEPMAIG RNFLVKINAN IGNSAVTSSA AEEVEKMVWA
TRWGADTVMD LSTGRNIHNI RGWILRNAPV PIGTVPIYQA LEKVGGDPLK LDWEVFRDTL
IEQAEQGVDY FTIHAGVRLA HVPLTARRVT GIVSRGGSIM ARWCLAGHRE SFLYERFDEI
CEIMRAYDVS FSLGDGLRPG SIADANDAAQ FAELETLGEL TKVAWEKGCQ VMIEGPGHVP
MHKIKVNMEK QLRECGEAPF YTLGPLTTDV APGYDHITSG IGAAMIGWYG TAMLCYVTPK
EHLGLPNRDD VKTGVITYKI AAHAADLAKG HPAAQLRDDA LSRARFDFRW EDQFNLSLDP
DTARAYHDET LPKDAHKVAH FCSMCGPKFC SMKITQDLRA DVLAMEAAGT VVGAAPAMSE
ADRAAGMAAK SAEFLAEGGK LYVDAAE