Gene Mmar10_0569 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_0569 
Symbol 
ID4286540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp662244 
End bp664115 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content64% 
IMG OID638140034 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_755800 
Protein GI114569120 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000923464 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.00000172451 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAACAAGC CCACCTCACA ATCCGAATTC GAGACTCCCA AGGTCACCAC CGGGCCGCTG 
CCGGGGTCGC GTCGGGTCTA TACCTCGCCC AAGGCGGGGC CGGCGCTGAA AGTGCCGCAT
CGCGAGATCG ACTTGCATCC CACCGCCAAC GAGCCGCCGG TGCGCGTCTA TGACACCTCC
GGGCCCTACT CGGATCCGGC AGCGACCATC GACGTGGAGA CCGGTCTGCC GCGTCACCGC
CTCGCCTGGT ACAAGGATCG CCAGCTGACC GAGTATGAGG GCCGCCCGCT CTCGCCGCTC
GATAATGGCG GCGCCACGGG CAAGTATCTC GCCCGCGAAT TCCCGGCCAC GCACAAGCCG
TTGAAGGGCA CGGATGGCCA GCCGGTGACC CAGTATGAAT TTGCCCGCGC CGGCATCATC
ACGCCGGAAA TGGTCTATGT CGCCGAGCGC GAGAACCTGG GTCGCAAGAC GGCGGTCGAG
GGTGCCGCCG AACGCCTCGC CGATGGCGAG AGCTTTGGCG CCGAGATTCC GGAACATGTC
ACGCCGGAAT TCGTCCGTGA CGAGATCGCC CGCGGCCGCG CCATCATCCC GGCCAATATC
AACCACCCGG AACTCGAGCC GATGATCATC GGCCGTAATT TCCTGACCAA GATCAACGCC
AATATCGGCA ATAGCGCCGT CGCCTCCTCG GTCGAGGAAG AGGTCGACAA GATGGTCTGG
TCGATCCGCT GGGGCGGCGA CACGGTGATG GACCTGTCGA CCGGCAAGAA TATCCACAAC
ACCCGCGAAT GGATCCTGCG CAATTCCCCC GTCCCGATCG GCACCGTGCC GATCTACCAG
GCTCTGGAGA AGGTCAACGG CATTGCCGAG GACCTGACCT GGGAGGTGTT CCGCGACACG
CTGATCGAGC AGGCCGAGCA GGGCGTCGAC TATTTCACCA TCCATGCCGG CGTGCGGCTG
GCCTATGTGC CGCTCACCGC CGACCGGGTC ACCGGCATCG TCTCGCGCGG CGGGTCGATC
ATGGCGAAAT GGTGCCTGGC CCATCACACC GAGAGCTTCC TCTACACCCA TTTCGAGGAG
ATCTGCGACA TCATGCGGGC CTATGATGTC TCCTTCTCGC TGGGCGACGG GCTGCGCCCC
GGCTCCATCG CCGACGCCAA TGATCGCGCG CAATTTGCCG AGCTGGAGAC GCTGGGCGAG
CTCACAAAGA TCGCCTGGGC CAAGGGCTGC CAGGTGATGA TCGAGGGGCC GGGCCATGTC
GCCATGCACA AGATCAAGGC CAATATGGAC AAGCAGTTGA AGGAGTGCCA CGAGGCGCCC
TTCTATACGC TGGGGCCGCT CACCACCGAT ATCGCGCCCG GCTATGACCA CATCACCAGC
GGTATCGGCG CCGCCATGAT CGGCTGGTTC GGCTGTGCCA TGCTCTGCTA TGTCACGCCC
AAGGAACATC TGGGCCTGCC CGACCGCGAT GACGTCAAGG TCGGCGTGAT TACCTATAAA
ATCGCCGCCC ACGCCGGCGA CCTCGCCAAG GGCCACCCCG CCGCCAAGAT CCGCGACGAC
GCCCTGTCAC GGGCCAGGTT CGAATTCCGC TGGGAAGACC AGTTCAACCT CGCGCTTGAT
CCTGAAACCG CCCGCGATTT CCACGACAAG ACCCTCCCCA AGGAAGCCCA CAAGGTCGCC
CATTTCTGCT CCATGTGCGG GCCGAAGTTT TGCTCGATGG AGATTACGCG GGAGATCCGG
GACCGGTTTG GGAGTGCGCG GGCGCCGAAT GCTGTGGATG CCGGGATGGC GGAGAAGGCG
GCGGAGTTCC GCGAGAAGGG TGGGGAGATT TATCTCTCGG AAGACGGCAA GGTCCGCGAA
CCAATAGACT AG
 
Protein sequence
MNKPTSQSEF ETPKVTTGPL PGSRRVYTSP KAGPALKVPH REIDLHPTAN EPPVRVYDTS 
GPYSDPAATI DVETGLPRHR LAWYKDRQLT EYEGRPLSPL DNGGATGKYL AREFPATHKP
LKGTDGQPVT QYEFARAGII TPEMVYVAER ENLGRKTAVE GAAERLADGE SFGAEIPEHV
TPEFVRDEIA RGRAIIPANI NHPELEPMII GRNFLTKINA NIGNSAVASS VEEEVDKMVW
SIRWGGDTVM DLSTGKNIHN TREWILRNSP VPIGTVPIYQ ALEKVNGIAE DLTWEVFRDT
LIEQAEQGVD YFTIHAGVRL AYVPLTADRV TGIVSRGGSI MAKWCLAHHT ESFLYTHFEE
ICDIMRAYDV SFSLGDGLRP GSIADANDRA QFAELETLGE LTKIAWAKGC QVMIEGPGHV
AMHKIKANMD KQLKECHEAP FYTLGPLTTD IAPGYDHITS GIGAAMIGWF GCAMLCYVTP
KEHLGLPDRD DVKVGVITYK IAAHAGDLAK GHPAAKIRDD ALSRARFEFR WEDQFNLALD
PETARDFHDK TLPKEAHKVA HFCSMCGPKF CSMEITREIR DRFGSARAPN AVDAGMAEKA
AEFREKGGEI YLSEDGKVRE PID