Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_0569 |
Symbol | |
ID | 4286540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 662244 |
End bp | 664115 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638140034 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_755800 |
Protein GI | 114569120 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000923464 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.00000172451 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAACAAGC CCACCTCACA ATCCGAATTC GAGACTCCCA AGGTCACCAC CGGGCCGCTG CCGGGGTCGC GTCGGGTCTA TACCTCGCCC AAGGCGGGGC CGGCGCTGAA AGTGCCGCAT CGCGAGATCG ACTTGCATCC CACCGCCAAC GAGCCGCCGG TGCGCGTCTA TGACACCTCC GGGCCCTACT CGGATCCGGC AGCGACCATC GACGTGGAGA CCGGTCTGCC GCGTCACCGC CTCGCCTGGT ACAAGGATCG CCAGCTGACC GAGTATGAGG GCCGCCCGCT CTCGCCGCTC GATAATGGCG GCGCCACGGG CAAGTATCTC GCCCGCGAAT TCCCGGCCAC GCACAAGCCG TTGAAGGGCA CGGATGGCCA GCCGGTGACC CAGTATGAAT TTGCCCGCGC CGGCATCATC ACGCCGGAAA TGGTCTATGT CGCCGAGCGC GAGAACCTGG GTCGCAAGAC GGCGGTCGAG GGTGCCGCCG AACGCCTCGC CGATGGCGAG AGCTTTGGCG CCGAGATTCC GGAACATGTC ACGCCGGAAT TCGTCCGTGA CGAGATCGCC CGCGGCCGCG CCATCATCCC GGCCAATATC AACCACCCGG AACTCGAGCC GATGATCATC GGCCGTAATT TCCTGACCAA GATCAACGCC AATATCGGCA ATAGCGCCGT CGCCTCCTCG GTCGAGGAAG AGGTCGACAA GATGGTCTGG TCGATCCGCT GGGGCGGCGA CACGGTGATG GACCTGTCGA CCGGCAAGAA TATCCACAAC ACCCGCGAAT GGATCCTGCG CAATTCCCCC GTCCCGATCG GCACCGTGCC GATCTACCAG GCTCTGGAGA AGGTCAACGG CATTGCCGAG GACCTGACCT GGGAGGTGTT CCGCGACACG CTGATCGAGC AGGCCGAGCA GGGCGTCGAC TATTTCACCA TCCATGCCGG CGTGCGGCTG GCCTATGTGC CGCTCACCGC CGACCGGGTC ACCGGCATCG TCTCGCGCGG CGGGTCGATC ATGGCGAAAT GGTGCCTGGC CCATCACACC GAGAGCTTCC TCTACACCCA TTTCGAGGAG ATCTGCGACA TCATGCGGGC CTATGATGTC TCCTTCTCGC TGGGCGACGG GCTGCGCCCC GGCTCCATCG CCGACGCCAA TGATCGCGCG CAATTTGCCG AGCTGGAGAC GCTGGGCGAG CTCACAAAGA TCGCCTGGGC CAAGGGCTGC CAGGTGATGA TCGAGGGGCC GGGCCATGTC GCCATGCACA AGATCAAGGC CAATATGGAC AAGCAGTTGA AGGAGTGCCA CGAGGCGCCC TTCTATACGC TGGGGCCGCT CACCACCGAT ATCGCGCCCG GCTATGACCA CATCACCAGC GGTATCGGCG CCGCCATGAT CGGCTGGTTC GGCTGTGCCA TGCTCTGCTA TGTCACGCCC AAGGAACATC TGGGCCTGCC CGACCGCGAT GACGTCAAGG TCGGCGTGAT TACCTATAAA ATCGCCGCCC ACGCCGGCGA CCTCGCCAAG GGCCACCCCG CCGCCAAGAT CCGCGACGAC GCCCTGTCAC GGGCCAGGTT CGAATTCCGC TGGGAAGACC AGTTCAACCT CGCGCTTGAT CCTGAAACCG CCCGCGATTT CCACGACAAG ACCCTCCCCA AGGAAGCCCA CAAGGTCGCC CATTTCTGCT CCATGTGCGG GCCGAAGTTT TGCTCGATGG AGATTACGCG GGAGATCCGG GACCGGTTTG GGAGTGCGCG GGCGCCGAAT GCTGTGGATG CCGGGATGGC GGAGAAGGCG GCGGAGTTCC GCGAGAAGGG TGGGGAGATT TATCTCTCGG AAGACGGCAA GGTCCGCGAA CCAATAGACT AG
|
Protein sequence | MNKPTSQSEF ETPKVTTGPL PGSRRVYTSP KAGPALKVPH REIDLHPTAN EPPVRVYDTS GPYSDPAATI DVETGLPRHR LAWYKDRQLT EYEGRPLSPL DNGGATGKYL AREFPATHKP LKGTDGQPVT QYEFARAGII TPEMVYVAER ENLGRKTAVE GAAERLADGE SFGAEIPEHV TPEFVRDEIA RGRAIIPANI NHPELEPMII GRNFLTKINA NIGNSAVASS VEEEVDKMVW SIRWGGDTVM DLSTGKNIHN TREWILRNSP VPIGTVPIYQ ALEKVNGIAE DLTWEVFRDT LIEQAEQGVD YFTIHAGVRL AYVPLTADRV TGIVSRGGSI MAKWCLAHHT ESFLYTHFEE ICDIMRAYDV SFSLGDGLRP GSIADANDRA QFAELETLGE LTKIAWAKGC QVMIEGPGHV AMHKIKANMD KQLKECHEAP FYTLGPLTTD IAPGYDHITS GIGAAMIGWF GCAMLCYVTP KEHLGLPDRD DVKVGVITYK IAAHAGDLAK GHPAAKIRDD ALSRARFEFR WEDQFNLALD PETARDFHDK TLPKEAHKVA HFCSMCGPKF CSMEITREIR DRFGSARAPN AVDAGMAEKA AEFREKGGEI YLSEDGKVRE PID
|
| |