Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_1681 |
Symbol | |
ID | 4283862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 1847626 |
End bp | 1849203 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638141169 |
Product | sugar transferase |
Protein accession | YP_756911 |
Protein GI | 114570231 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.895795 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.0521671 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGCGA GACCGATGGG GACTGCCGTG GATTATTCCA ACCAGCCAAG CCAGACCGAG CACGCGCCCC GCAAAGCGTC GGGGCCGGCC AATGACATGC CCGACACGCT GCGCCCGGCC AATCCGGACA CAGCCAGCCT GACGCGCGCC CATCTGAAAG CAGACGCGCG CGCCCCCCGC AACCGGGTCA ATCGAAAGGC GCTCGGTCAC ATCTGGCAGA GCATCGACAT ATTCGCCGTG CTGCTCCTGA CCACGACCGG CGCCTATGCG CTGTCAGGGG GGGACGTCAT CGCCGTGGCC GCCGGGGAGT TGTTTCCCCT TCTCGCCTTC GCGGCCCTTT GCCCGGCTTT CACCCTGTTG ATGGGGTTGT ACAAGGTGGA AGCGCGAGAA AGCGCCGCCT TTCGCATGTT GCGCGCCGTC ATTGCGACCG CGCTGACCGG CAGCGCGATT ACTGCATTGT CCCTCATTAC AGCGCCCGAC ATGGCACCGC AGATCGCCAC ATTCGCCCTC ACGGCCGTGG GTGCACTGAC CCTCCTGCAT GTCATCTATG CCGGCTTTGT CCAGCACTGG GCCCGGTCGG GCCGGCTGGC CCGCAATGTT GTTCTCGTCG GAGCAACGGC CAATGCCAGC AAGCTGATCA AGGCCAATGC CGGGTCTGGG ACCGTCAATG TTGTCGGCAT TTTCGACGAC CGCGCCGCCC GCAGCCCCCA GGCACTGGCT GGAGCGCCCT ATCTGGGAAC AACCGACGAC CTTTTGAGCT GGTCGCTGCT GCACGAGGTC GACCGCATCA TCCTGACGGT AACACCGAAG GCAGAAGACC GTGTCCGCTT GCTGCTGGGC AAGCTGCGCG CCCTGCCGCA CACGGTCTGT CTCCTGCTTG ACCTCGACAG TTTCGATCCG GCCGAAACCA CGCTCGATGA TATTATCGGC GTCCAGGCGG CGCGGATGAG CGGCGTCGAG GAACGCTTCG GACATGATCT GGCCAAGCGA ACACAGGATA TCGTCCTGGC CCTCGGATTG AGCCTCGTCG CCCTGCCCGT CATGGCACTG ATCGCCCTGG CGGTCCGTCT GGGCAGCCCC GGTCCGGTCC TGTTCCGCCA GGTCCGTGAA GGCTTCAACG GCCGCCCGAT CAAGGTCCTG AAATTCCGGA CCATGCGTCA CGACCCGGCG TCGGCCGCCA AGCCGATGCG TCAGGTCGAA CTCGATGATC CGCGCGTGAC ACGAATCGGC GGTTTCCTGC GCAAGACAAG CCTCGATGAG CTGCCGCAAT TGTGGAATGT CCTGGTCGGT GAGATGTCGC TGGTCGGTCC GCGCCCACAC GCGCCGGGCA TGCGGACAGG TGGCACCGAA ACAGCCAAGC TGGTTGCCGA ATATGCCCAT CGCCATCGCG TCAAGCCGGG CATCACAGGC TGGGCCCAGA TCAACGGATC GCGCGGCCCC TTGCACTCAC CGGAAGCCGC CCGCGAGCGG GTGGCCTATG ACGTCGCCTA TATCGCCAAG GCCAATTTCT GGTTCGATCT GTGGATCATG GCCCGCACCC TGCCGGCCCT GCTCGGTGAC AAGGCCAATA TTCGCTGA
|
Protein sequence | MSARPMGTAV DYSNQPSQTE HAPRKASGPA NDMPDTLRPA NPDTASLTRA HLKADARAPR NRVNRKALGH IWQSIDIFAV LLLTTTGAYA LSGGDVIAVA AGELFPLLAF AALCPAFTLL MGLYKVEARE SAAFRMLRAV IATALTGSAI TALSLITAPD MAPQIATFAL TAVGALTLLH VIYAGFVQHW ARSGRLARNV VLVGATANAS KLIKANAGSG TVNVVGIFDD RAARSPQALA GAPYLGTTDD LLSWSLLHEV DRIILTVTPK AEDRVRLLLG KLRALPHTVC LLLDLDSFDP AETTLDDIIG VQAARMSGVE ERFGHDLAKR TQDIVLALGL SLVALPVMAL IALAVRLGSP GPVLFRQVRE GFNGRPIKVL KFRTMRHDPA SAAKPMRQVE LDDPRVTRIG GFLRKTSLDE LPQLWNVLVG EMSLVGPRPH APGMRTGGTE TAKLVAEYAH RHRVKPGITG WAQINGSRGP LHSPEAARER VAYDVAYIAK ANFWFDLWIM ARTLPALLGD KANIR
|
| |