Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_1970 |
Symbol | |
ID | 4284678 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 2151468 |
End bp | 2152406 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638141470 |
Product | spore coat polysaccharide biosynthesis protein glycosyltransferase-like protein |
Protein accession | YP_757200 |
Protein GI | 114570520 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3980] Spore coat polysaccharide biosynthesis protein, predicted glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.750902 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGCC CCTTCAAAAT CGCCATTCGG GCGGATGGCT CGAAGCTGAT TGGCCTCGGA CACGTCATGC GCTGCGGCGC GCTGGCCAAT GCGCTGGCTG AGATCGGTGC CGGAATTGCC TGGTTGACGA CAACGCCTCA GCATTTGCCC GCCGGCCTGT CGCATGCGGT CGAGCCCGTG CAACTGGACA ATGACGAACA GCTGGCCGAC GCGCTGACCG CCCGCAACAT CCACCATCTT GTCGCCGACT GGCACCGCAC CGACCCGCAG CGTGTCCACA GCCTGAGGGC AAACGGCGTG CACGTCAGCC TGGTCGGCAA TTTCCTGCAG GATGCAGTCC CTGACCTGCA TATCAGGCAA GGCTTCCTGC CGGGCATGTC GCCATCCGGG GCGCCAACCT TGAGTGGCCC GAAATACCTG TTGCTTCCCG CCTCATGCGA GGCGCTGCCG CCACGCCCCG TCGCGGCGAC AGCCCGGCGG GTCCTGCTGT CGCTGGGCGG CACTGACAGC CCGCTTCTGG CACGCATCCG GGACCGCCTG GCGCAAGGCT TTCCGGCAAT CGAGGTCGAT GGGCGCGGCC CCGTCGGCAA TGGTCCGATC CCGCCTCTGA CCGAAGCCAT GCGAAGAGCC GATATCGGAA TTCTGGCTGG CGGAACGAGC TTGCACGAGG CGGCAGCGAC CGGCCTGCCG AGCCTGTGTC TGCCTATCGC CGCCAATCAG TTCGAGCGGG CCGGTCATTT TGAAAGCGCG GGCCTCGGCA TCAGTCTGGA TCCGGCAGAC CCCGGTTTCG ACCAGCAGTT CGACACGGCA CTGGCCAGGC TTGTTTCAGA TCAAGCCGGA CGACAGGACA TGGCCCGAAC CGGCCAGGCC CTGGTGGACG GCGGCGGAGC CCGACGCGTC GCCACCCACC TGGCCGCCCT CATCACCGCC GGGACCTGA
|
Protein sequence | MARPFKIAIR ADGSKLIGLG HVMRCGALAN ALAEIGAGIA WLTTTPQHLP AGLSHAVEPV QLDNDEQLAD ALTARNIHHL VADWHRTDPQ RVHSLRANGV HVSLVGNFLQ DAVPDLHIRQ GFLPGMSPSG APTLSGPKYL LLPASCEALP PRPVAATARR VLLSLGGTDS PLLARIRDRL AQGFPAIEVD GRGPVGNGPI PPLTEAMRRA DIGILAGGTS LHEAAATGLP SLCLPIAANQ FERAGHFESA GLGISLDPAD PGFDQQFDTA LARLVSDQAG RQDMARTGQA LVDGGGARRV ATHLAALITA GT
|
| |