Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_1690 |
Symbol | |
ID | 4285693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | - |
Start bp | 1857666 |
End bp | 1859204 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638141178 |
Product | lipopolysaccharide biosynthesis |
Protein accession | YP_756920 |
Protein GI | 114570240 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000356415 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.0111371 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCGCG CATATCATGA AACAGGCCCC GCTTACCCGG CGGCCGCCCG GCCGGCTGGC GCCCCGAGAG GGGAGTTGGA CCTGTTCGAC GTGATCGCGC TGGCGTGGTC TCAGAAAGGC TTTATCGCGT TGATTTTCGC GCTTATTTTC GCCGTTGGCG TGGGCGCGTC ACTGACCTTG CTGAAGCCGA GTTATACATC CGATACGCGG CTTCTCGTTC TGCTTGATTC GAATCCCACT CCGACAGCGG CCGGTGTCGG TGACGCCTTC ATGCTTGATC AGGTCATGCA GTCGGAGAGC GAATTGCTCA GTTCCGAGGC GGTGCGCCGG CTGGCGCTGG AGACGCTGGG CGCCGATGTG GTGCTGGGCG AGCAAGCCGG CCTCAATGCG GACGGGCTGG CATTGCGGGA GTTGCGCGCG AATTTCAGCC TGTCGCGGGA GCCCAATTCA TCGGCCATCA ATGCCAGCTT CAAATCCCCG TCGGCGGAAA GGTCGGCGCT GATCCTGAAC AGTATTGTCG ATGCCTATCT GGCCTATCGC GAGCAGGTCC TGTTCGAGAC CGGCATTACC GGACTGGAAG TCCGGCGCGC CCAGGCTGAT GAAGCGGTCG CGGCGGCCCG GGCGGAATTG AATACCTTCC TGCAAGCCAA TGATCTCGCC AATTTCGAGG CCGAACGCCT GACCGCCGAC AATCTGGTTG GCAATCTCTC CGAGCGCGCG GCCCAGGCCC GGGCTGAACG GGATTCGGCG ATGGCCGGTG CCGCGGCCCT GCGTGAACGA CTGGGCCAGA TCCCCGAGAG CATCGAGCTC TATGTGGAAA ACGGCGTGTC CGGTGTCCTG CTCGACCGCC GGGCCGAGCG GGCGAGCCTG CTGGCGCGCT ACCAACCCGC CGCTCCGGCT GTGCAGGCCG TCGAGCGCGA GATCGAGGCC ATCGAGGAAT TCATCCAGTC CGGCGCCACG GTCGGCGAGG GGCAGCGACG TACCGGTGCC AATCCGGTGC GGCAAGCGCT GGAATCAGAG CTGGCAACCC GTGAGGCCAA TGCCGAGGCA CAGGCCTCGC TGGCGACGGC CCTGGAGGGC CAGGCGCGCA GCAATCGTGC GCTCGTCTCC CGCCTGCGCC AGTTGGAACC GGAATATACC CGACTGGCCC AGAACATCAC CGCTGCCGAA GAAGCTGCCG GCGCCGTTGC CGCGCTTGAG GCCACCGCTT CGGCCCGCCG TTCGCCGAGC CTGGGCGCGG CCGACGCCGT CCGCCTGATC GACCGGGCTG CCGCGCCCAT GGAAGGCAGC TCGATGAAGA AGCTCGGCCT GATCGGGTCC TTCGTCGCGG CTGCGGGGAT CGCGCTCTTT CTGGGATTGT TGCGCGGCTA CTGGCTGACC TATGTGCGTG CCGCTGCCCT GCCACGTTCG CGCCCGGCGG CCCCGGTCGT TGCGGCCCAA CCGGCCGGGC CGGTTCAGGA CGCGCGGCAG CCTGTTGCCG CCAATGATCC CTTCGACGGA CTGCCGATCC TCGCTCATGT CGCGGACCGG TCAATGTAA
|
Protein sequence | MTRAYHETGP AYPAAARPAG APRGELDLFD VIALAWSQKG FIALIFALIF AVGVGASLTL LKPSYTSDTR LLVLLDSNPT PTAAGVGDAF MLDQVMQSES ELLSSEAVRR LALETLGADV VLGEQAGLNA DGLALRELRA NFSLSREPNS SAINASFKSP SAERSALILN SIVDAYLAYR EQVLFETGIT GLEVRRAQAD EAVAAARAEL NTFLQANDLA NFEAERLTAD NLVGNLSERA AQARAERDSA MAGAAALRER LGQIPESIEL YVENGVSGVL LDRRAERASL LARYQPAAPA VQAVEREIEA IEEFIQSGAT VGEGQRRTGA NPVRQALESE LATREANAEA QASLATALEG QARSNRALVS RLRQLEPEYT RLAQNITAAE EAAGAVAALE ATASARRSPS LGAADAVRLI DRAAAPMEGS SMKKLGLIGS FVAAAGIALF LGLLRGYWLT YVRAAALPRS RPAAPVVAAQ PAGPVQDARQ PVAANDPFDG LPILAHVADR SM
|
| |