Gene Mmar10_1690 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_1690 
Symbol 
ID4285693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp1857666 
End bp1859204 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content66% 
IMG OID638141178 
Productlipopolysaccharide biosynthesis 
Protein accessionYP_756920 
Protein GI114570240 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000356415 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.0111371 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGCG CATATCATGA AACAGGCCCC GCTTACCCGG CGGCCGCCCG GCCGGCTGGC 
GCCCCGAGAG GGGAGTTGGA CCTGTTCGAC GTGATCGCGC TGGCGTGGTC TCAGAAAGGC
TTTATCGCGT TGATTTTCGC GCTTATTTTC GCCGTTGGCG TGGGCGCGTC ACTGACCTTG
CTGAAGCCGA GTTATACATC CGATACGCGG CTTCTCGTTC TGCTTGATTC GAATCCCACT
CCGACAGCGG CCGGTGTCGG TGACGCCTTC ATGCTTGATC AGGTCATGCA GTCGGAGAGC
GAATTGCTCA GTTCCGAGGC GGTGCGCCGG CTGGCGCTGG AGACGCTGGG CGCCGATGTG
GTGCTGGGCG AGCAAGCCGG CCTCAATGCG GACGGGCTGG CATTGCGGGA GTTGCGCGCG
AATTTCAGCC TGTCGCGGGA GCCCAATTCA TCGGCCATCA ATGCCAGCTT CAAATCCCCG
TCGGCGGAAA GGTCGGCGCT GATCCTGAAC AGTATTGTCG ATGCCTATCT GGCCTATCGC
GAGCAGGTCC TGTTCGAGAC CGGCATTACC GGACTGGAAG TCCGGCGCGC CCAGGCTGAT
GAAGCGGTCG CGGCGGCCCG GGCGGAATTG AATACCTTCC TGCAAGCCAA TGATCTCGCC
AATTTCGAGG CCGAACGCCT GACCGCCGAC AATCTGGTTG GCAATCTCTC CGAGCGCGCG
GCCCAGGCCC GGGCTGAACG GGATTCGGCG ATGGCCGGTG CCGCGGCCCT GCGTGAACGA
CTGGGCCAGA TCCCCGAGAG CATCGAGCTC TATGTGGAAA ACGGCGTGTC CGGTGTCCTG
CTCGACCGCC GGGCCGAGCG GGCGAGCCTG CTGGCGCGCT ACCAACCCGC CGCTCCGGCT
GTGCAGGCCG TCGAGCGCGA GATCGAGGCC ATCGAGGAAT TCATCCAGTC CGGCGCCACG
GTCGGCGAGG GGCAGCGACG TACCGGTGCC AATCCGGTGC GGCAAGCGCT GGAATCAGAG
CTGGCAACCC GTGAGGCCAA TGCCGAGGCA CAGGCCTCGC TGGCGACGGC CCTGGAGGGC
CAGGCGCGCA GCAATCGTGC GCTCGTCTCC CGCCTGCGCC AGTTGGAACC GGAATATACC
CGACTGGCCC AGAACATCAC CGCTGCCGAA GAAGCTGCCG GCGCCGTTGC CGCGCTTGAG
GCCACCGCTT CGGCCCGCCG TTCGCCGAGC CTGGGCGCGG CCGACGCCGT CCGCCTGATC
GACCGGGCTG CCGCGCCCAT GGAAGGCAGC TCGATGAAGA AGCTCGGCCT GATCGGGTCC
TTCGTCGCGG CTGCGGGGAT CGCGCTCTTT CTGGGATTGT TGCGCGGCTA CTGGCTGACC
TATGTGCGTG CCGCTGCCCT GCCACGTTCG CGCCCGGCGG CCCCGGTCGT TGCGGCCCAA
CCGGCCGGGC CGGTTCAGGA CGCGCGGCAG CCTGTTGCCG CCAATGATCC CTTCGACGGA
CTGCCGATCC TCGCTCATGT CGCGGACCGG TCAATGTAA
 
Protein sequence
MTRAYHETGP AYPAAARPAG APRGELDLFD VIALAWSQKG FIALIFALIF AVGVGASLTL 
LKPSYTSDTR LLVLLDSNPT PTAAGVGDAF MLDQVMQSES ELLSSEAVRR LALETLGADV
VLGEQAGLNA DGLALRELRA NFSLSREPNS SAINASFKSP SAERSALILN SIVDAYLAYR
EQVLFETGIT GLEVRRAQAD EAVAAARAEL NTFLQANDLA NFEAERLTAD NLVGNLSERA
AQARAERDSA MAGAAALRER LGQIPESIEL YVENGVSGVL LDRRAERASL LARYQPAAPA
VQAVEREIEA IEEFIQSGAT VGEGQRRTGA NPVRQALESE LATREANAEA QASLATALEG
QARSNRALVS RLRQLEPEYT RLAQNITAAE EAAGAVAALE ATASARRSPS LGAADAVRLI
DRAAAPMEGS SMKKLGLIGS FVAAAGIALF LGLLRGYWLT YVRAAALPRS RPAAPVVAAQ
PAGPVQDARQ PVAANDPFDG LPILAHVADR SM