Gene Mmar10_2043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_2043 
Symbol 
ID4286806 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp2222785 
End bp2224002 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content69% 
IMG OID638141544 
Productmajor facilitator transporter 
Protein accessionYP_757273 
Protein GI114570593 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTGGCA GCGCCAGCCT TCCCGGCAAG GCGACCGCAC TTGCCCTGAT CGCCACATCG 
CAAGTCCTCG CCCTGTCCAT CTGGTTCGCC GGAGCCGCCG CCCTGCCCGC CTTGATGGCC
GCTACCGATA TCGGGCCCAT GCGACAGGCC GCCCTGACCA GTTCGGTCCA GCTCGGCTTT
GTCATCGGCG CCGTGCTCAG CGCCGTGACC GGGCTTGCCG ACCGCTTGCC GCCGCAGCGC
CTGTTCGCAC TCGGCAGTAT CATTGCCGCC CTGGCCAATA TCGCCGCCCT GCAGCTGGAA
CCAGGCGGCT GGAGCCTGAT CGCCAGTCGG GCTCTGGCCG GTGCCGCTCT GGCCCTGGTC
TATCCGGTCG GCATGAAGCT GGCGGCCAGC TGGGCACGAG GCGATGCGGG CTTTCTGGTC
GGGTTGCTGG TCGGTGCGCT GACCCTGGGC TCGGCCCTGC CCTTCATGTT CAACCTGGCT
GGCGACATCG CCGACTGGCG CCTGCCCTTC ATGGCGTCAG CGATGGCCGC CCTGATCGCG
GCCAGCCTGA TCCTGCTGGC CCGCGGCGGG CCGGGCCTGC GCCCGGCCGC CCGGCTGGAC
CCCGGCGCAT TCACACTGTC TGTCCGCGAC CCGGCCTTGC GTCTCGTCAA TCTGGGCTAT
CTCGGTCACA TGTGGGAGCT GTACGCCATG TGGGCCTGGA TCGGCCCGTT TGCTCACGCC
TATTGGACGC GGCTGGGCGG TGATGCCCGA CTGGGTGACC TGACCGCCTT TGCGGTCGTC
GCCAGCGGCG CCATCGCCTG TCTCGCCGCC GGCCGCCTGG CTGACCGGTT CGGCCGCACA
CGCATCACCA TCATCGCCCT GGGCATTTCC GGCAGTTGCG CCCTGCTGGT CGGCCCCGCC
TTCGCGCTGG CGCCCTGGCT GATGATCCCG CTTTTGATTG TCTGGGGCAT GGCGGTGATC
GCCGACAGCG CCCAGTTCTC CGCCGCCATC ACCGAGCTGG CGCCGCCGGA ACGGACCGGC
ACCTTGCTGA CCATCCAGAC GGCGATGGGC TTTACCCTGA CCGTGATCAT GATCCAGGCC
TTGGGCTATT GGATCGAACT TGTCGGTTGG GCATGGGCCT TCACGCCGTT GGCGATCGGA
CCGGCTGTCG GAGTTTGGGC GATGGCCCGC CTGCGCGCCC GACCGGAAGC GGCCAGGCTC
GCAGGCGGCA ATCGCTGA
 
Protein sequence
MSGSASLPGK ATALALIATS QVLALSIWFA GAAALPALMA ATDIGPMRQA ALTSSVQLGF 
VIGAVLSAVT GLADRLPPQR LFALGSIIAA LANIAALQLE PGGWSLIASR ALAGAALALV
YPVGMKLAAS WARGDAGFLV GLLVGALTLG SALPFMFNLA GDIADWRLPF MASAMAALIA
ASLILLARGG PGLRPAARLD PGAFTLSVRD PALRLVNLGY LGHMWELYAM WAWIGPFAHA
YWTRLGGDAR LGDLTAFAVV ASGAIACLAA GRLADRFGRT RITIIALGIS GSCALLVGPA
FALAPWLMIP LLIVWGMAVI ADSAQFSAAI TELAPPERTG TLLTIQTAMG FTLTVIMIQA
LGYWIELVGW AWAFTPLAIG PAVGVWAMAR LRARPEAARL AGGNR