Gene Mmar10_2670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_2670 
Symbol 
ID4286036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp2916808 
End bp2918433 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content62% 
IMG OID638142169 
Productsodium/proline symporter 
Protein accessionYP_757894 
Protein GI114571214 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family
[TIGR02121] sodium/proline symporter 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATTG AAGCTCTTAT CTCGCTGGCA CTCTACTTCA TCCTGATGAT GGCGATCGGG 
CTCTATGCCT ATCGCAAGTC CACGGATGAC GTGTCCGGCT ACATGCTGGG CGGACGCCAG
CTGCACCCAG CGGTGGCGGC CCTGTCGGCC GGCGCGTCCG ACATGTCGGG CTGGATGCTG
ATGGGCCTGC CGGGTGCGAT CCTGCTGACC GGCATGTCGG AAAGCTGGAT TGCCATCGGC
CTGATGATCG GCGCCTATTT CAACTACAAG CTCGTGGCCC CGCGGCTGCG CGTCTATACC
GAGGTGGCCG ACGACGCGAT CACGATCCCG GACTATTTCG AGAAGCGTTT CGCCGACAAT
TCGCGGCTTC TTCGCGTGAT CTCGGCCATC GTCATCGTGT TGTTTTTCAC CATTTATACG
TCAGCCGGCG TGGTTGCGGG CGGCAAGCTG TTCGAGGCCT CCTTCGGTCT CAACTACCAG
ATCGGCCTGT TCGTCACCGC CGGAGTGGTC GTGGCCTATA CATTGTTCGG TGGTTTCCTC
GCGGTCAGCC TGACCGACTT CGTGCAGGGC GTCATCATGT TCCTGGCGCT GGTACTGGTG
CCGGTGGTGA CCATCATGCA GCTGGGCGGT TTCGGCCCCA CGATCGACGC GCTGAGCACG
TTGAGCGTCG ATGTTGGTGG CTGGGACAAA CCCTATCTGT CGATGATCCC GCAAGAGAGC
CTCGGCCTCG CGGTCATCGG CATCATTTCG ACCATGGCCT GGGGCCTGGG CTATTTCGGC
CAGCCGCACA TCATCGTGCG CTTCATGGCC GTGAAGTCGC TCAAGGACGT CAAGGTCATG
CGCCGGATCG GCATGAGCTG GATGCTGATC ACCGTGGTCG GCGCCCTGCT GACCGGGGCC
GCCGGCCTCG CCTATGTGAA GAGCCAGGGC ATCCAGACCG CGGACGGGCT CAATGTTGCC
CGGGTCGTGG AAGCGCCTGG CGATGCGACC ACCACGATGG GGGATCTGGA AAACATCGAC
ACAACCGGCG CCTTTGCTGA CGGCGATGTG GTGCTCGGCG ACCAGGAAAC CATCTTCATC
CTGCTCTCCC AGGTCATTTT CCACCCCTAT ATCGCCGGCT TCCTCCTGGC CGCCATCCTG
GCCGCGATCA TGAGTACGAT CTCGTCCCAG CTCCTGGTCT CATCAAGTTC GCTGACCGAG
GACTTTTACA AGGTCTTCCT GCGACGCGGT GCCAGCCAGA AAGAGCTGGT TCTGGTCGGA
CGCCTGTCGG TGCTGGCGGT ATCGCTGGTG GCGATAGGAC TGGCCTTCAA TCCGGACAGC
AACATACTTG GCCTCGTATC CAACGCCTGG GCCGGCTTTG GTGCCGCCTT CGGACCGGTC
ATCCTGGTCA GCCTGTTCTG GCGCGGCATG ACCCGCCTGG GTGCCATCGC CGGCATGATT
GCCGGTGCGG TCACTGTGCT GGTGTGGATT TACGGCCTGC AGCTGTCGGG CGTTATGTAT
GAAATCGTGC CAGGCTTCAT TGCCTGCCTG GTGACGCTCT ACATCGTCTC CAAGGCAACC
GCCAAACCGG GACCGGAGGT CACCGACTAT TTCGACGAGA TGCACACGCG CGTCAAAGCC
GGATAG
 
Protein sequence
MDIEALISLA LYFILMMAIG LYAYRKSTDD VSGYMLGGRQ LHPAVAALSA GASDMSGWML 
MGLPGAILLT GMSESWIAIG LMIGAYFNYK LVAPRLRVYT EVADDAITIP DYFEKRFADN
SRLLRVISAI VIVLFFTIYT SAGVVAGGKL FEASFGLNYQ IGLFVTAGVV VAYTLFGGFL
AVSLTDFVQG VIMFLALVLV PVVTIMQLGG FGPTIDALST LSVDVGGWDK PYLSMIPQES
LGLAVIGIIS TMAWGLGYFG QPHIIVRFMA VKSLKDVKVM RRIGMSWMLI TVVGALLTGA
AGLAYVKSQG IQTADGLNVA RVVEAPGDAT TTMGDLENID TTGAFADGDV VLGDQETIFI
LLSQVIFHPY IAGFLLAAIL AAIMSTISSQ LLVSSSSLTE DFYKVFLRRG ASQKELVLVG
RLSVLAVSLV AIGLAFNPDS NILGLVSNAW AGFGAAFGPV ILVSLFWRGM TRLGAIAGMI
AGAVTVLVWI YGLQLSGVMY EIVPGFIACL VTLYIVSKAT AKPGPEVTDY FDEMHTRVKA
G