Gene Mmwyl1_1081 details

Gene Information

Locus tag: Mmwyl1_1081
Symbol:
ID: 5366295
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Marinomonas sp. MWYL1
Kingdom: Bacteria
Replicon accession: NC_009654
Strand:
Start bp: 1206743
End bp: 1208239
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 44%
IMG OID: 640803422
Product: sodium/proline symporter
Protein accession: YP_001339947
Protein GI: 152995112
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family; [TIGR02121] sodium/proline symporter
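
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes a single stop codon; the function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: one codon per residue, minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1206743, 1208239)
print(length, protein_length(length))  # 1497 498
```

Both results agree with the Gene Length and Protein Length fields of the record.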


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000772394
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000000000701401
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGATAGAAA GCAGCTATGC GATCAGCTTT ACATTCCTAG CCTATCTAGT AGTTATGCTA 
GGAATTGGGC TGTACGCTTA CAAACGTACC TCCAGTTCAG AAGACTACTT TCTTGGTGGA
CGTTCTTTAG GACCTTGGCC TACCGCCTTG TCCGCAGGCG CTTCAGACAT GAGTGGCTGG
TTACTACTTG GGTTACCTGG TTACGCTTTT GCCGCAGGCT TAGAGTCTTT CTGGATTGCT
GGTGGCCTTT TTGTTGGAAC ATGGCTTAAC TGGCTAATTT GTGCAAAACG CCTAAGAACC
TACAGCATCA AAGCGAACAA CGCACTCACG CTCCCAGACT TCCTATCCAC TCGATTCAAT
GACAAATCAA AACTTATTCA AACCATTTCA GCTCTGTTCA TTTTGCTGTT CTTCTTGTTC
TATACAAGTT CCGGTTTGGT TGCTGGCGGT AAATTGTTTG AAACAGTATT TGGCTTAGAT
TACACCTATG CAGTAATTAT CGGTACAGTA TGTGTGGTTT CCTACACGCT GTTTGGCGGC
TTTCTTGCCG TAGCTTGGAC AGACCTTGTT CAGGGCTTGA TGATGAGTGC GGCATTAGTG
ATTGTACCTC TCGTTGCTAT TGATGGCGGC TGGTCTGAGC TGTCTAGTAC TTTAATGGCA
AAAAATCCAA ACTTACTAGA TATCTGGACT AACGTAAGTG GTGAACCTCT AACTGCAATA
GGGATTATTT CTCTTGTGGC TTGGGGTCTA GGCTACTTCG GTCAACCGCA CATCCTAGCG
CGCTTTAAAG CCTCTCGCTC CAATAAAGAC ATCAAAACAG CTCGCCGTAT TGCTGTCGTT
TGGACATTCA TTTCTATGGC TGGCGCATTG CTTATTGGTC TTGTTGGTAT CGGCTTTATC
GACACAAACC TAACAGGAGA CTTGGCTGAC CCAGAGAAGA TCTTCATGAT TTTGGTCAAT
GCCGTTTTCC ACCCTGTTGT TGCTGGTATT CTTCTTGCCG CTATCCTAGC AGCGATCATG
AGTACTGCCG ACTCTCAATT ATTAGTATCT TCTTCTGCGT TAGCTGAAGA TTTTTACAAG
CAACTATTCA ATAAAAAAGC GACTCAAAAA CAAATCGTCA ACGTTGGACG CTTTGCGGTT
GTGGCTATTT CTATTATTGC GTTATTACTT GCATTGAACC CAGAAAGCTC AGTACTCGGC
CTAGTGTCTT ATGCTTGGGC TGGTTTTGGC GCTGCATTTG GCCCAGCTAT CCTTCTTAGC
TTGTTCTGGC GCAACATGAA CCGTAACGGT GCACTAGCAG GCATCATCAT TGGTGGTGTG
ACTGTTGTAG TATGGAAACA GTTAACTGGC GGCATTTTCG ATCTTTATGA AATTGTTCCT
GGCTTCTTGT TCTCGACTAT CGCTATCTTT GCGGTAAGTC TAGCAACAGG TGCTCCAGAA
GAGTCTGTAA CAGAATCTTT TGATGAATAC GAAAAAGCAC TAGATACAAT GGACTAA
 
Protein sequence
MIESSYAISF TFLAYLVVML GIGLYAYKRT SSSEDYFLGG RSLGPWPTAL SAGASDMSGW 
LLLGLPGYAF AAGLESFWIA GGLFVGTWLN WLICAKRLRT YSIKANNALT LPDFLSTRFN
DKSKLIQTIS ALFILLFFLF YTSSGLVAGG KLFETVFGLD YTYAVIIGTV CVVSYTLFGG
FLAVAWTDLV QGLMMSAALV IVPLVAIDGG WSELSSTLMA KNPNLLDIWT NVSGEPLTAI
GIISLVAWGL GYFGQPHILA RFKASRSNKD IKTARRIAVV WTFISMAGAL LIGLVGIGFI
DTNLTGDLAD PEKIFMILVN AVFHPVVAGI LLAAILAAIM STADSQLLVS SSALAEDFYK
QLFNKKATQK QIVNVGRFAV VAISIIALLL ALNPESSVLG LVSYAWAGFG AAFGPAILLS
LFWRNMNRNG ALAGIIIGGV TVVVWKQLTG GIFDLYEIVP GFLFSTIAIF AVSLATGAPE
ESVTESFDEY EKALDTMD
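
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, not the database's own pipeline. It assumes the standard codon assignments, which translation table 11 shares with table 1 for elongation (the two tables differ in which codons may act as start codons). The names `CODON_TABLE`, `clean` and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGATAGAAA GCAGCTATGC GATCAGCTTT"))  # MIESSYAISF
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence through `translate` in the same way allows the whole translation to be compared against the listed protein.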