Gene Shewmr4_1398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1398 
Symbol 
ID4251417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp1627830 
End bp1629281 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content50% 
IMG OID638117997 
Productsodium/proline symporter 
Protein accessionYP_733533 
Protein GI113969740 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family
[TIGR02121] sodium/proline symporter 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0934266 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000000241033 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGACGATTG AAACCCCGAT TTTAATCACA TTTGTTGGCT ATTTAGTATT GATGATGGGC 
ATAGGATTTT GGGCTTACCG TGCCACCGAT ACTGTTGATG ATTATATTTT AGGCGGCCGT
AAAATGGGTC CCGCCGTCAC CGCGCTTAGC GTGGGCGCCT CCGATATGTC AGGCTGGTTG
CTCTTAGGCT TACCTGGCGC CGTTTACTTA GGCGGCTTAG GCGAAGCATG GATTGGGATT
GGCTTAGTGG TCGGCGCTTG GTTGAACTGG TTGTTTGTGG CTAAACGCCT GCGCATTTAT
ACCCAATTAG CAGACAACGC CCTCACCCTG CCCGACTTCT TCGAGAAACG TTTTCACGAT
AAACAGGGTT ACCTCAAGCT TGTCTCTGCG GTCACAATTC TGGTATTTTT CACCTTCTAT
GCTTCCTCTG GCATGGTGGG CGGCGCCATT CTGTTCGAAA AAGTCTTCGG CCTCGATTAT
AACGTTGCCT TAGTGATTGG CTCGGCCATC ATTGTGGGTT ACACCTTTAT TGGCGGTTTT
TTTGCCGTGA GTTGGACCGA CTTCTTCCAA GGCTGTTTAA TGCTAATAGC CCTGCTGATC
ATCCCCTTTG CGGTGTTCTC GCACCCAGAG AGCCATGAGG GAATTGAGTC TATCGACCCA
GCCATGTTGG CACTGATCAG TGACAAAACC ACAGTGATTG GCTTGTTATC ACTGCTCGCA
TGGGGGCTCG GTTATTTCGG TCAGCCCCAT ATTTTGTCGC GCTTTATGGC CATTGGCAGC
GCCGATGCAC TGCCGCTGTC ACGCCGTATC GCCATGAGCT GGATGATGTT ATCTCTCATC
GGCGCGTTAG CCACAGGCTT AGCGGGTTCA CTGTATTTTG CCAATCAACC CTTGGCCAAT
GCCGAAACCG TGTTTATTCA TTTAGCCCAA GCCGCCTTCA ATCCTTGGAT TGGCGGATTA
CTGATCGCCG CGATTCTGTC GGCCATTATG AGTACCATTG ATTCGCAGTT GTTGGTGTGT
TCGAGCGTGA TCACCGAGGA CTTTTACCGC AAGTGGTTGC GCCCTCAAGC CGACGATCGC
GAACTGATGA TGGTAGGTCG TATGGGCGTG CTGGCCATTG CCGTGATCGC AGGGATCATT
GCCCTGAATC CTGAAAGCAG TGTATTAAGC TTAGTCAGTT ATGCGTGGGC AGGTTTTGGT
GCCGCCTTTG GCCCTGTCGT ACTGCTATCA CTCTTCTGGA AGCAATACAG CCGTAACGGT
GCAATTGCTA CTATTATTGT CGGCGCATTA ACCGTGGTGA TATGGAAGCA GCTCACAGGT
GGCATTTTCG ACCTATACGA AATCTTACCT GGATTTGTGT TTGCAATTAT TGCCGGTGTC
ATCGTCAGTA AAATGTCACG CCCTGCCGAA GCCATTACCG CAGAATTTGA GCAATTTAAA
TCCGCGTTAT AA
 
Protein sequence
MTIETPILIT FVGYLVLMMG IGFWAYRATD TVDDYILGGR KMGPAVTALS VGASDMSGWL 
LLGLPGAVYL GGLGEAWIGI GLVVGAWLNW LFVAKRLRIY TQLADNALTL PDFFEKRFHD
KQGYLKLVSA VTILVFFTFY ASSGMVGGAI LFEKVFGLDY NVALVIGSAI IVGYTFIGGF
FAVSWTDFFQ GCLMLIALLI IPFAVFSHPE SHEGIESIDP AMLALISDKT TVIGLLSLLA
WGLGYFGQPH ILSRFMAIGS ADALPLSRRI AMSWMMLSLI GALATGLAGS LYFANQPLAN
AETVFIHLAQ AAFNPWIGGL LIAAILSAIM STIDSQLLVC SSVITEDFYR KWLRPQADDR
ELMMVGRMGV LAIAVIAGII ALNPESSVLS LVSYAWAGFG AAFGPVVLLS LFWKQYSRNG
AIATIIVGAL TVVIWKQLTG GIFDLYEILP GFVFAIIAGV IVSKMSRPAE AITAEFEQFK
SAL