Gene Sbal223_4236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_4236 
Symbol 
ID7089177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp5036749 
End bp5038245 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content47% 
IMG OID643463110 
Productsodium/proline symporter 
Protein accessionYP_002360125 
Protein GI217975374 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family
[TIGR02121] sodium/proline symporter 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones73 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTACC TCGCTTACAT TTCCTTAACT ATCTACTTTA TCGTTATGCT CGCCATTGGC 
TTATTCGCTT ACAAAAAATC CACCAGCGAT GTGTCAGGCT ATATTCTTGG CGGCCGTAAG
GTAAGTCCAC AGGTCACCGC ACTCTCAGCG GGTGCGTCTG ACATGAGTGG CTGGATGCTA
ATGGGTCTTC CCGGTGCTAT GTTTTTGGTT GGTTTCGAAA CCATTTACAT CGCACTTGGG
CTATTGATTG GTGCTCTAAT CAATTACTTG GTCGTCGCAC CAAAACTTAG GGTTTATACC
GAGGTGGCGG ACAACGCCCT CACTATCCCG GAGTTTTTTG CTAAACGCTT CGGCAATGCC
AACGGCAGTA TCCGTATTAT TTCGGCCGTG ATCATTGTGA TCTTCTTTAC CTTGTATACC
TCAGCGGGTT TAGTGGCTGG CGGTAAATTG TTTGAGTCAG CATTTGGCCT GAACTACGAC
ATAGGTTTAG TCGTCACACT AGCGGTTGTG GTTTCTTATA CCCTTTTAGG CGGATTCTTG
GCAGTGAGCC TCACTGACTT TGTCCAAGGC TGTATTATGT TTGTCGCCTT AGTCTTAGTC
CCTGTCGTGG CATATCAAGA ATTTACCAGC GCTGACACTA TGTTGAACTT TGCATATCAG
TCGATTCCAC ACTTCACCGA TGCGATGCAA AACGTCACTA TGTTAGGGCT TATCTCAAGC
TTATCTTGGG GACTCGGTTA CTTTGGCCAA CCGCATATTA TTGTGCGTTT TATGGCGATT
CGTTCAGTAG CTGACATTAA AACCGCCAGA CACATAGGTA TGGGCTGGAT GACAGTGACC
ATTGTGGGTG CACTGGCAAC AGGTCTTGTC GGTATCGCTT ACGCCAATAA ATTTGGTATG
AAGCTGACCG ATCCTGAAAC TATCTTTATC GTGTTTTCTG AGTTGTTGTT CCATCCGATC
ATTAGTGGCT TCTTACTGGC CGCCATTCTG GCCGCGATCA TGAGCACCAT TTCATCGCAG
CTATTAGTGT CATCAAGCTC ACTCACAGAA GATATCTATC GCGTTATTTC GAAGAAAGAA
TCTACCGAGA AAGACATGGT GAAAATGGGA CGCTTTGGCG TAGCAGGCGT GGCAATTGTC
GCGAGTCTAC TGGCGCTCGA CCGCTCTAGC AGTGTGCTAT CACTGGTGAG TAATGCATGG
GCAGGGTTCG GTGCCGCATT TGGTCCGTTA GTTTTGTTTA GCTTGTATAA AGCGAACCTA
ACCCACAAAG CCGCGATTGC CGGCATAGTG TCAGGCGCTG CTACAGTGTT GTTTTGGATT
TATGCCCCGG TATTGGCAGA TGGACAAGCC TTAACCACAG TGGTGTATGA AATGATCCCC
GGCTTTGCCG TGAGCAGTGT GGTGATTTGG ATAGTCTCGT TACTCGATAC TGATCCCTGC
GCCAAAACCA CTAAATTATT CCATAAGGCT GGCCGTGTTT TAGCCGAAGA CAGATGA
 
Protein sequence
MNYLAYISLT IYFIVMLAIG LFAYKKSTSD VSGYILGGRK VSPQVTALSA GASDMSGWML 
MGLPGAMFLV GFETIYIALG LLIGALINYL VVAPKLRVYT EVADNALTIP EFFAKRFGNA
NGSIRIISAV IIVIFFTLYT SAGLVAGGKL FESAFGLNYD IGLVVTLAVV VSYTLLGGFL
AVSLTDFVQG CIMFVALVLV PVVAYQEFTS ADTMLNFAYQ SIPHFTDAMQ NVTMLGLISS
LSWGLGYFGQ PHIIVRFMAI RSVADIKTAR HIGMGWMTVT IVGALATGLV GIAYANKFGM
KLTDPETIFI VFSELLFHPI ISGFLLAAIL AAIMSTISSQ LLVSSSSLTE DIYRVISKKE
STEKDMVKMG RFGVAGVAIV ASLLALDRSS SVLSLVSNAW AGFGAAFGPL VLFSLYKANL
THKAAIAGIV SGAATVLFWI YAPVLADGQA LTTVVYEMIP GFAVSSVVIW IVSLLDTDPC
AKTTKLFHKA GRVLAEDR