Gene Sbal223_1588 details


Gene Information

Locus tag: Sbal223_1588
Symbol:
ID: 7089989
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 1856507
End bp: 1857958
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 48%
IMG OID: 643460489
Product: sodium/proline symporter
Protein accession: YP_002357516
Protein GI: 217972765
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family; [TIGR02121] sodium/proline symporter
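
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1856507
END_BP = 1857958
STOP_CODON_LEN = 3  # assumption: the CDS includes one terminal stop codon


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_len: int) -> int:
    """Residues encoded by an unspliced CDS that includes its stop codon."""
    if cds_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (cds_len - STOP_CODON_LEN) // 3


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # 1452 483
```

Under these assumptions the result agrees with the reported Gene Length (1452 bp) and Protein Length (483 aa).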


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000493204
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000000000453194
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGACGATTG AAACCCCGAT TTTAATCACA TTTGTTGGTT ACTTAGTATT GATGATGGGC 
ATAGGTTTTT GGGCTTACCG TGCTACCGAT ACTGTTGATG ATTATATTTT AGGTGGCCGC
AAAATGGGCC CCGCTGTGAC CGCACTCAGT GTGGGTGCAT CCGATATGTC AGGTTGGCTG
TTACTGGGTT TACCCGGCGC GGTTTACTTA GGCGGCTTAG GTGAAGCTTG GATTGGCATA
GGGTTAATTT TTGGCGCTTG GCTGAACTGG CTTTTTGTTG CCAGACGACT GCGTATTTAC
ACTCAACTCG CCGATAACGC CCTCACCTTA CCGGATTTCT TCGAGAAACG TTTCCACGAT
ACCCAAGGCT ATCTAAAGCT AGTCTCCGCA ATCACTATTT TAGTGTTTTT CACTTTCTAT
GCTTCCTCAG GCATGGTCGG TGGCGCGATT CTATTTGAAA AAGTCTTTGG TCTCGATTAC
ACAGTGGCGC TGGTGATTGG CTCAGCCATC ATAGTCGGTT ACACCTTTAT TGGCGGCTTC
TTTGCCGTGT GTTGGACAGA CTTTTTCCAA GGCTGTTTGA TGCTTGTCGC CCTCTTAATC
GTCCCCTTTG CCGTTTTCTC TCACCCTGAA AGCCACGCCG GAATTGAAAC TATCGATCCT
GCGATGTTAG CCTTGGTCAG CGACAAAACC ACAGTGATAG GCATGTTGTC TTTACTCGCA
TGGGGTCTTG GCTATTTTGG TCAGCCGCAT ATTTTGTCGC GCTTTATGGC GATAGGCAGT
GCCGACGCCC TTCCCTTATC GCGCCGTATT GCCATGAGCT GGATGGTCTT ATCTTTAATT
GGCGCTTTAG CCACAGGTAT TGCCGGTTCT CTGTATTTCG CTAACGCCCC ACTAGCGAAT
TCAGAAACGG TATTTATTCA TTTAGCCCAA GCCGCGTTTA ATCCGTGGAT TGGTGGTCTA
CTCATTGCAG CCATTTTGTC GGCCATCATG AGTACTATCG ATTCACAGTT ACTGGTGTGC
TCAAGCGTGA TCACTGAAGA TTTCTACCGT AAATGGTTAC GCCCACAAGC GGATGATCGC
GAGTTGATGA TGGTCGGCCG CATGGGTGTG CTGGCGATTG CCGTGATCGC AGGCATCATT
GCCCTCAATC CTGAAAGCAG TGTATTAAGC CTTGTGAGTT ATGCATGGGC TGGCTTTGGT
GCGGCCTTTG GTCCTGTGGT CTTGTTATCG CTATTTTGGA AGCAATACAG CCGTAATGGT
GCCATAGCTA CTATTATTGT CGGCGCATTA ACGGTCGTAA TTTGGAAGCA ACTGACGGGG
GGGATTTTCG AGTTATACGA AATCCTGCCA GGATTTGTAT TCGCCACATT CGCCGGTATT
TTGGTGAGCA AATTGTCTGC ACCGAGTGAA AATGTAACAA CAGAGTTCGA ACAATTTAAG
TCTGCACTTT AG
 
Protein sequence
MTIETPILIT FVGYLVLMMG IGFWAYRATD TVDDYILGGR KMGPAVTALS VGASDMSGWL 
LLGLPGAVYL GGLGEAWIGI GLIFGAWLNW LFVARRLRIY TQLADNALTL PDFFEKRFHD
TQGYLKLVSA ITILVFFTFY ASSGMVGGAI LFEKVFGLDY TVALVIGSAI IVGYTFIGGF
FAVCWTDFFQ GCLMLVALLI VPFAVFSHPE SHAGIETIDP AMLALVSDKT TVIGMLSLLA
WGLGYFGQPH ILSRFMAIGS ADALPLSRRI AMSWMVLSLI GALATGIAGS LYFANAPLAN
SETVFIHLAQ AAFNPWIGGL LIAAILSAIM STIDSQLLVC SSVITEDFYR KWLRPQADDR
ELMMVGRMGV LAIAVIAGII ALNPESSVLS LVSYAWAGFG AAFGPVVLLS LFWKQYSRNG
AIATIIVGAL TVVIWKQLTG GIFELYEILP GFVFATFAGI LVSKLSAPSE NVTTEFEQFK
SAL
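
The gene and protein sequences are printed in space-separated blocks of ten characters. The following is a minimal sketch, not the database's own tooling, of how the blocked gene sequence could be cleaned, translated, and checked for GC content. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons; the alternative start codons that table 11 allows are not handled, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# NCBI ordering: first base varies slowest, third base fastest, over "TCAG".
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases and last 12 bases of the gene sequence in this record.
    print(translate("ATGACGATTG AAACCCCGAT TTTAATCACA"))  # MTIETPILIT
    print(translate("TCTGCACTTT AG"))                     # SAL
```

The two fragments translate to the first ten and last three residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.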