Gene Csal_1248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1248 
Symbol 
ID4027626 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1424136 
End bp1425626 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content63% 
IMG OID637966426 
ProductSodium/proline symporter 
Protein accessionYP_573302 
Protein GI92113374 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family
[TIGR02121] sodium/proline symporter 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.126391 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTATTG GTGTCTGGAT AAGCCTTTTC GCCTACTTTG CGCTCATGAT CGGCATCGGC 
CTTTATGCCA TGCGCAAGTC CACGTCCACA TCCGAAGACT ACATACTGGG TGGCCGTACC
CTCAGTCCCA AAGTGGCGGC GCTGTCGGCC GGCGCTTCGG ACATGAGCGG CTGGCTGCTG
CTGGGGCTGC CCGGGGCAAT GTTCGTCTCC GGCCTGGGCT CGGCATGGAT CGGCATCGGC
CTGCTGGTGG GGGCTTTCTT CAACTGGCTC CTGGTCGCGC CACGACTGCG TGAACAGACG
GTTCACTATG GCAACGCGAT CACCATCCCC TCCTTCCTGG CCAACCGGTT CCCGACCCAG
GCCTTGTCGT TGAGGACGGT TTCCGCCATC GTCATCGTCG TCTTCTTCGC GGTCTACACG
GCGTCCGGCC TGGTGGCAGG CGGCAAGCTG TTCGAAAGCG CGTTTTCCGG GGTCATCAAC
ATCGGCGGGC TCAGCGATTA CGCGGTAGGC ATCATCATCA CCCTGGGCGT GGTGCTGGTC
TACACGGTCG TCGGCGGCTT CCTGGCCGTG AGCATGACGG ACTTCGTGCA GGGCTGCATC
ATGATGCTGG CGCTGGTGAT CATGCCGGCA GTCGTGATCT TCGGCGAAGG CGGCGGCGGC
TTCTCCCAGG CCTCGCAGAC GCTGAACGAA GTCGACCCCA CGCTCTTGTC ATGGACGGAC
GGCCTGACCT TCATCGGCTG GTTGTCCGCC GTGACCTGGG GCCTGGGCTA TTTCGGTCAG
CCGCACATCA TCGTGCGCTT CATGGCCATC CGGTCGCTGA AGGACGTGCC CGTCGCCCGC
AACATCGGCA TGAGCTGGAT GCTCATTTCC CTGATCGGCG CGATCTCGCT CGGGATCTTC
GGCCGCGCCT ACGCCATCCG CAATGGCATG GACGTAGAGG ATCCGGAAAC GATCTTCATC
ATCCTGGCGG ACCTGCTGTT CCACCCGCTG ATCACCGGCT TCCTCTACGC CGCATTGCTG
GCCGCGATCA TGAGTACCGT GTCCAGCCAG TTGCTGGTGG CCTCCTCGTC GCTGACCGAA
GACTTCTACC GCCTGTTCCT GCGCAAGGAA GCCACGGAGA AAGAGACCGT CGGCATCGGC
CGGGTCAGTG TCGTGCTGGT CGGCCTGGTG GCGGCCGTGA TTGCATCCGA TCCGGACTCT
CAGGTGCTGG GACTGGTGAG CAACGCCTGG GCAGGTTTCG GCGCGGCATT CGGTCCGCTG
ATCATCCTGT CGCTGATGTG GTCGCGCACG AACGGTGCCG GCGCCATCGC GGGCATGGTC
GTGGGTGCCG CTACGGTCAT GATCTGGATT TCTCTGGGCT GGAACGCATC GTTCATGGGC
GGTCCCGGCG TTTACGAGAT CATCCCCGGC TTCATCGCTG CCTTCATCGC GATCCTGGTG
GTGAGCAGCC TGACCAGCGA CGCCGGAGAA TACAAGCACA TCTCCCGCTG A
 
Protein sequence
MAIGVWISLF AYFALMIGIG LYAMRKSTST SEDYILGGRT LSPKVAALSA GASDMSGWLL 
LGLPGAMFVS GLGSAWIGIG LLVGAFFNWL LVAPRLREQT VHYGNAITIP SFLANRFPTQ
ALSLRTVSAI VIVVFFAVYT ASGLVAGGKL FESAFSGVIN IGGLSDYAVG IIITLGVVLV
YTVVGGFLAV SMTDFVQGCI MMLALVIMPA VVIFGEGGGG FSQASQTLNE VDPTLLSWTD
GLTFIGWLSA VTWGLGYFGQ PHIIVRFMAI RSLKDVPVAR NIGMSWMLIS LIGAISLGIF
GRAYAIRNGM DVEDPETIFI ILADLLFHPL ITGFLYAALL AAIMSTVSSQ LLVASSSLTE
DFYRLFLRKE ATEKETVGIG RVSVVLVGLV AAVIASDPDS QVLGLVSNAW AGFGAAFGPL
IILSLMWSRT NGAGAIAGMV VGAATVMIWI SLGWNASFMG GPGVYEIIPG FIAAFIAILV
VSSLTSDAGE YKHISR