Gene Csal_1068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1068 
Symbol 
ID4028083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1207503 
End bp1209245 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content66% 
IMG OID637966246 
ProductNa+/solute symporter 
Protein accessionYP_573124 
Protein GI92113196 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCTATA CCGCTGCCAT CCTCTTGAGC CTCGTCGGCT ACTTCATTAT CGGCAATCTG 
GCCGGCCGCC GGGTCAAGAA TCTCGACGAC TACCTGGTTG CCGGGCGTAA CGCCCCCACG
CTGATGATCC TCGGCACCCT GGTCGCCAGC TACCTGAGTA CCAGTGCCTT TCTGGGCGAG
ACCGGTTTCG CCTACCAGGG GTATCCCTTC GTCATGCTGA TCTTCGCCGC CATCAGCACC
TCGGGCTGTC TGCTCGGGGC GCTGGTGTTC GGCCGTTACC TGCGCCGCAG CGAGGCGCTG
ACGGTGCCTG AGTTCTTCGG CAAGCGCTTC GCTTCGAAAC GCGTGCAGAT GATCGCCGGG
CTCACCACCG TATTCGGGCT GGGGGCTTAC TTGCTGGGCG TGATGCAGGG GGCCGGCATC
ATGTTCGAGG AGCTCACCGG CTTGCCCTAT TGGTCCGGGT TGGTACTGGT ATGGCTCACC
TACACCGGGT TCATTCTCTA CTCCGGCTCC CAGGGCGTGG TGCTCACCGA CACCGTGATG
TTCGTGATTT TCTCCATCGC CGCGGTGGGC GGCTCACTCT ACATCATCAG CGATCTCGGC
GGCTGGTCGG GAGTGATCGA GGGGCTGCTG GCGCAGTCCG ACAAGCCCGG CATCGCGCTA
TGGCACGGCG TGATCGAGGG TGAGGGAGCG ACCTTCTCAT CACCCGGTGA AGCGCTTATC
TGGGGGCTGA CGCTGGGGCT GGTATGGGCC ACGGTGCTGG CGGCCAGCCC CTGGCAGGCG
AGCCGCTATC TGATGGCGCG CAACGAGCAC GTGGTGATGC GCTCGGCCAT GCTCACCGCC
GTGGTGCTGG CGGTCTTCTA CGCGCTGATG ATGATGACCG GCGCGGCGAT CAACCTCTAC
GACCCGACGC TCGACGGCGA GCGTGCCATG GTGTGGGCGG CGCTCAACGT GCTGCCGAGC
TGGCTCGGCA TCATCATCCT CACTGGCGTG TTTGCCGCCG GGCTCTCCTC GTGCTCGACC
TTTCTGTCGA TCATCGGCTT CAGTCTCGCC CACGATATCC TGCCGTCTTC GCGCAGCCAG
CGCAGTGCCA TGCGGGCTAG CCGCATCGGC GTGCTGGTGG CCGGGCTGGT AGCGCTGCTG
CTGGCACTCT TCCAGCCGCC GGCGGTGATG GCAGTGGTAT GGTTCGCCGC CACGCTGTTC
GCTTCCTCCT GGGGTCCGGT GGCGCTGATG AGCATCTGGA GCCGTCGGAT TACCGCCGCT
GGTGCCGGCT GGGGCCTGGG GGTCGGCTTC GTCGGCAATC TGATCTTGTC GCTGATGGTG
CAGGCGGAAT GGGTCTCGCT GCCCGTTTAT CTCCACCCGG TGATACTCAG CACCCTGGTG
GCGCTGGTGG CGATCGTGAT CGTCTCCAGG TTGACCCACG TCAGCCAGGC AGAGAGTGAG
TACCGCGCCT TTCTGCACCG TAGCCCGGTG GAGAACGAGC GCAGTTGGGC GGCGGTCAGT
CGACGGGTGG CCGGGGCGAC CATGCTGGCC GGCGTCGCGG TGGGAGCCTT TCTATGGTGG
CAGTACGCCG GCTCGCTGGC GGCGCTGGCC GGACGTTTCG GCGTGGCCGA TAGCGCGGTG
ACCGGCGCCT ACCTGCTGGC GACGGGATGT GGCGTCATGC TGGTGATCGC CGGTGCGGTG
GGCTTCCGCC TGGTGAGCGC ACGCAGCACG AAAGTCGGCA GCGCCCACAT GGCGAAGCCC
TGA
 
Protein sequence
MIYTAAILLS LVGYFIIGNL AGRRVKNLDD YLVAGRNAPT LMILGTLVAS YLSTSAFLGE 
TGFAYQGYPF VMLIFAAIST SGCLLGALVF GRYLRRSEAL TVPEFFGKRF ASKRVQMIAG
LTTVFGLGAY LLGVMQGAGI MFEELTGLPY WSGLVLVWLT YTGFILYSGS QGVVLTDTVM
FVIFSIAAVG GSLYIISDLG GWSGVIEGLL AQSDKPGIAL WHGVIEGEGA TFSSPGEALI
WGLTLGLVWA TVLAASPWQA SRYLMARNEH VVMRSAMLTA VVLAVFYALM MMTGAAINLY
DPTLDGERAM VWAALNVLPS WLGIIILTGV FAAGLSSCST FLSIIGFSLA HDILPSSRSQ
RSAMRASRIG VLVAGLVALL LALFQPPAVM AVVWFAATLF ASSWGPVALM SIWSRRITAA
GAGWGLGVGF VGNLILSLMV QAEWVSLPVY LHPVILSTLV ALVAIVIVSR LTHVSQAESE
YRAFLHRSPV ENERSWAAVS RRVAGATMLA GVAVGAFLWW QYAGSLAALA GRFGVADSAV
TGAYLLATGC GVMLVIAGAV GFRLVSARST KVGSAHMAKP