Gene Csal_1670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1670 
Symbol 
ID4028682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1899909 
End bp1901057 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content69% 
IMG OID637966859 
Productbenzoate membrane transport protein 
Protein accessionYP_573722 
Protein GI92113794 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTACCG TTCCTGTTTC CCCACGCGGG GCCGCCTTTG TCTTCAATGG CCGCGAGCTC 
AATGGCGCCC TGGGCGATCT CGGCACCCTG CTGCCGCTGC TGCTGGGCGT GCTGGCGGTG
GGCGGCGTGT CGCCGGGGCC GGTGCTGTTC GGTTTCGCGG CGTTCTATCT CGTCACCGCG
TTCTACTATC GCCTGCCGAT TCCCGTACAG CCCATGAAGG CCGTCGCCGC GATGCTGCTC
ACCGTGGGCA TGTCGGCCTC CGAACTGGCC ATCGGCGGTA TGATCATCGG CCTGGTGATG
CTGGTGCTCG GGCTCACGGG ATGGATAGGC CACCTGCGCC GTTTGATTCC GCAATCCGTG
CTGGCCGGGC TGCAACTGGG GCTGGGCGTG ATGCTGGCGC TGGCCAGTCT CTCGCTGATG
GCCGAGCAAG CCTGGCTGGC GGGGGTGACC CTGGCGGTGC TGCTGGTGGC GATGCGCATA
CCGGGGTGCC CCTCGGTACT GCTCGCCTTG CTGGTCGCGG TGGGGTTGGG CATTCCGCAA
TGGGGGCAAG GGCCGGATCT CGTCGCGGCC GGGCAGGGCA TGTTCCCACT CACGGGCTGG
CCCGGCGTCG AGTCCTTCGA GCGTGCCATG TCGATGCTGG TGCTGCCGCA ACTCTCGCTG
ACCGTCACCA ACGCCATCGT GCTCACCGCG CTCGTCGCCG GCGACTACTT CGGCGAGCGC
GCGGCGCATG TCACCCCCGC GCGGCTGTCG ATCACCACCG GCCTGGCCAA TCTCCTGCTC
AGCCCCCTGG GGGCCTTGCC CATGTGCCAC GGTGCGGGCG GGCTGGCGGC GCATTACCGC
TTCGGCGCGC GCAGCGGCAC GGCGCCGTTG TTGCTGGGGT TGGGACTGCT GGGCGTGGCG
TGTCTGCCAA CATCGTGGGG CCTGGCCATG CTCGCCGCGA TTCCCGTCGC CGGGCTGGGC
GCCTTGCTGC TCGTTGCCGC CTGGCAACTG GCCGTCACCA AGCGACTTTA CGATAGCAAG
CCCTCGTGCT GGCCGGTGAT CGCCGCGACC GCCGTGGCGA CGGTCGCGCT GGATCCTTTC
TGGGGGCTGG TGGCCGGGGG CGTCAGCGAA TGGGCAAGAG TCAGCTGGCG TCGGCGTCGC
CACGCTTGA
 
Protein sequence
MPTVPVSPRG AAFVFNGREL NGALGDLGTL LPLLLGVLAV GGVSPGPVLF GFAAFYLVTA 
FYYRLPIPVQ PMKAVAAMLL TVGMSASELA IGGMIIGLVM LVLGLTGWIG HLRRLIPQSV
LAGLQLGLGV MLALASLSLM AEQAWLAGVT LAVLLVAMRI PGCPSVLLAL LVAVGLGIPQ
WGQGPDLVAA GQGMFPLTGW PGVESFERAM SMLVLPQLSL TVTNAIVLTA LVAGDYFGER
AAHVTPARLS ITTGLANLLL SPLGALPMCH GAGGLAAHYR FGARSGTAPL LLGLGLLGVA
CLPTSWGLAM LAAIPVAGLG ALLLVAAWQL AVTKRLYDSK PSCWPVIAAT AVATVALDPF
WGLVAGGVSE WARVSWRRRR HA