Gene Csal_1994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1994 
Symbol 
ID4027078 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2251033 
End bp2252784 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content66% 
IMG OID637967189 
Productsulfate permease 
Protein accessionYP_574044 
Protein GI92114116 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCAAGC GCTACCTGCC CATTCTCGAG TGGCTGCCGC GTTACGATCG TCAGACACTC 
TCGCAAGACC TGTTCGCCGC CGTCATCGTG ACGCTCATGG TCATTCCCCA GGCCCTGGCA
TATGCCCTGC TCGCGGGACT GCCGGCGGTA ACCGGTCTGT ATGCCAGCAT GCTGCCGCTG
GTGGCCTACA CCGTCTTCGG CACCAGCCGC ACCCTGGCGG TCGGGCCGAT GGCGATCGTC
TCGCTGATGA CCGCCGCGGC GCTGTCCGGC ATCGTCGCCA CGGGAACCGT CGCCTACAGC
GAAGCCGCAG CAACGCTGGC GTTTCTGTCC GGCGTCATGC TGATGCTGAT GGGCATCTTT
CGCTTGGGGT TCTTCGCCAA CTTCCTGAGT CACCCCGTGA TCTCGGGGTT GCTCAGCGCC
TCCGGCGTGC TGATCGCCAC CAGCCAATTG GGCAACCTGC TGGGCATTTC GATGTCCGGC
TTCACGCTGA TCGACCAGCT GGCCGGGCTG GCCCTGCACT GGCGGGACTT CAGCATGCCC
ACGGCACTGA TCGGCCTGGG ATCGCTGGGT TTCTTGATGG TGATGCGTCG TGCGGGGCCG
GTACTCAAGA GCTGGGGCCT CTCGGCCACG CTCAGCGGCT TCATCGCCAA GGCCGGGCCG
ATCATCGCCG TCGTCGTCTC CACCCTGCTG GTATGGGCGT TCGATCTGGA AGCGCACGGC
GTGGCCGTGG TCGGCGAGAT TCCCCGTCAT CTGCCGCCGA TCGCATTGCC CTCGCTGGAT
CCGAGCCTGC TGAGCACCCT CTGGATGCCG GCGCTCCTGA TCAGCCTGGT GGGCTTCATC
GAATCAGTCT CGCTGGCGCA GATGCTCGCC GCCAAGCGCC GACAGCGCAT CTCGCCGGAC
CAGGAGCTGT TCGCGCTGGG AGGCAGCAAC CTCGCCGCCG CGTTGAGCAG CAGCATGCCG
GTGACAGGCA GCCTGTCACG CACGGTGATC AACTTCGATG CCGGGGCACG CACGCCGGCG
GCAGGCAGCT TCGCCGCGCT GGGCGTGGCG CTGGTGACGC TCTATCTCAC GCCGCTGATC
CACTTTCTGC CCATCGCCAC GCTGGCCGCC AGCATCATCG TCTCGACCTT CACGCTGCTC
GACGCGCGGG GCCTGAAGCG CACCTGGCGC TACTCGAAGC GTGACTTCGC CGCCATGCTG
GCCACCATCG TGCTGACGTT CGTGGTCGGG GTCGAGGCGG GCGTCATGGC CGGGGTCGGC
TTGTCGCTGG CACTGTTCCT GTACCGCACC AGCCGGCCGC ACAGCGCGCT GGTGGGTCGC
GTGCCCGGCA CCGAGCACTT CCGCAACGTG GAACGCTACG CCACCGAGAA CGACCCGCAC
GTGGCCTTGC TGCGGGTCGA CGAAAGCCTG TACTTCGCCA ATGCGCGTTA TCTGGAAGAT
ACCGTCTATG CCATGGTCGC CGAGCGTCCC GCGCTCAAGC ACGTGGTCTT GATCGGCTCG
GCGGTGAACC TGATCGACGC CTCGGCGCTG GAAAGCCTCG AAGCCATCAA TGCTCGCCTG
GAGGACTCCC GAGTCAAGCT CCACCTGGCC GAGGTGAAGG GACCGGTCAT GGACCAGCTC
AAGCAGAGCG ACTTCCTCGA GCATCTCACC GGCGAGGTGT TTCTCAGCAC CTACCACGCC
TGGGAAGCCC TGCGCGAGGA AGACACCGTA GTGACGCCGC TGGATGGCGA CGACGCCTCC
CGCCAGGATT AA
 
Protein sequence
MLKRYLPILE WLPRYDRQTL SQDLFAAVIV TLMVIPQALA YALLAGLPAV TGLYASMLPL 
VAYTVFGTSR TLAVGPMAIV SLMTAAALSG IVATGTVAYS EAAATLAFLS GVMLMLMGIF
RLGFFANFLS HPVISGLLSA SGVLIATSQL GNLLGISMSG FTLIDQLAGL ALHWRDFSMP
TALIGLGSLG FLMVMRRAGP VLKSWGLSAT LSGFIAKAGP IIAVVVSTLL VWAFDLEAHG
VAVVGEIPRH LPPIALPSLD PSLLSTLWMP ALLISLVGFI ESVSLAQMLA AKRRQRISPD
QELFALGGSN LAAALSSSMP VTGSLSRTVI NFDAGARTPA AGSFAALGVA LVTLYLTPLI
HFLPIATLAA SIIVSTFTLL DARGLKRTWR YSKRDFAAML ATIVLTFVVG VEAGVMAGVG
LSLALFLYRT SRPHSALVGR VPGTEHFRNV ERYATENDPH VALLRVDESL YFANARYLED
TVYAMVAERP ALKHVVLIGS AVNLIDASAL ESLEAINARL EDSRVKLHLA EVKGPVMDQL
KQSDFLEHLT GEVFLSTYHA WEALREEDTV VTPLDGDDAS RQD