Gene Csal_1019 details

Gene Information

Locus tag: Csal_1019
Symbol:
ID: 4027865
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 1149731
End bp: 1150738
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 64%
IMG OID: 637966196
Product: periplasmic binding protein/LacI transcriptional regulator
Protein accession: YP_573075
Protein GI: 92113147
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
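
The length fields above are consistent with the listed coordinates, and a short sketch makes the arithmetic explicit. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length (the gene sequence in this record does end in TGA). The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC content in percent; whitespace between sequence blocks is ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    length = gene_length(1149731, 1150738)
    print(length)                  # 1008
    print(protein_length(length))  # 335
```

Passing the gene sequence from the Sequence section to `gc_percent` gives a value that can be compared with the GC content field, which the record reports rounded to a whole percent.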


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.642553
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCATTCA CGCTATCCAC CCCGCGCAAG CTACTCGGAC TGGCCGTCGG CATGGCGCTC 
AGCGCCAGCC CGCTATTGGT CCAGGCGCAG AGCGACGACG AGGTCAAGAT CGGCTTCATC
GTCAAGAAAC CCGAGCAGGC ATGGTTCATC AACGAACAAG ACGCCGCGAC CCAACTCGGC
GAAGAGAAAG GCTTCGAGGT GGTGCGTCTG TCCGGCGAGG ATGGCCAGGA AGTGCTCAGT
GCCATCGATA ACCTCCATTC CCAGGGAGCC GAGGGCTTCG TGATCTGTCC CCCGGATGTG
CGTCTGGGGC CGGCGATCAT GAACCGTGCC GAGCAATACG GCATGAAGGT GGTGACCGTC
GACGACCGTT TTGTCGGCGG TGATGGTGAG CCCATGGAGG AGGTGCCGCA CCTGGGGATG
TCCGGCTACA AGATCGGCGA GCAGGTCGGC AATGCCATCG CCGAGGAGAT GGAACGTCGC
GGCTGGGACC CGGAGGAGGT CGCGGCGCTG CGCATCACCA ACTACGAGCT GCCCACCGCC
AAGGAGCGTA CCGACGGGGC GACTGCCGCG CTGCTCGACT CGGGCTTCAA GGAAGCCAAC
ATCTTCGATG CGCCGCAGCA GAACACCGAT ACCAGCAGTG CCTTTGCTGC GGCCTCGCCG
GTCTTCTCCA AGCGCAGCGA CTTCGAGCAT TGGGTGATCT ACGCGCTCAA TGAGGAAAGT
GTGCTGGGCG GCGTGCGGGC CAGCGAGCAG TACGGGCTCG ATCCCGACCA GGTCATCGGC
GTGGGGATCA ACGGCTCCGG TGCGGCCTTT GCCGAGTTCT CGCGCGAGAC GCCCACCGGC
TTCTACGGCA CCGTGGCGGT CAGCTCGACC ATGCATGGAC GCCAGACGGC CGACAATCTC
TACCAGTGGA TCACCGAGGG CGAGAAGCCG CCGGCCAACA CGGAAACCAC GGGCAAGCTG
ATGACCCGCG ACAACTGGGA AGACGTTCGG GAAGAGCTGG GCTTGTGA
 
Protein sequence
MSFTLSTPRK LLGLAVGMAL SASPLLVQAQ SDDEVKIGFI VKKPEQAWFI NEQDAATQLG 
EEKGFEVVRL SGEDGQEVLS AIDNLHSQGA EGFVICPPDV RLGPAIMNRA EQYGMKVVTV
DDRFVGGDGE PMEEVPHLGM SGYKIGEQVG NAIAEEMERR GWDPEEVAAL RITNYELPTA
KERTDGATAA LLDSGFKEAN IFDAPQQNTD TSSAFAAASP VFSKRSDFEH WVIYALNEES
VLGGVRASEQ YGLDPDQVIG VGINGSGAAF AEFSRETPTG FYGTVAVSST MHGRQTADNL
YQWITEGEKP PANTETTGKL MTRDNWEDVR EELGL