Gene Csal_2943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_2943 
Symbol 
ID4028338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp3280833 
End bp3281765 
Gene Length933 bp 
Protein Length310 aa 
Translation table11 
GC content65% 
IMG OID637968150 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_574987 
Protein GI92115059 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGGAA CGTTTCAACA CCTCGCCACC CGCGTGCTTG GCCTGGCGGC CGTCACCGCG 
CTGTCGATAT CGCCGGTCTT CGCCGCCGAC CCGGTACGGG TATCCTCGAA GATCGACACC
GAGGGGGCCT TGCTGGGCAA CATGATGGTC CAGCTCCTGG AGCATGCCGA CGTTCCCGTG
GAGGAAAACC TCCAGCTCGG ACCCACCAAC ATCGTGCGCA GCGCCCTGCT GGAAGACGAG
ATCGACCTGT ATCCCGAGTA CACCGGCAAC GGCGCCTTCT TCACCCAGAC TACCGACGAT
CCGGCCTGGA AACAGGCCGA GGCGGGCTAC GAGAAGATCC GTGCCTACGA CAAGCAGCAC
AACGATCTCG TCTGGCTGAC ACCCGCACCG GCCAACAACA CCTGGGCGAT CGCCCTGCGT
CGGGATATCG CCGACGAGCA CGATCTCTCC ACCATGCAGG ACTTCGCGGC CTGGGTACGC
GACGGAGGCG AGGTGAAACT CGCCGGCTCG GCGGAGTTCG TGGAGAGCGA TGCCGCCCTG
CCCAGCTTCC AGCGTGCGTA CGACTTCACG CTCGACCAGG AGCAACTGCT GGTGCTTTCC
GGCGGCAACA CGGCCGCCAC CATCCGCGCG GCGGCCAATA ATACCAGCGG TACCAACGCG
GCGATGGTCT ACGGCACCGA TGGCGCGATC GCCGCGGCCG ACCTCAGGGT CATGGACGAT
ACCCAGGGCG TACAGATGGT CTATGCGCCG GCGCCGGTGA TCCGCCAGGC GACGCTCGAC
GCCTACCCCG AGATTCCGGA GCTGCTCGGC CCCTTGTTCG AGGGACTCGA CCGCGAGACG
CTGCAGACCC TCAACAGCCG CATTCAGGTA GATGGCATGC CGGCGAGTGA CGTCGCACGC
GATTACCTCG AATCGCAAGA CCTGCTCGAC TAA
 
Protein sequence
MPGTFQHLAT RVLGLAAVTA LSISPVFAAD PVRVSSKIDT EGALLGNMMV QLLEHADVPV 
EENLQLGPTN IVRSALLEDE IDLYPEYTGN GAFFTQTTDD PAWKQAEAGY EKIRAYDKQH
NDLVWLTPAP ANNTWAIALR RDIADEHDLS TMQDFAAWVR DGGEVKLAGS AEFVESDAAL
PSFQRAYDFT LDQEQLLVLS GGNTAATIRA AANNTSGTNA AMVYGTDGAI AAADLRVMDD
TQGVQMVYAP APVIRQATLD AYPEIPELLG PLFEGLDRET LQTLNSRIQV DGMPASDVAR
DYLESQDLLD