Gene Csal_1701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1701 
Symbol 
ID4028539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1933469 
End bp1934443 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content66% 
IMG OID637966889 
Productperiplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system-like protein 
Protein accessionYP_573752 
Protein GI92113824 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.656408 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGCAACC AGCTACCGAC CGGGCGCAGC GCCCTGATGT TCAGCACGGC CCTGGCCGCT 
TTGCTTCTGG CAGCGCCTTC CGTGTCGGCC AATGAGGCCG AGCAGGAAGC GTCGGCCAGC
AAGCACGGAC CGGAAACCAT CGCCAACCAG TTCGTGTTCG GCTCCGGCAA GGAGTGCCCG
CATGAGCCGT ACTGCCTGCC CGCGCTCGAG GAAGAGTATG GCTTGCACTT CGCCGATTTC
GTGGTCACCG ACCCCGGGGG GCCGCGCACG CGCGAGGCGC TTCTGAACGG CGACATCCAG
ATTGGCGTGC TCTTCACTAC CAACGGGTAT CTCGCCACTG ACCGCTTCGT CCTGCTGGAG
GACGATCGTA ACGCCCAGCC GGCGGAAAAC GTCATTCCGG TGGCCCACCA GTCCATCGGC
GACGCCTACC CCGAGCTCGG CGAGGTGCTC GATCCGCTCA GCGCCGTCCT GACGACCCCG
GAGCTGGCCG AGATGAATCG ACGCTTCGCG CTCGACGGGG TGGACGCGGA GACCATCGCC
CGAGAGTGGC TCAAGGAGCA CGGCGCGCCC GCGCCTGCGG AGAGTGCGCC GGAAAAAGAG
GGCCCCACCA TCGTGGTCGG CTCCGGTAAC TTCGCCGAGA GCATCATCCT GGCGGAAATG
TACCACCAGG CGCTCGACCA GGCGGGATAT CCCACCCGGC ATCGGCAGGA AATCGGCAAC
CGCGCCACCT ATCTTCCCCT GCTCGAAAGT GGCGAGATCG ACCTGTTCCC GGAATACACC
GGCAGTCTGG GTGGCTGGCT GAACACGCTC GCCGACACGA GCGGGCAACC CCTGTCGGCA
TTGCTGCCCG AACATGACCT GGTGGGCTTC GAGCCAGCCC CTGCGCAAGA CAAGAACGGC
TTCGTGGTCA CCGCCGAGAC CGCACAGCGC TATGATCTCG AAAAGATCAG CGATCTGGCG
AAGCCCGCTC CCTGA
 
Protein sequence
MSNQLPTGRS ALMFSTALAA LLLAAPSVSA NEAEQEASAS KHGPETIANQ FVFGSGKECP 
HEPYCLPALE EEYGLHFADF VVTDPGGPRT REALLNGDIQ IGVLFTTNGY LATDRFVLLE
DDRNAQPAEN VIPVAHQSIG DAYPELGEVL DPLSAVLTTP ELAEMNRRFA LDGVDAETIA
REWLKEHGAP APAESAPEKE GPTIVVGSGN FAESIILAEM YHQALDQAGY PTRHRQEIGN
RATYLPLLES GEIDLFPEYT GSLGGWLNTL ADTSGQPLSA LLPEHDLVGF EPAPAQDKNG
FVVTAETAQR YDLEKISDLA KPAP