Gene Csal_2357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_2357 
Symbol 
ID4027466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2647936 
End bp2648838 
Gene Length903 bp 
Protein Length300 aa 
Translation table11 
GC content65% 
IMG OID637967561 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_574405 
Protein GI92114477 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0367531 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGACAAT CCCTGCTCCC GCTGATCGGC GGCCTGGCTC TCGCCAGCAC CCTCGTCAGC 
CCCCTCACCA GTACCGCCGC CCACGCCGAG GACATCACCG TCGGCGGCAA GAACTTCACC
GAACAGCTCA TCCTGTCAAG CATGACCACG CAATACCTCC AGGCCCATGG CTACGAGGTG
GACCAGCGCG CCGGCATGGG CACCACGGTG CTGCGCCGCG CCCAGGAGAG CGGCCAGGTG
GATCTCTACT GGGAATACAC CGGCACCTCG CTGATCAGCT ACAACAAGGT GACCGAGGAC
CTCTCGCCGG AGGCCACCTA CGAGCGCGTC AAGGAACTCG ACGCCGAGAA GGGGCTGATC
TGGCTCGAGC CCTCCGAGGC CAACAACACC TACGCCCTGG CCATGCGCAA AGACGACGCC
GAAGCACGCG GCATCGCCAC CATTTCCGAT CTCGCCGACG TCATCAATGG CGGCCAGGAA
CTTGTGCTCG CCTCCAACGC GACCTTCTAC TCGCGCGATG ACGGCCTGCG CCCGATGCAG
GAGACCTATG GCTTCGAGTT CGGCCGGCGC AACGTGAAGC GGATGGACCA GGGCCTGACC
CTGACCTCGC TGGATCAGGA AGAAGTCGAC GTGGCGATGA CCACGGCGAC CAACGGGCGC
ATCCCGGCCC TGGACCTGAC CGTCCTCGAG GACGACAAGA ACTTCTTCCC CGACTATGCG
CTGACCCCGG TGGTCCGCGA GGAAACGCTC GAGGCGAACC CCGATCTCGA CGAACGCATG
AACGCGCTCT CCGCCCTGCT CGATGACAGC ACCATGGCGC GCCTCAACGC CAAGGTCGAC
GTCGACAAGC AGCCCGTCGA GAAGGTCGCC GAGCGCTTCC TCGAGGAGCA CGACCTGCTG
TAA
 
Protein sequence
MRQSLLPLIG GLALASTLVS PLTSTAAHAE DITVGGKNFT EQLILSSMTT QYLQAHGYEV 
DQRAGMGTTV LRRAQESGQV DLYWEYTGTS LISYNKVTED LSPEATYERV KELDAEKGLI
WLEPSEANNT YALAMRKDDA EARGIATISD LADVINGGQE LVLASNATFY SRDDGLRPMQ
ETYGFEFGRR NVKRMDQGLT LTSLDQEEVD VAMTTATNGR IPALDLTVLE DDKNFFPDYA
LTPVVREETL EANPDLDERM NALSALLDDS TMARLNAKVD VDKQPVEKVA ERFLEEHDLL