Gene Csal_1948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1948 
Symbol 
ID4027188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2204424 
End bp2205530 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content66% 
IMG OID637967144 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_573999 
Protein GI92114071 
COG category[R] General function prediction only 
COG ID[COG4239] ABC-type uncharacterized transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0329294 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACTC GCCGGCACTT TCGCGCCTCG CCCATCGCAC GCCGCCGCTG GCAGGTCTTT 
CGCGGCAACC GGCGTGCCTG GTGGTCGTTA TGGATCTTCG CCACACTGCT GGTGATCAGC
CTCGGCGCCG AGCTGATCGC CAACGACAAA CCCATCGTCA TGCAGTACCA GGGGCAATGG
TACGTCCCCG TGCTGATCGA CTACCCGGAA ACCGAGTTCG ACGGCTTCCT GCCCACGGCC
ACCGATTACC GCGACCCGGT CGTGCGCCGC CAGATCGAGG AGCGAGGCTG GATGCTATGG
CCGCCCGTGG GTTTCTCCTA CGAGACGCTG GACCGCGAGC TCGACGGGCC CGCGCCCTCG
CCGCCCTCGG CGCGACACTG GCTGGGCACC GACGACCATG GTCGCGACGT CTTCGCCCGC
GTGCTGTACG GCTTTCGCCT GTCGGTGGTG TTCGCGGTCA TCCTCACCGC CGGCTCGATG
GCCCTCGGGG TCCTGATCGG CGGCGTGCAA GGGTATTTCG GCGGCAAGGT CGATCTGATC
GGACAGCGCA TCATCGAGAT CTGGTCGGGC ATGCCGGTAC TGTTCCTGCT GATCATCCTG
GCCAGCCTCG TGCAGCCCAA CCTGTGGTGG CTGCTGGGCA TCATGCTGCT GTTTTCCTGG
CTGGGCCTGG TGGATATCGT GCGCGCCGAG TTCCTGCGCG CCCGCAACCT GGAGTATGTC
CGTGCGGCGC GCGCCATGGG CCTGCCCTCG CGCTTGATCA TGTGGCGCCA TGTCCTGCCC
AACGCCATGG TCGCCACCCT GACCTTCATC CCGTTCCTGT TCACCGGGGC GATCACCACG
CTCACTGCCC TGGACTTTCT GGGTTTCGGC TTGCCGCCGG GCTCGCCGTC GCTCGGCGAA
CTCGTCGCCC AGGGCAAGAA CAATCTGCAG GCGCCCTGGC TGGGGATCAC CGCTTTCCTG
AGCCTGGGCG TCATGCTGTC GCTGCTGGTC TTCATCGGTG AAGGGCTGCG CGATGCTTTC
GACCCTCGCC ACGTGCTCAG CCCCCAGACC TCGCCCACAG CGAGTGCCTC GAGCGGCCTT
CCAGGGAGAA GCGCGCATGG ACAATGA
 
Protein sequence
MTTRRHFRAS PIARRRWQVF RGNRRAWWSL WIFATLLVIS LGAELIANDK PIVMQYQGQW 
YVPVLIDYPE TEFDGFLPTA TDYRDPVVRR QIEERGWMLW PPVGFSYETL DRELDGPAPS
PPSARHWLGT DDHGRDVFAR VLYGFRLSVV FAVILTAGSM ALGVLIGGVQ GYFGGKVDLI
GQRIIEIWSG MPVLFLLIIL ASLVQPNLWW LLGIMLLFSW LGLVDIVRAE FLRARNLEYV
RAARAMGLPS RLIMWRHVLP NAMVATLTFI PFLFTGAITT LTALDFLGFG LPPGSPSLGE
LVAQGKNNLQ APWLGITAFL SLGVMLSLLV FIGEGLRDAF DPRHVLSPQT SPTASASSGL
PGRSAHGQ