Gene Csal_1946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1946 
Symbol 
ID4027186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2201512 
End bp2203347 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content64% 
IMG OID637967142 
Productextracellular solute-binding protein 
Protein accessionYP_573997 
Protein GI92114069 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.185407 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCTCT TCGTTCGCGC TTTGCTTGTC ATACCGGGTC TGTGGGCCCT TAGCCTGTCG 
GCCCTGGCCG TGCCCGCCGC CGACGTCGCC ACCGTGGGCG GCATTTCGCT CTATGACAGC
CCGGCGCTCC CCGACGACTT CACGCATCTG CCTTACACCA ACCCCGACGC GCCCAAGGGC
GGAGAGCTGC GTCAGGCCGC GCAAGGCAGT TTCGATTCGA CGAACGGCTT CATCATCCAG
GGCAATCCCG CCGATGGCCT CAGTCATGTC TACGACACCT TGATGGAAGC CAGCGCCGAC
GAGCCCTTCA CCATGTATGG CCTGCTCGCC GGCGGCATCC GCCTCGACCC CGACCGTCAC
TGGATGGAAA TCGACCTGCG TCGGTCGGCC CGGTTCCACG ACGGCCACCC GGTAACCGCC
GAGGACGTGG TGTTCAGCTT CCGCCTGCTG CGCGACCAGG GACAACCCTT CTATCGCGCC
TACTATGCCG GCGTCGATCA GGTCGAAGCC CTGGACGACG ATACCGTGCG CTTCGAGTTC
AGCGACAACG AGTCCCGCGA GTTGCCGCTG ATCCTGGGAC AATTGCCGGT GCTGCCCAAG
CATTACTGGC AATCGCGCGA TTTCACCTCG CCGACGCTGG ACAAGCCTCT GGGCTCGGGC
CCCTACGAAG TGGCCAGCAT CCTTCCCGGG CGGCGTATCA TGTATCGCCG TGTCGACGAT
TACTGGGGGC GGAATCTGCC CATCAACCGG GGTCGCCACA ACATCGACCG CCTGGTCTAC
GACTACTACC GGGACCAGAC CGTGGCGCTC GAGGCCTTCA AGGCAGGCAA TCTCGACCTG
CGCCGTGAAA GCAGTGCCAA GAACTGGGCC ACCGCTTACG ACACGCCCGC CCTGGAAGCG
GGCTTCATCA AGCGCATGAC CGTGCCCGAC GCACAGCCCG CGGGCATGCA GGCCTTCGTC
ATGAACCTGC GCCGCGCCCC GTTCCAGGAC CGACGCGTGC GCGAGGCCCT GACCCTGGCT
ACCGATTTCG ACTGGCTCAA CACGCACCTG TTCTATGGCG CCTACCAGGA AACCGATAGC
TACTTCGAGA GCTCGGAAAT GGAGGCGCAA GGTCTGCCCA GCGACGATGA ACTCGCGCTG
CTCGCCCCTT ATCGCGACAT CCTGCCCGAC GACGTCTTCG AAGAACCTCT CCCGATGTCG
CGCCCCGACA CGTTGCGCGC ACGCTTGAAA AAGGCGCTGT CACTGCTGCG CGAGGCCGGC
TACGAGGTGC GCGACGGCGT GCTCGTCGAT ACCGACACCG GGCGCCCCAT GCGCCTGCAA
TTCCTGCTCT ACGACACCCA GTTCGAGCGC GTCACGCTTC CCCTCATCCA GAATCTCGAG
CGCCTTGGCA TTCAGGCCAG CGTGCGTGTC GTCGACGTCA ACCAGTACCT GACACGCCGC
CGGAACTTCG ATTTCGACCT GATGATCGGC AGCTTTCCGC AATCGGCCAA TCCGGGCAAC
GAACAACGCG AGTACTGGAC GAGCGAATAT GCCGATGCTC CGCGCAGCCG CAACCTGATC
GGCCTGCGAA ACCCGGCGGT CGACGCCCTG GTCGATCGCC TGATCGGTGC CAACAGCCGA
CAGGCGCTGG ACACCACGGC ACGCGCGCTG GACCGCGTCC TGCGCTGGGG GTTCTATGTC
ATCCCCCAGT GGCATCTGGA TGGCACTCGC ATCGCGATGT GGGATAAATT CGGGTACCCG
CAACCCTTCC CCGAGTATAC GTTCGACCTG TCGAGCTGGT GGGTCGATCC GCAACGCGCT
GCCCGCGTCG AAGAACGTCA ACGCGGCGAA GGTTAA
 
Protein sequence
MSLFVRALLV IPGLWALSLS ALAVPAADVA TVGGISLYDS PALPDDFTHL PYTNPDAPKG 
GELRQAAQGS FDSTNGFIIQ GNPADGLSHV YDTLMEASAD EPFTMYGLLA GGIRLDPDRH
WMEIDLRRSA RFHDGHPVTA EDVVFSFRLL RDQGQPFYRA YYAGVDQVEA LDDDTVRFEF
SDNESRELPL ILGQLPVLPK HYWQSRDFTS PTLDKPLGSG PYEVASILPG RRIMYRRVDD
YWGRNLPINR GRHNIDRLVY DYYRDQTVAL EAFKAGNLDL RRESSAKNWA TAYDTPALEA
GFIKRMTVPD AQPAGMQAFV MNLRRAPFQD RRVREALTLA TDFDWLNTHL FYGAYQETDS
YFESSEMEAQ GLPSDDELAL LAPYRDILPD DVFEEPLPMS RPDTLRARLK KALSLLREAG
YEVRDGVLVD TDTGRPMRLQ FLLYDTQFER VTLPLIQNLE RLGIQASVRV VDVNQYLTRR
RNFDFDLMIG SFPQSANPGN EQREYWTSEY ADAPRSRNLI GLRNPAVDAL VDRLIGANSR
QALDTTARAL DRVLRWGFYV IPQWHLDGTR IAMWDKFGYP QPFPEYTFDL SSWWVDPQRA
ARVEERQRGE G