Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1946 |
Symbol | |
ID | 4027186 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 2201512 |
End bp | 2203347 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637967142 |
Product | extracellular solute-binding protein |
Protein accession | YP_573997 |
Protein GI | 92114069 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.185407 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCTCT TCGTTCGCGC TTTGCTTGTC ATACCGGGTC TGTGGGCCCT TAGCCTGTCG GCCCTGGCCG TGCCCGCCGC CGACGTCGCC ACCGTGGGCG GCATTTCGCT CTATGACAGC CCGGCGCTCC CCGACGACTT CACGCATCTG CCTTACACCA ACCCCGACGC GCCCAAGGGC GGAGAGCTGC GTCAGGCCGC GCAAGGCAGT TTCGATTCGA CGAACGGCTT CATCATCCAG GGCAATCCCG CCGATGGCCT CAGTCATGTC TACGACACCT TGATGGAAGC CAGCGCCGAC GAGCCCTTCA CCATGTATGG CCTGCTCGCC GGCGGCATCC GCCTCGACCC CGACCGTCAC TGGATGGAAA TCGACCTGCG TCGGTCGGCC CGGTTCCACG ACGGCCACCC GGTAACCGCC GAGGACGTGG TGTTCAGCTT CCGCCTGCTG CGCGACCAGG GACAACCCTT CTATCGCGCC TACTATGCCG GCGTCGATCA GGTCGAAGCC CTGGACGACG ATACCGTGCG CTTCGAGTTC AGCGACAACG AGTCCCGCGA GTTGCCGCTG ATCCTGGGAC AATTGCCGGT GCTGCCCAAG CATTACTGGC AATCGCGCGA TTTCACCTCG CCGACGCTGG ACAAGCCTCT GGGCTCGGGC CCCTACGAAG TGGCCAGCAT CCTTCCCGGG CGGCGTATCA TGTATCGCCG TGTCGACGAT TACTGGGGGC GGAATCTGCC CATCAACCGG GGTCGCCACA ACATCGACCG CCTGGTCTAC GACTACTACC GGGACCAGAC CGTGGCGCTC GAGGCCTTCA AGGCAGGCAA TCTCGACCTG CGCCGTGAAA GCAGTGCCAA GAACTGGGCC ACCGCTTACG ACACGCCCGC CCTGGAAGCG GGCTTCATCA AGCGCATGAC CGTGCCCGAC GCACAGCCCG CGGGCATGCA GGCCTTCGTC ATGAACCTGC GCCGCGCCCC GTTCCAGGAC CGACGCGTGC GCGAGGCCCT GACCCTGGCT ACCGATTTCG ACTGGCTCAA CACGCACCTG TTCTATGGCG CCTACCAGGA AACCGATAGC TACTTCGAGA GCTCGGAAAT GGAGGCGCAA GGTCTGCCCA GCGACGATGA ACTCGCGCTG CTCGCCCCTT ATCGCGACAT CCTGCCCGAC GACGTCTTCG AAGAACCTCT CCCGATGTCG CGCCCCGACA CGTTGCGCGC ACGCTTGAAA AAGGCGCTGT CACTGCTGCG CGAGGCCGGC TACGAGGTGC GCGACGGCGT GCTCGTCGAT ACCGACACCG GGCGCCCCAT GCGCCTGCAA TTCCTGCTCT ACGACACCCA GTTCGAGCGC GTCACGCTTC CCCTCATCCA GAATCTCGAG CGCCTTGGCA TTCAGGCCAG CGTGCGTGTC GTCGACGTCA ACCAGTACCT GACACGCCGC CGGAACTTCG ATTTCGACCT GATGATCGGC AGCTTTCCGC AATCGGCCAA TCCGGGCAAC GAACAACGCG AGTACTGGAC GAGCGAATAT GCCGATGCTC CGCGCAGCCG CAACCTGATC GGCCTGCGAA ACCCGGCGGT CGACGCCCTG GTCGATCGCC TGATCGGTGC CAACAGCCGA CAGGCGCTGG ACACCACGGC ACGCGCGCTG GACCGCGTCC TGCGCTGGGG GTTCTATGTC ATCCCCCAGT GGCATCTGGA TGGCACTCGC ATCGCGATGT GGGATAAATT CGGGTACCCG CAACCCTTCC CCGAGTATAC GTTCGACCTG TCGAGCTGGT GGGTCGATCC GCAACGCGCT GCCCGCGTCG AAGAACGTCA ACGCGGCGAA GGTTAA
|
Protein sequence | MSLFVRALLV IPGLWALSLS ALAVPAADVA TVGGISLYDS PALPDDFTHL PYTNPDAPKG GELRQAAQGS FDSTNGFIIQ GNPADGLSHV YDTLMEASAD EPFTMYGLLA GGIRLDPDRH WMEIDLRRSA RFHDGHPVTA EDVVFSFRLL RDQGQPFYRA YYAGVDQVEA LDDDTVRFEF SDNESRELPL ILGQLPVLPK HYWQSRDFTS PTLDKPLGSG PYEVASILPG RRIMYRRVDD YWGRNLPINR GRHNIDRLVY DYYRDQTVAL EAFKAGNLDL RRESSAKNWA TAYDTPALEA GFIKRMTVPD AQPAGMQAFV MNLRRAPFQD RRVREALTLA TDFDWLNTHL FYGAYQETDS YFESSEMEAQ GLPSDDELAL LAPYRDILPD DVFEEPLPMS RPDTLRARLK KALSLLREAG YEVRDGVLVD TDTGRPMRLQ FLLYDTQFER VTLPLIQNLE RLGIQASVRV VDVNQYLTRR RNFDFDLMIG SFPQSANPGN EQREYWTSEY ADAPRSRNLI GLRNPAVDAL VDRLIGANSR QALDTTARAL DRVLRWGFYV IPQWHLDGTR IAMWDKFGYP QPFPEYTFDL SSWWVDPQRA ARVEERQRGE G
|
| |