Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0212 |
Symbol | |
ID | 4027177 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 235428 |
End bp | 237044 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637965363 |
Product | extracellular solute-binding protein |
Protein accession | YP_572275 |
Protein GI | 92112347 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0260102 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAACA AATTGAAAGG CTTCGCACTC ACCGCTCTGT GCGCGACGAT CTCGAGCCAG GCGCTGGCGC AACAGACACC TCAGGAAGGC GGCGACATCG TGGTCACCTA TCAGAACGAC GTGGCCACGC TCGACCCGGC GATCGGCTAC GACTGGCAAA ACTGGTCGAT GATCAAGAGC CTGTTCGACG GTCTGATAGA CTACGAGCCG GGTACCACAG AGCTGAAGAC GGACCTCGCC AAGGACTACG AGATCTCCGA CAATGGCCTG ACCTACACCT TCCATCTGCG CGAAGGCGTT ACCTTCCACA ACGGCCGCGA AATGGTTGCC AGCGACGTGA AGTACTCCCT GGAGCGGACG GTCAACCCCG AGACTCAGAG CCCCGGCGCC GGCTTCTTCT CCTCCATCGA GGGCTTCGAT GCCGTGGCCT CGGGCGAATC GATGGCGTTG AGCGGTATCA CCACGCCCGA CGATTACACC GTGAAGATCC AGCTGTCGGC GCCGGATGCG GCTTTTCTGC ACATCATGGC CCTCAACTTT GCCTCGGTGG TGCCCAAGGA GTCGGTCGAG AAATGGGGCC GCGATTTCGG CAAGCATCCG GTGGGCACGG GGGCCTTCGA GTTGCAGGAC TGGGCGCTCG GCCAGCGTCT GACCTTCGTC AAGAATGACG ACTACTACCG TGACGGTATT CCGTATCTCG ATCGCATCGA CTTCGAGATC GGCCAGGAAC CCAACGTGGC CCTGATGCGG TTGCAGCGCG GAGAGGTGGA TATCGCTGGC GACGGCATCC CGCCCGCGCA GTTCCTGCAA TTCCGAAACG ATCCCGAATT CAAGGATCTG ATGGTCATCG GCGACCAACT GCATACCGGC TATTTGACCA TGAACGTCAC CATGCCTCCG CTGGATGAGG TCGAGGTGCG TCGTGCCATC AACATGGCGA TCAACAAGGA GCGCATCGTG CGCATCATCA ATGGTCGCGC GATTCCTGCC AATCAGCCGC TGCCGCCGGC GATGCCCGGC TACGACGAGG AGTACGAGGG CTACCCCTAT GACGTGGCTC AGGCCAAGGC GCTTCTCGAG GAAGCCGGCT ACGGCGATGG GTTCGATACC GAGATCTATG TCATGAACAC CGATCCACAG CCGCGTATCG CCCAGGCGAT CCAGCAAGAC TTGGCGCAGA TCGGCGTACG TGCCGAAATC AAGTCGCTGG CGCAGGCCAA CGTGATCGCC GCGGGTGGCG ACAAAGAGCA GGCGCCGATG GTGTGGTCGG GCGGCATGGC GTGGATCGCC GATTTCCCCG ACCCTTCCAA CTTCTGGGGA CCGATCCTGG GTTGCGAGGG CGCAGTGCCC GGCGGCTGGA ACTGGGCGTG GTACTGCAAC GAAGAGCTGG ACGCCGAGGC TGACGCTGCC GACGCCATGG TCGCACCGGA CCAGCAGCAG GCACGCGCCG AGCGCTGGGG CAAGATATTC ACCACGGCGA TGGATGACGC GCCCTGGGTG CCGATCTTCA ACGAGCAACG CTTCACCGTG CACTCGGCAC GCATGGGTGG CGATGATGCC TTGTACGTGG ATCCGGTCCA CGTTCCCGTC AACTACGACT ACATCTGGGT GAAGTGA
|
Protein sequence | MNNKLKGFAL TALCATISSQ ALAQQTPQEG GDIVVTYQND VATLDPAIGY DWQNWSMIKS LFDGLIDYEP GTTELKTDLA KDYEISDNGL TYTFHLREGV TFHNGREMVA SDVKYSLERT VNPETQSPGA GFFSSIEGFD AVASGESMAL SGITTPDDYT VKIQLSAPDA AFLHIMALNF ASVVPKESVE KWGRDFGKHP VGTGAFELQD WALGQRLTFV KNDDYYRDGI PYLDRIDFEI GQEPNVALMR LQRGEVDIAG DGIPPAQFLQ FRNDPEFKDL MVIGDQLHTG YLTMNVTMPP LDEVEVRRAI NMAINKERIV RIINGRAIPA NQPLPPAMPG YDEEYEGYPY DVAQAKALLE EAGYGDGFDT EIYVMNTDPQ PRIAQAIQQD LAQIGVRAEI KSLAQANVIA AGGDKEQAPM VWSGGMAWIA DFPDPSNFWG PILGCEGAVP GGWNWAWYCN EELDAEADAA DAMVAPDQQQ ARAERWGKIF TTAMDDAPWV PIFNEQRFTV HSARMGGDDA LYVDPVHVPV NYDYIWVK
|
| |