Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2193 |
Symbol | |
ID | 5708188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2525189 |
End bp | 2526712 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641271674 |
Product | extracellular solute-binding protein |
Protein accession | YP_001537045 |
Protein GI | 159037792 |
COG category | [R] General function prediction only |
COG ID | [COG4533] ABC-type uncharacterized transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0522887 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.720743 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCC TCGACGAGCG GGAGCGCGGC AACAGCGGAC CTGCGGTGCC GCTGCGGTTC CTGGGGCCCG TCGACGGCGG ACTGCCCGAC GAACCGCCGT CACATGTGGT CGGCACCGGA CAGATCCTCC GGCTCCTCAC CCGACAGCTG TTCGCGTACC CCGCAACCAC GGATCCGTCC GACCGCCCGG TACCCGACGC CGCAGTAGCC CTACCGACCG TTGGCAACGG TGGAGTCAGC GCCGATGGAC GCACCTGGCG CGTGCGGGTG CGCCGCGGCG TCCGGTGGGA CGCCGCAGCG GCGCGGGAGT TGACCGCCGG GGACGTGGTG CGGGGGCTCA AACGCACCGC ACACCCGCTG GCCCGGTCGA TCCGGCCCTA CTTCGGAGCC ATGATCGAGG GCATGGCCGA CTACTACCGG GCCTATGACG AGGCGTTCGG GCACTGGGCG GGGCATGCTC CGGCGTTCGC CCAGTTCCAG CAGCGCACCC ACATCCCCGG CCTCTGGGCG GAGGGAGACA GCGTGTGGCT GCGGCTCGTC GAGCCCGCCA GCGACCTCGT CGACCTGCTC GCCACCGGGT TCGCCGCGGC GGCCCCGCGA GAGTTCGACT ACCACGTGCC GGACAGCTCG CAGCTGTACC GGTGTACGCC CTCATCCGGG CCGTACCGAA TCGCGACACG GCTGACGTCA GGTCGGGGCC TCGTCCTGGA GCCGAACCCC CGGTGGGATC CGGCTACCGA CCCGGTACGT CGCCTGGTCT CTGGGCGGAT CGAGGTCGTT GCGACGGCTG ACGGCAAGCC GCTGTCCCCG GCCACCGGAT TCGGGGTGCT CTCCTGGTCG GTACCGTCGC CGCCGGCCCG GTTCGTCGGT AGGGGGTGCA GCTATTACCT CGCTGCCGGA CCCGGCCGGG CCGGCCGGGC TCGGGCGCTA CGACACGCGT TGTCCTGGGC CATCGACCGG GCGGCCCTGG CCGACGCCGT AGCCGGGGCG GGAGCCGAGG GCGTCGCCGT GCAGCATGGT GTCCTGCAAC CCGGTCAGCG GGGCGCGGCG CACGTTGCCG GTGCCGCTCC CGATGTCGGT GACCCGGACC GAGCCCGGCA CCTGCTCGCC GACGTGGGAC TCGGGGTGGG GGCCCGCCTG TCCCTGGGGG TGCCGGCGGG ATCCCGGACG GCGGCGGTGG CCACCGGGCT GGCCGCGGTG CTCGCGAGGT ACGGCATCGA CGTCGTGCGG GTACCGACGA CTGCGGTCGG TGTCGATCTG GTGCTGCACG AATGGGCCCC GGCATGGGCC GGCAACGCCA ACCGTGACCT GCTGCACCGG GTGTGGCCGG CCGACCGGGT GGCCACGGAA CCGGCAAGGT CCGCCGCGGA GGCGCTGCGC GCGCTGGAAC CGGAGCGGGC GGCGACGTGG TGGCGTCGTT TCGATCGACT GGCCACCGCC GATCTACGGC TGGTGCCACT GATCGTGGCT GCCTGGCCAC GGGCCCGGGC CGATGCCACG GCCACCGGCG TGGCGCGATG GTGA
|
Protein sequence | MTTLDERERG NSGPAVPLRF LGPVDGGLPD EPPSHVVGTG QILRLLTRQL FAYPATTDPS DRPVPDAAVA LPTVGNGGVS ADGRTWRVRV RRGVRWDAAA ARELTAGDVV RGLKRTAHPL ARSIRPYFGA MIEGMADYYR AYDEAFGHWA GHAPAFAQFQ QRTHIPGLWA EGDSVWLRLV EPASDLVDLL ATGFAAAAPR EFDYHVPDSS QLYRCTPSSG PYRIATRLTS GRGLVLEPNP RWDPATDPVR RLVSGRIEVV ATADGKPLSP ATGFGVLSWS VPSPPARFVG RGCSYYLAAG PGRAGRARAL RHALSWAIDR AALADAVAGA GAEGVAVQHG VLQPGQRGAA HVAGAAPDVG DPDRARHLLA DVGLGVGARL SLGVPAGSRT AAVATGLAAV LARYGIDVVR VPTTAVGVDL VLHEWAPAWA GNANRDLLHR VWPADRVATE PARSAAEALR ALEPERAATW WRRFDRLATA DLRLVPLIVA AWPRARADAT ATGVARW
|
| |