Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3814 |
Symbol | |
ID | 5705309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4346542 |
End bp | 4347537 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641273236 |
Product | periplasmic solute binding protein |
Protein accession | YP_001538598 |
Protein GI | 159039345 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.906416 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0575455 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAACC GCCGCGTTCC GCGTACCCTG GCCGCCGCCT CCGCCGCCCT ACTCACCCTC GGCGCCGCCG CCTGCTCCGA CAGCCAGGCA GACGCCGACC CGCAGCGAGT TGACGTGGTC GCCGCGTTCT ACCCACTCCA GTTCCTGACC CAGCAGATCG GGGGCGATAC GGTAACCGTC AGCAACCTGG TCAAGCCCGG CGCCGAGCCA CACGACATCG AGCTGAGCCC GAGCCAGGTT GGTGACGTGG CAGGCGCGGA GCTGATCGTC TACCTCAAGG GCTTCCAACC GCAGGTCGAC GACGCGGTGC AGCAGAACTC CGCCGACCGG GCGTTCGACG TGGCCACCGT CGAGCCGCTG CTCGACGCCA CGGGCGACAA CCACAACCAC GACCACGGAC ACGAGGGCGA GGCCGGACAC GAAGGTGAGG CCGGTCACGA AGGTGAAGGT GAAGCCGGAC ACGAGGGTGA GGCCGGCACC AAGGATCCAC ACCTGTGGCT GGACCCGACC CGGCTCGCCA CCATCGGCGA CCAACTCGCC GACCGGCTCG CGCAGGCCGA CCCCGAGCAC GCTGACGGGT ACACCGCCCG AGCGAAGGAT CTCCGGACCA AGCTGGAGCA GCTCGACGCG GAGTTCACCG CCGGTCTGAA GACCTGCCAA CGGCGGGAGA TCGTGGTCAG CCACACCGCC TTCGGCTACC TGACCACGCG CTACCAGCTG GAGCAGATCG GCATCACCGG CCTGAGCCCG GAACACGAGC CGTCGCCGCA GCGGCTGGCC GAGGTGATCG AGGAGGCCAA GGAGCACCAG GCCACCACGA TCTTCTTCGA GACGCTGGTC AGCCCGAAGG TCGCCGAGAC CATCGCCGCC CAGGTCGGGG CCGAGACCGC GGTGCTCGAC CCGCTCGAGG GGCTGTCCGC CGACAACGGC GGGGACTACT TCTCGGTGAT GCGGACCAAC CTCGCCAACC TGCAAAAGGC TCTGGGCTGC TCATGA
|
Protein sequence | MNNRRVPRTL AAASAALLTL GAAACSDSQA DADPQRVDVV AAFYPLQFLT QQIGGDTVTV SNLVKPGAEP HDIELSPSQV GDVAGAELIV YLKGFQPQVD DAVQQNSADR AFDVATVEPL LDATGDNHNH DHGHEGEAGH EGEAGHEGEG EAGHEGEAGT KDPHLWLDPT RLATIGDQLA DRLAQADPEH ADGYTARAKD LRTKLEQLDA EFTAGLKTCQ RREIVVSHTA FGYLTTRYQL EQIGITGLSP EHEPSPQRLA EVIEEAKEHQ ATTIFFETLV SPKVAETIAA QVGAETAVLD PLEGLSADNG GDYFSVMRTN LANLQKALGC S
|
| |