Gene Information |
Locus tag | Sare_2552 |
Symbol | |
ID | 5706406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2905350 |
End bp | 2906681 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641272015 |
Product | extracellular solute-binding protein |
Protein accession | YP_001537385 |
Protein GI | 159038132 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
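The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced with a single terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 2905350
END_BP = 2906681


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1332
print(protein_length(length_bp))  # 443
```

Both results match the Gene Length (1332 bp) and Protein Length (443 aa) fields of the record.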
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000382941 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGAAGTT CCAGGGCGCT GCTGGCGGCG CTGCTCTCGA CAGTGCTGTT CGCCACCGGC TGCGGATCCG GGCTCGGGGC CGCGTCCGAC GGCACGGGGC CGGTGCGACT GTTGGTCTTC GGCGCACCCG AGGAGTTGGC CGCGTACCGC ACGCTGATCG AGGCGTACGG TCAGGCACGG CCCGGGAACG AGGTGCAGCT CATCGAGGCG AGCGACCGCA AGGACCTGCT GGCCCGGCTG GCCACGTCGG TCGCCGGGGG CGCCCCGCCG GACCTGTTCC TGATGAACTA TCGCTTCTAC GGCCAGTTCG CCGCGAAGAA CGTGGTCGAG CCGTTGGACG AGCGCATCGC CGCGTCCGAG AAAGTGGATC CCGACGACTA CTACCCGGTG GCGATGAACG CCTTCACCTG GGGCGGCAAA CAGCTCTGCC TGCCACAGAA CGTGTCCAGT CTCGCCGTCT ACTACAACCG CACCCTGTTC GCCAAGTACC AGGTCCCCGA GCCGAAGGCC GGCTGGACCT GGAACGACAT GGTCGGTACC GCCATCGCCA TGACCCGGGA CGCCCGCGGT GTGGTGGTCA AGGGCACCGA GAGCGAGGGC GCCGCCGTCC GGCCAGCCGT ACACGGGCTC GGCGTCGAGC CGTCGATCAT TCGCGTTGCC CCGTTCGTGT GGTCCGCCGG CGGCGAGATC GTCGACGACC CGAACCGGCC GACCCGGCTC ACCCTGGACA CCCCGGTCGG ACGGGAGGCA CTGAAGAACC TGGTCGACCT ACGGCAGGCG TACGGCGTGG TGCCCACGGA CGAGGAGGTC GAGGCCGAGG ATGACGAGTC CCGCTTCGCC AACGGCCGAC TCGCCATGCT GATGTCCTCG CGGCGCTCCA CCACCACCTT CCGCTCGATC ACTGACTTCG AGTGGGACGT CGCCCCGCTG CCGGTCTACC AGGACCAGGT CGGGGTGCTG CACTCGGACG CGTACTGCAT GACCCGGAGC GCGAAGCGTA AGGACGCGGC ATGGCGGTTC CTGGAGTTCG CCATCTCCGC CGAAGGACAG CGGATCATCG CCGCCACCGG AAGGACGGTA CCGTCGCACA TCGACGTCTC GCGCTCCTCG GTGTTCCTCG GCCCGTCCCA GCCGCCGCGC AGCGCGACGG TCTTCCTCGA CACGATTCCC ACCCTCCGGA CACTGCCGAC CGTCTCCACC TGGCCCGAAG TCGAGGATGT GACCGCCGGG ATCCTGGAGA ACGCGCTGTA CCGGGGCGAC CGGTTGGACG ATGTCATCCG CGCCGTCGAT GAGCAGACCC GCCCGCTGTT CGCACGTGGT GAGCACGGGT GA
|
Protein sequence | MRSSRALLAA LLSTVLFATG CGSGLGAASD GTGPVRLLVF GAPEELAAYR TLIEAYGQAR PGNEVQLIEA SDRKDLLARL ATSVAGGAPP DLFLMNYRFY GQFAAKNVVE PLDERIAASE KVDPDDYYPV AMNAFTWGGK QLCLPQNVSS LAVYYNRTLF AKYQVPEPKA GWTWNDMVGT AIAMTRDARG VVVKGTESEG AAVRPAVHGL GVEPSIIRVA PFVWSAGGEI VDDPNRPTRL TLDTPVGREA LKNLVDLRQA YGVVPTDEEV EAEDDESRFA NGRLAMLMSS RRSTTTFRSI TDFEWDVAPL PVYQDQVGVL HSDAYCMTRS AKRKDAAWRF LEFAISAEGQ RIIAATGRTV PSHIDVSRSS VFLGPSQPPR SATVFLDTIP TLRTLPTVST WPEVEDVTAG ILENALYRGD RLDDVIRAVD EQTRPLFARG EHG
|
| |
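The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for stripping that layout and computing a GC fraction follows. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence, so its result is not expected to equal the 69% reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence in the record above.
example = "ATGAGAAGTT CCAGGGCGCT"
seq = clean_sequence(example)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.55
```

Applied to the full gene sequence, `len(clean_sequence(...))` should agree with the Gene Length field if the sequence was copied without loss.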