Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4778 |
Symbol | |
ID | 5704445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5408324 |
End bp | 5410126 |
Gene Length | 1803 bp |
Protein Length | 600 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641274176 |
Product | extracellular solute-binding protein |
Protein accession | YP_001539522 |
Protein GI | 159040269 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.584214 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000356618 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGACGAGCA GATTTCCCCG CCGCGCCCTG CGTGGCGCGG TGGCGGCAGC CACCACGATC GCTCTCGGGG CGGGCCTGGT CGGCTGCGGT GACAGCAGTG GATCGTCCCA AGACGGCGGA AGTCAGGGCA AGGACACCGT GACGGTCGCC TTGCGTACCC CCAACTGGAT CCTGCCGATC TCGGCACCCG GATTCACGCA GGGTGAAAAC GCCATCTTCA ACCAGTCGCT CTACCGGCCC CTGTACCAGT ACCGGCTCGA CGGCACGGCG CAGTACAACA TCGACCCACA ACGCTCGATG GCCGAGCCAC CGCAGGTGAG CGAGGACGGT CGGACCCTGA CGATCACGCT GAAGGACAAC ACCTGGTCCG ACGGCAAGCC CATCACCACC AAGGACATCC AGTTCTGGTA CGACCTGGTC ACGGCGAACA AGGACAAGTG GGCGTCCTAC CGGGCCGGCG GCTTCCCCGA CAACGTCGCG GAGTGGTCGG TCCAGGACGA GAAGACCTTC TCGATCACCA CCACGAAGGT CTACAACACC GCGTGGTTCG TCGACAACCA ACTCAACCGC ATCACGCCCC TGCCCCAGCA CGCCTGGGAC AAGGACTCCG CGACCGCCGA CGTGAGCGAC CTCGCCAGCA GCCCGGAGGG CGCCGAGAAG GTCTTCGACT TCCTCACCGC CGCCGCGAAG GACCCCAAGA CGTACGACTC CAACGAGTTG TGGAAGGTCA CCAGCGGCGC GTGGAAGCTG GAGAAGTACG TGCCCAACGG TGAGGTCACC CTCGCCGCCC AACCGAACTA CTCCGGTACC GACAAACCGA AGCTGGCCAC GGTCGTGTTG CGCCCGTTCA CCAGCGACGA CGCCGAGTTC AACGTGCTCC GCGCCGGTGA CATCGACTAC GGGTACGTGC CAGCGGCCAA CCTGTCCCAG GAGAGCTACC TCGAGTCCAA GGGATACACG GTCTCGCCGT GGTACGGCTG GTCGATCACC TACCTGCAGC TGAACTACAA CAACCCGAAA ACCGGCGTGC TGTTCAAGCA GCCCTACCTT CGGCAGTCGC TGCAGATGCT CATCGACCAG CCGACGATCA GCAAGGTCAT CTGGTCGGAC ACCGCCGCGC CGACCTGCGG CCCGGTACCG GCCAAGCCCG GCACCAACAC CGACGCCGCC GGATGCGCCT ACTCCTTCGA CCCGGCGAAG GCCAAGGAAC TGCTGGAGAG CCACGGCTGG AAGGTGACCC CGGACGGGCA GACCACCTGC CAGTCACCGG GCACCGGCCC GAACCAGTGC GGTGACGGAA TCGCCGCCGG CACGGCGCTG GAGTTCACCG TCACCAGCCA GACCGGGTTC GCCGCCACGA CCAAGATGTT CGCCGAGATC AAGTCACAGA TGGCCAAGCT CGGCATCCAG CTGACGATCA AGGAGGTGCC GGACTCGGTC GCGGTCACCC CGGCGTGCGA GCCGACCGAG GGGACCTGCG ACTGGGACAT GTCCTTCTTC GGCTCGCAGG GCAGCTGGTA CTACCCGGCC TTCGCCAGCG GCGAGCGGCT CTTCGCCACC GACGCCCCGG TCAACCTGGG CAGCTACAGC AATCCGGAGG CCGACAAGCT CATCGAGGCC ACCCAGTTCG CTGGCGACGA GAGCGCGCTC ACGGCGTACA ACGACTTCCT GGCCAAGGAC CTGCCTGTGC TGTGGATGCC GAACCCGGTG TACCAGGTCT CGGCGTACCG CTCCGGCCTG CAGGGAGTCG AGCCGCAGGA TCCGATGAAT CTCATGTACT TCCAGGACTG GTCCTGGAAG TAA
|
Protein sequence | MTSRFPRRAL RGAVAAATTI ALGAGLVGCG DSSGSSQDGG SQGKDTVTVA LRTPNWILPI SAPGFTQGEN AIFNQSLYRP LYQYRLDGTA QYNIDPQRSM AEPPQVSEDG RTLTITLKDN TWSDGKPITT KDIQFWYDLV TANKDKWASY RAGGFPDNVA EWSVQDEKTF SITTTKVYNT AWFVDNQLNR ITPLPQHAWD KDSATADVSD LASSPEGAEK VFDFLTAAAK DPKTYDSNEL WKVTSGAWKL EKYVPNGEVT LAAQPNYSGT DKPKLATVVL RPFTSDDAEF NVLRAGDIDY GYVPAANLSQ ESYLESKGYT VSPWYGWSIT YLQLNYNNPK TGVLFKQPYL RQSLQMLIDQ PTISKVIWSD TAAPTCGPVP AKPGTNTDAA GCAYSFDPAK AKELLESHGW KVTPDGQTTC QSPGTGPNQC GDGIAAGTAL EFTVTSQTGF AATTKMFAEI KSQMAKLGIQ LTIKEVPDSV AVTPACEPTE GTCDWDMSFF GSQGSWYYPA FASGERLFAT DAPVNLGSYS NPEADKLIEA TQFAGDESAL TAYNDFLAKD LPVLWMPNPV YQVSAYRSGL QGVEPQDPMN LMYFQDWSWK
|
| |