Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_4336 |
Symbol | |
ID | 5060821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | + |
Start bp | 4912192 |
End bp | 4913991 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640476598 |
Product | extracellular solute-binding protein |
Protein accession | YP_001161142 |
Protein GI | 145596845 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.168516 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.284884 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAGCA GATTTCCCGG CCGCGCCCTG CGTGGCGCGG TGGCGGCAGC CGCCACGATC GCCCTTGGGG CGGGTCTGGT CGGCTGCGGT GACAACAGTG GATCCAAGGA CGGCGGTAGT CAGGGCAAGG ACACCGTGAC AGTCGCCTTC CGTACCCCCA ACTGGATCCT GCCGATCTCG GCACCAGGCT TCACCCAGGG GGAGAACGCC ATCTTCGGCC AGGCGCTCTA CCGCCCCCTC TACCAGTACC GGCTGGACGG CACGGCGCAG TACAACATCG ACCCAGAACG CTCGATGGCC GAGCCCCCGC AGGTGAGCGA CGACGGCCGC ACGCTGACGA TCACGTTGAA GGACAACTCC TGGTCCGATG GCACGCCCAT CACCACCAGG GACATCCAGT TCTGGTACGA CCTGGTCTCG GCGAACAAGG ACAAGTGGGC GTCGTACCGG GCCGGCGGCT TCCCCGACAA CGTTGACGCG TGGTCGATCA AGGATGAGAA GACCTTCTCG ATCACCACCA CGGAGGTCTA CAACACCGCG TGGTTCGTCG ACAACCAGCT CAACCGCATC ACGCCCCTGC CCCAGCACGC CTGGGACAAG GACTCCGCGA CCGCCGACGT GAGCGACCTC GCCAGCAGCC CGGAGGGCGC CAAGAAGGTC TTCGACTTCC TCATCGCCGC CTCGAAGGAC CCCAAGACGT ACGACTCCAA CGAGTTGTGG CAGGTCACCA GCGGCGCGTG GACGCTGGAG AAGTACGTGC CCAACGGTGA GGTCACCCTC GCCGCCCAGC CGAACTACTC CGGTACCGAC AAACCGAAGC TCTCCACGGT CGTGTTGCGG CCGTTCACCA GCGATGACGC CGAGTTCAAC GTGCTCCGCG CCGGCGACAT CGACTACGGG TACGTGCCGC CGGCCAACAT GTCCCAGAAG AGCTACCTCG AGTCCAAGGG ATACACGATC TCACCGTGGT ACGGCTGGTC GATCACCTAC CTCCAGCTGA ACTACAACAA CCCGAAGTCC GGCGTGCTGT TCAAGCAGCC CTACCTTCGG CAGTCGCTGC AGATGCTCAT CGACCAGCCG ACGATCAGCA AGGTCATCTG GTCGGACACC GCCGCGCCGA CCTGCGGCCC GGTACCGGCC AGGCCCGGCA CCAACACCGA CGCCGCCGGG TGCGCGTACT CCTTCGATCC GGCGAAGTCC AAGGAACTGT TGGAGAGCAA CGGCTGGAAG GTGACCCCGG ACGGGCAGAC CACCTGCCAG TCGCCGGGCA CCGGCCCGAA CCAGTGCGGT GAGGGAATCG CCGCCGGCAC ACCGCTCGAG TTCACGGTGA CCAGCCAGAC CGGGTTCGCC GCCACGACCA AGATGTTCGC CGAGATCAAG TCACAGATGG CCAAGCTCGG TATCCAGCTG ACGATCAAGG AGGTGCCCGA CTCGGTCGCG GTCACGCCGA CCTGTGAGCC GACCGAGGCG AGCTGCTCGT GGGACATGTC CTTCTTCGGC TCGCAGGGCA GCTGGTACTA CCCGGCCTTC GCCAGCGGCG AGCGGCTCTT CGCCACCGAC GCCCCGGTGA ACCTGGGCAG CTACAGCAAT CCTGAGGCGG ACAAGCTCAT CGAGGCCACT CAGTTCGCTG GCGACGAGAG CGCGCTCATG GCGTACAACG ACTTCCTGGC CAAGGACCTG CCCGTGCTCT GGATGCCGAA TCCGGTGTAC CAGGTCTCGG CGTACCGCTC CGGCCTGCAG GGCGTCGAGC CTCAGGACCC GATGAACCTC ATGTACTTCC AGGACTGGTC CTGGGAATAG
|
Protein sequence | MTSRFPGRAL RGAVAAAATI ALGAGLVGCG DNSGSKDGGS QGKDTVTVAF RTPNWILPIS APGFTQGENA IFGQALYRPL YQYRLDGTAQ YNIDPERSMA EPPQVSDDGR TLTITLKDNS WSDGTPITTR DIQFWYDLVS ANKDKWASYR AGGFPDNVDA WSIKDEKTFS ITTTEVYNTA WFVDNQLNRI TPLPQHAWDK DSATADVSDL ASSPEGAKKV FDFLIAASKD PKTYDSNELW QVTSGAWTLE KYVPNGEVTL AAQPNYSGTD KPKLSTVVLR PFTSDDAEFN VLRAGDIDYG YVPPANMSQK SYLESKGYTI SPWYGWSITY LQLNYNNPKS GVLFKQPYLR QSLQMLIDQP TISKVIWSDT AAPTCGPVPA RPGTNTDAAG CAYSFDPAKS KELLESNGWK VTPDGQTTCQ SPGTGPNQCG EGIAAGTPLE FTVTSQTGFA ATTKMFAEIK SQMAKLGIQL TIKEVPDSVA VTPTCEPTEA SCSWDMSFFG SQGSWYYPAF ASGERLFATD APVNLGSYSN PEADKLIEAT QFAGDESALM AYNDFLAKDL PVLWMPNPVY QVSAYRSGLQ GVEPQDPMNL MYFQDWSWE
|
| |