Gene Information |
Locus tag | Strop_0216 |
Symbol | |
ID | 5056654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | + |
Start bp | 246131 |
End bp | 247792 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640472488 |
Product | extracellular solute-binding protein |
Protein accession | YP_001157079 |
Protein GI | 145592782 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
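The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon (consistent with "Is gene spliced: No" and the sequence ending in TGA). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon, which encodes no residue.
    return gene_length // 3 - 1


length = gene_length_bp(246131, 247792)
print(length)                     # 1662
print(protein_length_aa(length))  # 553
```

Both values agree with the Gene Length (1662 bp) and Protein Length (553 aa) fields in the record.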
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.923383 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGCAT CCAGGCCGAA GGTCGCTGTC GCGGCCGTCG CGGTCGCGGC CCTCGCGGTA GCCGGCTGCG CCGAGAGCGA CCGCGAGGAT TCCGGTGGTA GCAACAACGA CACCCTCGTC TTCGGCGTCG CCGGAGACCC GAAGGTGCTC GACCCGAGCT TTGCCAGCGA CGGCGAGTCG CTGCGCGTGG CCCGTCAGGT CTTCGAGACC CTGGTCCGCC CCGAGGAGGG TGGCACCAAG GTGAGCCCGG GCCTCGCGGA GTCCTGGACT CCGGACGAGG CGGGCACCAC CTGGACCTTC AAGCTCCGCT CGGGCGTGAA GTTCCACGAC GGCACCGACT TCGACGCCGA GGCCGTCTGC GTCAACTTCA ACCGTTGGTA CAACGCCACC GGCCTCATGC AGAGCCCGGA CGTGACCGCC TACTGGCAGG ACGTGATGGG CGGCTTCGCC CAGAACGAGG ACGAAGGGCT CTCGGAGAGC CTCTTCAAGT CCTGCACCGC CAAGGACGCC ACCACGGTCG ATCTGACCTT CACCCGGGTG TCCAGCAAGA TCCCGGCCGC CCTGATGTTG CCGTCGTTCT CCATCCACAG CCCGACGGCG CTGGAGCAGT ACGACGCGAG CAACGTCGGC GGCACCGCGA CGGACGTCCA GTACCCCGAG TACGCGACCG CGCATCCGAC CGGCACCGGG CCCTTCAAGT TCAAGTCCTG GGACATCGCC AACAAGTCGC TCACCATCGA GCGAAACGAC GACTACTGGG GCGAGAAAGC CAAGCTGAAG ACCCTCATCT TCCGGACCAT CCCGGATGAG AACGCGCGCA AGCAGGCGCT GAGCTCCGGC GACATCCAGG GCTACGACCT GGTCGGGCCG GCTGATGTCG AGCCGCTGAA GGGCGAGGGC TTCAACGTCC TGACCCGGCC GGCGTTCAAC ATCCTCTACC TGGGCATGAA CCAGCAGGGG AACCCGAAGC TGGCTGACCT CAAGGTCCGG CAGGCGATCG CCCACGCGAT CAACCGGCAG GCCCTGGTCG ACTCCAAGCT CCCCCCGGGT GCGAAGGTCG CGACCAACTT CTTCCCGGAC ACGGTCGAGG GGTGGAACGG CGACGTCACC ACCTACGACT ACGACGTCGA CAAGGCCAAG CAGCTGCTGG CCGAGGCCGA CGCGACGGAC CTGACGCTGC GGTTCCACTA CCCGACCGAG GTCACCCGTC CGTACATGCC GAACCCGAAG GATCTCTTCG AGCTGGTCTC GGCGGACCTC CAGGCGGCCG GCATCACGGT CGAGCCGATT CCGCTCAAGT GGAGCCCGGA CTACCTCAAC GCCACCACCT CCGGTAGCGA GCACGACCTG CACTTCCTCG GCTGGACCGG TGACTACGGC GACGCCTACA ACTTCATCGG CACCTTCTTC GACCGGCAGA AGGACGAGTG GGGCTTTGAC AACCCGGCCC TCTTCGAGCA GTTCCAGGAC GCCGACTCCA CCGCGGATAT GGCGTCACGG GTGGAGAAGT ACAAGGGTCT GAACAAGACC ATCATGGACT TCCTGCCCGG GGTGCCGATC TCGCACTCGC CGCCGGCGAT CGTGTTCGGC AAGGACGTCA CCGGCATCAA GGCCAGCCCG CTCACCGACG AGCGGTTCGC GAAGGCTGAG TTCACGTCCT GA
|
Protein sequence | MRASRPKVAV AAVAVAALAV AGCAESDRED SGGSNNDTLV FGVAGDPKVL DPSFASDGES LRVARQVFET LVRPEEGGTK VSPGLAESWT PDEAGTTWTF KLRSGVKFHD GTDFDAEAVC VNFNRWYNAT GLMQSPDVTA YWQDVMGGFA QNEDEGLSES LFKSCTAKDA TTVDLTFTRV SSKIPAALML PSFSIHSPTA LEQYDASNVG GTATDVQYPE YATAHPTGTG PFKFKSWDIA NKSLTIERND DYWGEKAKLK TLIFRTIPDE NARKQALSSG DIQGYDLVGP ADVEPLKGEG FNVLTRPAFN ILYLGMNQQG NPKLADLKVR QAIAHAINRQ ALVDSKLPPG AKVATNFFPD TVEGWNGDVT TYDYDVDKAK QLLAEADATD LTLRFHYPTE VTRPYMPNPK DLFELVSADL QAAGITVEPI PLKWSPDYLN ATTSGSEHDL HFLGWTGDYG DAYNFIGTFF DRQKDEWGFD NPALFEQFQD ADSTADMASR VEKYKGLNKT IMDFLPGVPI SHSPPAIVFG KDVTGIKASP LTDERFAKAE FTS
|
| |
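The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the GC content field could be recomputed from such text, the helpers below strip the whitespace and count G and C bases. The function names are hypothetical, and the sample string is only the first two blocks of the gene sequence, so its result is not the whole-gene figure (the record reports 66% for the full gene).

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence shown in the record.
first_blocks = "ATGCGTGCAT CCAGGCCGAA"
print(len(clean_sequence(first_blocks)))  # 20
print(gc_percent(first_blocks))           # 60.0
```

Passing the complete gene sequence string to `gc_percent` and `len(clean_sequence(...))` would allow the GC content and Gene Length fields to be checked against the sequence itself.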