Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_2653 |
Symbol | |
ID | 5059116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 2978870 |
End bp | 2980423 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640474909 |
Product | extracellular solute-binding protein |
Protein accession | YP_001159475 |
Protein GI | 145595178 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0051152 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.33796 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTACCC CTGTCAGATC CCGCCCACTC CGGCGGGGGC TGCTACCCCT CGTCACCGTC GCCCTGCTGC TGGCCGGTTG TGGCGCCGAC ACCACGACCG GTAGTTCCGT CGACGAGCCC GGCACGCCGG TCGACGGCGG CACCCTGCGC TACGTCGTAC CGGGTTCGCC GGCGACCGCG AGCAACGATC CGCACGGCGG ACTCGGCAAC GAATCCGACC TCATGCGGTT CGCGCTGACC TACGACGTGC TCACCGTGCC CGGCACCGAC GGGACGCCGC AACCGCGGCT GGCCCAAACG TGGGAGGCGA ACCAGAGCCT GGACCGCTGG ACTTTTCATC TGCGCGAGGA CGCCACCTTC ACCGACGGCC AGCCGGTACT CGCCAAGGAC GTGCTCTACT CCCTGACCCG GATAGCGGAC AAGGCCGCGG AGAACTACGG CCGACTGGCC GACTTCGACA TGGCAGCCGC CAGCGCACCC GACGACCACA CGGTGATCCT GGCGACCCGT ACGCCGATGG CCGAAGCGCC GAAGGCGCTG GAGTCGATCA GCTTTGTCGT TCCCGAGGGC AGCACGGACT TCGCCGAGCC GGTACGCGGT TCGGGGCCGT TTCGGGTGAC CGAGACCGAC GCCCAAACCG CCGTACTTCT GCGTAACGAC GACTGGTGGG GCGAGCGACC GCACCTGGAC CGGATCGAGA TCCGGGCGGT CGCCGACCCG CAGGCCCGTG CGGCCGCCGT CACCTCCGGG CAGGCGGACG TAGCCGGTAG CGTCAGCCCG GCGGCCGTCA AGGCCGCCGA GGCCGGCGGT GATGTGCAGG TCGTCCGCCG CAAGGGTGTC ACCGAGTACC CGATCATCAT GCGGCTGGAC TCGGCACCGT TCGATGACCC GCGAGTACGG GAGGCGCTCC GGCTCGCGAC CGATCGACAG GCACTCGTCG ACACGGTGTT CCTCGGATAC GGTCAGATCG CCAACGATCT ACCCACCCCG TACGACCCGT CGTACCCGCA GGGTCTGACG CAGCGCACCC AGGACCTTGA CCGGGCCAGG GAACTGCTCG AGCAGGCCGG GCACGCGGAC GGGCTGGCGT TGACCCTGCA CACCACGACG TCGTACCCCG GCATGGACAC CGCGGCCACC CTGTGGGCCC GGCAACTCGC CGACGTCGGC GTACAGGTCG ACGTGAAGGT GGAGCCGGCG GACACCTACT GGACCGCCAT CTACGCCAAG AAGGACTTCT ACGTCGGATA CTACGGCGGC ATTTCCTTCC CCGACCTGGT ACGCGTCGGC CTGCTTGCCG CATCGCCGAC CAACGAGACC GCCTGGCGCA ACGCGTCGTT CGACGCCGAG TTCAATGCCG CCATGGGCAT CCTGGACCCG ACCGAGCGCA ACGCCCGACT GGCCGGCATC CAGCAGGACC TCTGGCGCGA CGGAGGGTAC GTGGTGTGGG GCGTCGGAGA TGGGTTGGAT CTGACCGCCC CCGGTGTGCG CGCTCTGCCC GACGGTCCCG GCTTCCAGCG GATGTTCATC GAACGCGCCT GGAAGACGAG GTGA
|
Protein sequence | MLTPVRSRPL RRGLLPLVTV ALLLAGCGAD TTTGSSVDEP GTPVDGGTLR YVVPGSPATA SNDPHGGLGN ESDLMRFALT YDVLTVPGTD GTPQPRLAQT WEANQSLDRW TFHLREDATF TDGQPVLAKD VLYSLTRIAD KAAENYGRLA DFDMAAASAP DDHTVILATR TPMAEAPKAL ESISFVVPEG STDFAEPVRG SGPFRVTETD AQTAVLLRND DWWGERPHLD RIEIRAVADP QARAAAVTSG QADVAGSVSP AAVKAAEAGG DVQVVRRKGV TEYPIIMRLD SAPFDDPRVR EALRLATDRQ ALVDTVFLGY GQIANDLPTP YDPSYPQGLT QRTQDLDRAR ELLEQAGHAD GLALTLHTTT SYPGMDTAAT LWARQLADVG VQVDVKVEPA DTYWTAIYAK KDFYVGYYGG ISFPDLVRVG LLAASPTNET AWRNASFDAE FNAAMGILDP TERNARLAGI QQDLWRDGGY VVWGVGDGLD LTAPGVRALP DGPGFQRMFI ERAWKTR
|
| |