Gene Strop_2653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_2653 
Symbol 
ID5059116 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp2978870 
End bp2980423 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content69% 
IMG OID640474909 
Productextracellular solute-binding protein 
Protein accessionYP_001159475 
Protein GI145595178 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0051152 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.33796 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTACCC CTGTCAGATC CCGCCCACTC CGGCGGGGGC TGCTACCCCT CGTCACCGTC 
GCCCTGCTGC TGGCCGGTTG TGGCGCCGAC ACCACGACCG GTAGTTCCGT CGACGAGCCC
GGCACGCCGG TCGACGGCGG CACCCTGCGC TACGTCGTAC CGGGTTCGCC GGCGACCGCG
AGCAACGATC CGCACGGCGG ACTCGGCAAC GAATCCGACC TCATGCGGTT CGCGCTGACC
TACGACGTGC TCACCGTGCC CGGCACCGAC GGGACGCCGC AACCGCGGCT GGCCCAAACG
TGGGAGGCGA ACCAGAGCCT GGACCGCTGG ACTTTTCATC TGCGCGAGGA CGCCACCTTC
ACCGACGGCC AGCCGGTACT CGCCAAGGAC GTGCTCTACT CCCTGACCCG GATAGCGGAC
AAGGCCGCGG AGAACTACGG CCGACTGGCC GACTTCGACA TGGCAGCCGC CAGCGCACCC
GACGACCACA CGGTGATCCT GGCGACCCGT ACGCCGATGG CCGAAGCGCC GAAGGCGCTG
GAGTCGATCA GCTTTGTCGT TCCCGAGGGC AGCACGGACT TCGCCGAGCC GGTACGCGGT
TCGGGGCCGT TTCGGGTGAC CGAGACCGAC GCCCAAACCG CCGTACTTCT GCGTAACGAC
GACTGGTGGG GCGAGCGACC GCACCTGGAC CGGATCGAGA TCCGGGCGGT CGCCGACCCG
CAGGCCCGTG CGGCCGCCGT CACCTCCGGG CAGGCGGACG TAGCCGGTAG CGTCAGCCCG
GCGGCCGTCA AGGCCGCCGA GGCCGGCGGT GATGTGCAGG TCGTCCGCCG CAAGGGTGTC
ACCGAGTACC CGATCATCAT GCGGCTGGAC TCGGCACCGT TCGATGACCC GCGAGTACGG
GAGGCGCTCC GGCTCGCGAC CGATCGACAG GCACTCGTCG ACACGGTGTT CCTCGGATAC
GGTCAGATCG CCAACGATCT ACCCACCCCG TACGACCCGT CGTACCCGCA GGGTCTGACG
CAGCGCACCC AGGACCTTGA CCGGGCCAGG GAACTGCTCG AGCAGGCCGG GCACGCGGAC
GGGCTGGCGT TGACCCTGCA CACCACGACG TCGTACCCCG GCATGGACAC CGCGGCCACC
CTGTGGGCCC GGCAACTCGC CGACGTCGGC GTACAGGTCG ACGTGAAGGT GGAGCCGGCG
GACACCTACT GGACCGCCAT CTACGCCAAG AAGGACTTCT ACGTCGGATA CTACGGCGGC
ATTTCCTTCC CCGACCTGGT ACGCGTCGGC CTGCTTGCCG CATCGCCGAC CAACGAGACC
GCCTGGCGCA ACGCGTCGTT CGACGCCGAG TTCAATGCCG CCATGGGCAT CCTGGACCCG
ACCGAGCGCA ACGCCCGACT GGCCGGCATC CAGCAGGACC TCTGGCGCGA CGGAGGGTAC
GTGGTGTGGG GCGTCGGAGA TGGGTTGGAT CTGACCGCCC CCGGTGTGCG CGCTCTGCCC
GACGGTCCCG GCTTCCAGCG GATGTTCATC GAACGCGCCT GGAAGACGAG GTGA
 
Protein sequence
MLTPVRSRPL RRGLLPLVTV ALLLAGCGAD TTTGSSVDEP GTPVDGGTLR YVVPGSPATA 
SNDPHGGLGN ESDLMRFALT YDVLTVPGTD GTPQPRLAQT WEANQSLDRW TFHLREDATF
TDGQPVLAKD VLYSLTRIAD KAAENYGRLA DFDMAAASAP DDHTVILATR TPMAEAPKAL
ESISFVVPEG STDFAEPVRG SGPFRVTETD AQTAVLLRND DWWGERPHLD RIEIRAVADP
QARAAAVTSG QADVAGSVSP AAVKAAEAGG DVQVVRRKGV TEYPIIMRLD SAPFDDPRVR
EALRLATDRQ ALVDTVFLGY GQIANDLPTP YDPSYPQGLT QRTQDLDRAR ELLEQAGHAD
GLALTLHTTT SYPGMDTAAT LWARQLADVG VQVDVKVEPA DTYWTAIYAK KDFYVGYYGG
ISFPDLVRVG LLAASPTNET AWRNASFDAE FNAAMGILDP TERNARLAGI QQDLWRDGGY
VVWGVGDGLD LTAPGVRALP DGPGFQRMFI ERAWKTR