Gene Strop_4336 details


Gene Information

Locus tag: Strop_4336
Symbol:
ID: 5060821
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 4912192
End bp: 4913991
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 65%
IMG OID: 640476598
Product: extracellular solute-binding protein
Protein accession: YP_001161142
Protein GI: 145596845
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
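
The coordinate and length fields in this record are consistent with each other. The short Python sketch below checks that arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4912192
END_BP = 4913991
GENE_LENGTH_BP = 1800
PROTEIN_LENGTH_AA = 599


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of residue-encoding codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# 4913991 - 4912192 + 1 = 1800 bp
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
# 1800 / 3 = 600 codons, minus the stop codon = 599 aa
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```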


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.168516
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.284884
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACGAGCA GATTTCCCGG CCGCGCCCTG CGTGGCGCGG TGGCGGCAGC CGCCACGATC 
GCCCTTGGGG CGGGTCTGGT CGGCTGCGGT GACAACAGTG GATCCAAGGA CGGCGGTAGT
CAGGGCAAGG ACACCGTGAC AGTCGCCTTC CGTACCCCCA ACTGGATCCT GCCGATCTCG
GCACCAGGCT TCACCCAGGG GGAGAACGCC ATCTTCGGCC AGGCGCTCTA CCGCCCCCTC
TACCAGTACC GGCTGGACGG CACGGCGCAG TACAACATCG ACCCAGAACG CTCGATGGCC
GAGCCCCCGC AGGTGAGCGA CGACGGCCGC ACGCTGACGA TCACGTTGAA GGACAACTCC
TGGTCCGATG GCACGCCCAT CACCACCAGG GACATCCAGT TCTGGTACGA CCTGGTCTCG
GCGAACAAGG ACAAGTGGGC GTCGTACCGG GCCGGCGGCT TCCCCGACAA CGTTGACGCG
TGGTCGATCA AGGATGAGAA GACCTTCTCG ATCACCACCA CGGAGGTCTA CAACACCGCG
TGGTTCGTCG ACAACCAGCT CAACCGCATC ACGCCCCTGC CCCAGCACGC CTGGGACAAG
GACTCCGCGA CCGCCGACGT GAGCGACCTC GCCAGCAGCC CGGAGGGCGC CAAGAAGGTC
TTCGACTTCC TCATCGCCGC CTCGAAGGAC CCCAAGACGT ACGACTCCAA CGAGTTGTGG
CAGGTCACCA GCGGCGCGTG GACGCTGGAG AAGTACGTGC CCAACGGTGA GGTCACCCTC
GCCGCCCAGC CGAACTACTC CGGTACCGAC AAACCGAAGC TCTCCACGGT CGTGTTGCGG
CCGTTCACCA GCGATGACGC CGAGTTCAAC GTGCTCCGCG CCGGCGACAT CGACTACGGG
TACGTGCCGC CGGCCAACAT GTCCCAGAAG AGCTACCTCG AGTCCAAGGG ATACACGATC
TCACCGTGGT ACGGCTGGTC GATCACCTAC CTCCAGCTGA ACTACAACAA CCCGAAGTCC
GGCGTGCTGT TCAAGCAGCC CTACCTTCGG CAGTCGCTGC AGATGCTCAT CGACCAGCCG
ACGATCAGCA AGGTCATCTG GTCGGACACC GCCGCGCCGA CCTGCGGCCC GGTACCGGCC
AGGCCCGGCA CCAACACCGA CGCCGCCGGG TGCGCGTACT CCTTCGATCC GGCGAAGTCC
AAGGAACTGT TGGAGAGCAA CGGCTGGAAG GTGACCCCGG ACGGGCAGAC CACCTGCCAG
TCGCCGGGCA CCGGCCCGAA CCAGTGCGGT GAGGGAATCG CCGCCGGCAC ACCGCTCGAG
TTCACGGTGA CCAGCCAGAC CGGGTTCGCC GCCACGACCA AGATGTTCGC CGAGATCAAG
TCACAGATGG CCAAGCTCGG TATCCAGCTG ACGATCAAGG AGGTGCCCGA CTCGGTCGCG
GTCACGCCGA CCTGTGAGCC GACCGAGGCG AGCTGCTCGT GGGACATGTC CTTCTTCGGC
TCGCAGGGCA GCTGGTACTA CCCGGCCTTC GCCAGCGGCG AGCGGCTCTT CGCCACCGAC
GCCCCGGTGA ACCTGGGCAG CTACAGCAAT CCTGAGGCGG ACAAGCTCAT CGAGGCCACT
CAGTTCGCTG GCGACGAGAG CGCGCTCATG GCGTACAACG ACTTCCTGGC CAAGGACCTG
CCCGTGCTCT GGATGCCGAA TCCGGTGTAC CAGGTCTCGG CGTACCGCTC CGGCCTGCAG
GGCGTCGAGC CTCAGGACCC GATGAACCTC ATGTACTTCC AGGACTGGTC CTGGGAATAG
 
Protein sequence
MTSRFPGRAL RGAVAAAATI ALGAGLVGCG DNSGSKDGGS QGKDTVTVAF RTPNWILPIS 
APGFTQGENA IFGQALYRPL YQYRLDGTAQ YNIDPERSMA EPPQVSDDGR TLTITLKDNS
WSDGTPITTR DIQFWYDLVS ANKDKWASYR AGGFPDNVDA WSIKDEKTFS ITTTEVYNTA
WFVDNQLNRI TPLPQHAWDK DSATADVSDL ASSPEGAKKV FDFLIAASKD PKTYDSNELW
QVTSGAWTLE KYVPNGEVTL AAQPNYSGTD KPKLSTVVLR PFTSDDAEFN VLRAGDIDYG
YVPPANMSQK SYLESKGYTI SPWYGWSITY LQLNYNNPKS GVLFKQPYLR QSLQMLIDQP
TISKVIWSDT AAPTCGPVPA RPGTNTDAAG CAYSFDPAKS KELLESNGWK VTPDGQTTCQ
SPGTGPNQCG EGIAAGTPLE FTVTSQTGFA ATTKMFAEIK SQMAKLGIQL TIKEVPDSVA
VTPTCEPTEA SCSWDMSFFG SQGSWYYPAF ASGERLFATD APVNLGSYSN PEADKLIEAT
QFAGDESALM AYNDFLAKDL PVLWMPNPVY QVSAYRSGLQ GVEPQDPMNL MYFQDWSWE
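
The sequences above are printed in space-separated groups of 10 characters, so they need to be joined before any analysis. The following Python sketch shows one way to do that and to compute a GC fraction and a basic stop-codon check. The helper names are hypothetical, and the stop codon set is the standard one used by translation table 11 (TAA, TAG, TGA). No result for the full gene is asserted here; pasting the complete Gene sequence block into `clean_sequence` would allow a comparison against the listed GC content and gene length.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if seq is a whole number of codons and the last one is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First line of the Gene sequence block, copied from the record.
first_line = "ATGACGAGCA GATTTCCCGG CCGCGCCCTG CGTGGCGCGG TGGCGGCAGC CGCCACGATC "
cleaned = clean_sequence(first_line)
assert len(cleaned) == 60          # six groups of 10 bases
assert cleaned.startswith("ATG")   # the record begins with an ATG start codon
```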