Gene Strop_0216 details

Gene Information

Locus tag: Strop_0216
Symbol:
ID: 5056654
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 246131
End bp: 247792
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 66%
IMG OID: 640472488
Product: extracellular solute-binding protein
Protein accession: YP_001157079
Protein GI: 145592782
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
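
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 246131
END_BP = 247792
GENE_LENGTH_BP = 1662
PROTEIN_LENGTH_AA = 553


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 247792 - 246131 + 1 gives 1662 bp, and 1662 / 3 - 1 gives 553 aa, which agrees with the listed values.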


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.923383
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGCAT CCAGGCCGAA GGTCGCTGTC GCGGCCGTCG CGGTCGCGGC CCTCGCGGTA 
GCCGGCTGCG CCGAGAGCGA CCGCGAGGAT TCCGGTGGTA GCAACAACGA CACCCTCGTC
TTCGGCGTCG CCGGAGACCC GAAGGTGCTC GACCCGAGCT TTGCCAGCGA CGGCGAGTCG
CTGCGCGTGG CCCGTCAGGT CTTCGAGACC CTGGTCCGCC CCGAGGAGGG TGGCACCAAG
GTGAGCCCGG GCCTCGCGGA GTCCTGGACT CCGGACGAGG CGGGCACCAC CTGGACCTTC
AAGCTCCGCT CGGGCGTGAA GTTCCACGAC GGCACCGACT TCGACGCCGA GGCCGTCTGC
GTCAACTTCA ACCGTTGGTA CAACGCCACC GGCCTCATGC AGAGCCCGGA CGTGACCGCC
TACTGGCAGG ACGTGATGGG CGGCTTCGCC CAGAACGAGG ACGAAGGGCT CTCGGAGAGC
CTCTTCAAGT CCTGCACCGC CAAGGACGCC ACCACGGTCG ATCTGACCTT CACCCGGGTG
TCCAGCAAGA TCCCGGCCGC CCTGATGTTG CCGTCGTTCT CCATCCACAG CCCGACGGCG
CTGGAGCAGT ACGACGCGAG CAACGTCGGC GGCACCGCGA CGGACGTCCA GTACCCCGAG
TACGCGACCG CGCATCCGAC CGGCACCGGG CCCTTCAAGT TCAAGTCCTG GGACATCGCC
AACAAGTCGC TCACCATCGA GCGAAACGAC GACTACTGGG GCGAGAAAGC CAAGCTGAAG
ACCCTCATCT TCCGGACCAT CCCGGATGAG AACGCGCGCA AGCAGGCGCT GAGCTCCGGC
GACATCCAGG GCTACGACCT GGTCGGGCCG GCTGATGTCG AGCCGCTGAA GGGCGAGGGC
TTCAACGTCC TGACCCGGCC GGCGTTCAAC ATCCTCTACC TGGGCATGAA CCAGCAGGGG
AACCCGAAGC TGGCTGACCT CAAGGTCCGG CAGGCGATCG CCCACGCGAT CAACCGGCAG
GCCCTGGTCG ACTCCAAGCT CCCCCCGGGT GCGAAGGTCG CGACCAACTT CTTCCCGGAC
ACGGTCGAGG GGTGGAACGG CGACGTCACC ACCTACGACT ACGACGTCGA CAAGGCCAAG
CAGCTGCTGG CCGAGGCCGA CGCGACGGAC CTGACGCTGC GGTTCCACTA CCCGACCGAG
GTCACCCGTC CGTACATGCC GAACCCGAAG GATCTCTTCG AGCTGGTCTC GGCGGACCTC
CAGGCGGCCG GCATCACGGT CGAGCCGATT CCGCTCAAGT GGAGCCCGGA CTACCTCAAC
GCCACCACCT CCGGTAGCGA GCACGACCTG CACTTCCTCG GCTGGACCGG TGACTACGGC
GACGCCTACA ACTTCATCGG CACCTTCTTC GACCGGCAGA AGGACGAGTG GGGCTTTGAC
AACCCGGCCC TCTTCGAGCA GTTCCAGGAC GCCGACTCCA CCGCGGATAT GGCGTCACGG
GTGGAGAAGT ACAAGGGTCT GAACAAGACC ATCATGGACT TCCTGCCCGG GGTGCCGATC
TCGCACTCGC CGCCGGCGAT CGTGTTCGGC AAGGACGTCA CCGGCATCAA GGCCAGCCCG
CTCACCGACG AGCGGTTCGC GAAGGCTGAG TTCACGTCCT GA
 
Protein sequence
MRASRPKVAV AAVAVAALAV AGCAESDRED SGGSNNDTLV FGVAGDPKVL DPSFASDGES 
LRVARQVFET LVRPEEGGTK VSPGLAESWT PDEAGTTWTF KLRSGVKFHD GTDFDAEAVC
VNFNRWYNAT GLMQSPDVTA YWQDVMGGFA QNEDEGLSES LFKSCTAKDA TTVDLTFTRV
SSKIPAALML PSFSIHSPTA LEQYDASNVG GTATDVQYPE YATAHPTGTG PFKFKSWDIA
NKSLTIERND DYWGEKAKLK TLIFRTIPDE NARKQALSSG DIQGYDLVGP ADVEPLKGEG
FNVLTRPAFN ILYLGMNQQG NPKLADLKVR QAIAHAINRQ ALVDSKLPPG AKVATNFFPD
TVEGWNGDVT TYDYDVDKAK QLLAEADATD LTLRFHYPTE VTRPYMPNPK DLFELVSADL
QAAGITVEPI PLKWSPDYLN ATTSGSEHDL HFLGWTGDYG DAYNFIGTFF DRQKDEWGFD
NPALFEQFQD ADSTADMASR VEKYKGLNKT IMDFLPGVPI SHSPPAIVFG KDVTGIKASP
LTDERFAKAE FTS
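
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before they can be compared. The sketch below shows one way to do that, translate the nucleotide sequence, and compute a GC fraction. It assumes the standard codon-to-amino-acid mapping, which translation table 11 shares with table 1 for internal codons, and it does not handle alternative start codons, which is adequate here because the gene begins with ATG. The helper names are hypothetical.

```python
# Standard codon table, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGCGTGCAT CCAGGCCGAA GGTCGCTGTC GCGGCCGTCG CGGTCGCGGC CCTCGCGGTA"
print(translate(first_line))  # MRASRPKVAVAAVAVAALAV
```

The printed residues match the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.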