Gene Strop_2403 details


Gene Information

Locus tag: Strop_2403
Symbol:
ID: 5058866
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 2697742
End bp: 2699067
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 69%
IMG OID: 640474662
Product: extracellular solute-binding protein
Protein accession: YP_001159228
Protein GI: 145594931
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
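
The coordinate and length fields above are consistent with one another, which a short Python sketch can check. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2697742, 2699067)
print(length)                  # 1326
print(protein_length(length))  # 441
```

Both results match the Gene Length and Protein Length values listed in the record.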


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAAGAC CCAAGGCGCT GCTGGCGGCG CTGCTGGCTA CGGTGCTCGT CGCGACCGGC 
TGTGGAGCCG GACCCGGGAC CGCCTCCGAC GGGCCGGTGC GGCTGCTGGT TTTCGGTGCC
CCCGAGGAGC TGGCCGCGTA TCGCACGCTG ATCGAGGCGT ACGGTCAGGC CCGGCCCGGC
AACGAGGTGC AGCTCATCGA GGCGAGCGAC CGCAAGGACC TGCTGGCCCG GCTCGCCACC
TCGGTCGCCG GGGGTGCCCC GCCGGACCTG TTCCTGATGA ACTACCGCTT CTATGGCCAG
TTCGCCGCGA AGAACGTGGT CGAGCCCTTG GACGAGCGGA TCGCCGCCTC CGAGAAGGTG
AATCCCGCTG ACTACTACCC GGTGGCGATG GAGGCCTTCA CCTGGGGCGG CCAACAGCTC
TGTCTACCGC AGAACGTCTC CAGTCTCGCC GTCTACTACA ACCGCACCCT CTTCGCCGAA
TACCAGGTCC CCGAGCCGAA GGCGGGCTGG ACCTGGAACG ACATGGTCGG CACCGCCATC
GCGATGACCC GGGACGACCG TGGTGTGATG GTCAAGGGCA CCGAGAGCGA GGGGGCCGCT
GTCCGCCCGG CCGTGCACGG GCTCGGCGTC GAGCCGTCGA TCATCCGCGT CGCCCCGTTC
ACGTGGTCCG CCGGCGGCGA GATTGTCGAT GACCCGGATC GGCCGACCCG ACTCACCCTG
GACACCCCGA CCGGCCGAGA GGCGCTGAAG AACCTGGTCG ACCTCCGCCA GGCGTACGGG
GTGGTTCCCA CCGACGAGGA GGTCGAGGCC GAGGACGACG AGTCCCGCTT CGCCAACGGT
CGGCTCGCCA TGCTGATGTC CTCGCGACGC TCCACCACCA CCTTCCGCTC GATCACCGGC
TTCGAGTGGG ACGTCGCCCC ACTGCCGGTC TACCAGGACC AGGTGGGGGT GCTGCACTCC
GACGCGTACT GCATGACTCG GGGTGCCAAG CGTAAGGATG CGGCGTGGCG GTTCCTGGAG
TTCGCCATCT CCGCCGAGGG GCAGGAGATC ATCGCCGCCA CCGGGCGGAC CGTGCCGTCG
CACATCGGTG TCTCACAGTC CCCGGTGTTC CTCGACTCGT CCCAACCACC ACGTAACGCG
ACGGTCTTCC TCGACACGGT CCCCACCCTG CGGACGTTGC CGACAGTTTC CACCTGGCCG
GAGGTCGAGG ATGTGACCGC CGGGATCCTG GAGAACGCGC TGTACCGGGG CGACCGGTTG
GACGACGTCA TCCGCGCTCT GGATGAGCAG ACCCGCCCGC TGTTCGCCCG TGGTGAGCAC
GGGTGA
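
The record does not say how its GC content figure was computed. A minimal Python sketch, assuming the simplest definition (G plus C bases divided by total bases), is shown below. It ignores the whitespace that separates the 10-base blocks in the listing. The function name is hypothetical, and the sample string is only the first row of the gene sequence, so its result is not expected to equal the whole-gene value.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First row of the gene sequence above, used here only as a short sample.
first_row = "ATGAGAAGAC CCAAGGCGCT GCTGGCGGCG CTGCTGGCTA CGGTGCTCGT CGCGACCGGC"
print(f"{gc_fraction(first_row):.0%}")
```

Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.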
 
Protein sequence
MRRPKALLAA LLATVLVATG CGAGPGTASD GPVRLLVFGA PEELAAYRTL IEAYGQARPG 
NEVQLIEASD RKDLLARLAT SVAGGAPPDL FLMNYRFYGQ FAAKNVVEPL DERIAASEKV
NPADYYPVAM EAFTWGGQQL CLPQNVSSLA VYYNRTLFAE YQVPEPKAGW TWNDMVGTAI
AMTRDDRGVM VKGTESEGAA VRPAVHGLGV EPSIIRVAPF TWSAGGEIVD DPDRPTRLTL
DTPTGREALK NLVDLRQAYG VVPTDEEVEA EDDESRFANG RLAMLMSSRR STTTFRSITG
FEWDVAPLPV YQDQVGVLHS DAYCMTRGAK RKDAAWRFLE FAISAEGQEI IAATGRTVPS
HIGVSQSPVF LDSSQPPRNA TVFLDTVPTL RTLPTVSTWP EVEDVTAGIL ENALYRGDRL
DDVIRALDEQ TRPLFARGEH G
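
The relationship between the gene sequence and the protein sequence can be sketched in Python as a plain codon-by-codon translation. This assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code; the two tables differ in which codons may act as start codons, and that does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical, and this is an illustration rather than the method used to produce the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence above.
print(translate("ATGAGAAGAC CCAAGGCG"))  # MRRPKA
```

The output matches the first six residues of the protein sequence. Applied to the last twelve bases (`GAGCACGGGTGA`), the same function returns `EHG`, which matches the end of the protein.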