Gene Strop_2303 details


Gene Information

Locus tag: Strop_2303
Symbol:
ID: 5058766
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 2599215
End bp: 2600522
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 70%
IMG OID: 640474565
Product: major facilitator transporter
Protein accession: YP_001159131
Protein GI: 145594834
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
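
The length fields in this record agree with its coordinates, which can be checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive (the record does not state the convention) and that the gene ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 2599215
END_BP = 2600522


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one trailing stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1308
print(protein_length(length_bp))  # 435
```

Both results match the Gene Length and Protein Length fields listed above.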


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.533597
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.893433
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACTAAAC CCCTGGTCCG GGCAGATCCG GCACCCCCGC GCCGGCCGCG CTTCCCGCGG 
CTGCTCCAAC AGACGGACTT CCGCCGCTAC TGGTCCGCCC AGACCGTCTC GCTCTTCGGC
GACCAGATCA CCATGCTCGC CATGCCGCTG CTGGCCGTGC TCGCGCTCGG CGCCGGACCC
GCCGAGATGG GCTACCTGAC CGCCGCCTCG CTGCTGCCGA ACCTGCTCTT CTCACTGCCA
GCCGGCGCGT GGATGGACCG CTACCCGCGC CGACGTCAAG TAATGATCAT CGCCGACCTT
GGCCGGGCGG GCCTTCTGCT GGCCGTGCCG CTGCTGTGGT GGGCGGACGC ACTCAACCTG
CCGCTGCTCT GTGTCGTGGC CTTCCTGATC GGGGTCCTCT CGGTGTTCTT CGGGGTGGCC
CACAGCAGCC TGTTCGCCAG CCTGGTCCAG CGGCAGAACT ATGTGGACGC CAACAGTCTG
ATCAGCGGCA GCAGAGCCAT GTCCGACGTG GCCGGCCCGA GCATCGGCGG CGTACTCGTG
CAGGTGCTCA CCGCTCCGGT CGCGCTGGTC GCCGACGTGC TCACCTACCT GACGTCGGCT
GTCTTCCTGA CCCGGACAAC GGTCACCGAG CATCCCGCAC AGACCGGCCC CGGCATGGGC
ATGGCCACCG GCGTGCGCTA CGTGGCCCGC TCCGCGGTGC TACGCGCGGT CCTGCTCGGC
CTCACCACAC TCAACCTGTT CAATTTCATG TTCGCCGCGC TCTTCGTGCT GTACGTGACC
ACCGAGCTGG GCGTCTCCCC CGGGGTACTG GGCCTGGTCA TCGGGGCCGG CGCGTTCGGC
GGACTGCTCG GCGCGGCGGT CACCGGCCCG CTCAGTCGCC GGATTGGCAT CGGCCCGGCG
GTGATTCTCG GCCTCGTCGT CTTCCCGGCC CCACTGATCC TCGTGCCGCT GGCCGGGGGA
CCGCGGCCGC TGGTGCTCGC GCTGCTGCTC ACTGCCGAGT TCGTCTCCGC CCTGGGCGTC
ATGATCCTCG ACATCGCCGC CGGATCGGTG CAGATCGCGG CCACACCGAA GACGATGCTC
GCCGTGGTCT CCGGCGTCCG GCGCACCGTC AACTACGGCA TCCGGCCCGT CGGCGCGCTG
ATCGGCGGCG CGCTCGGCGC CACGATCGGG GTCCGTCCCG CCCTCTGGAT CGCCAGCCTT
GGTGCCCTGC TCGGAGCGTT CTGGGTCGTC TTCTCCCCGC TGCGCACCAT GCGTACCCTG
CCCGAGGAAT GGCATCCGCC CGCGGCGGGC CCGGTGGGCA GCAACTAG
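
GC content is the percentage of bases in a sequence that are G or C. The sketch below is an illustration, not part of the original record. The function name `gc_content` is hypothetical, and the demonstration runs only on the first three 10-base blocks of the gene sequence, so its result is not expected to equal the whole-gene figure reported in the record.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three 10-base blocks of the gene sequence above.
first_blocks = "GTGACTAAAC CCCTGGTCCG GGCAGATCCG"
print(f"{gc_content(first_blocks):.1f}%")
```

Passing the full gene sequence, with or without the spaces and line breaks, gives the whole-gene value.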
 
Protein sequence
MTKPLVRADP APPRRPRFPR LLQQTDFRRY WSAQTVSLFG DQITMLAMPL LAVLALGAGP 
AEMGYLTAAS LLPNLLFSLP AGAWMDRYPR RRQVMIIADL GRAGLLLAVP LLWWADALNL
PLLCVVAFLI GVLSVFFGVA HSSLFASLVQ RQNYVDANSL ISGSRAMSDV AGPSIGGVLV
QVLTAPVALV ADVLTYLTSA VFLTRTTVTE HPAQTGPGMG MATGVRYVAR SAVLRAVLLG
LTTLNLFNFM FAALFVLYVT TELGVSPGVL GLVIGAGAFG GLLGAAVTGP LSRRIGIGPA
VILGLVVFPA PLILVPLAGG PRPLVLALLL TAEFVSALGV MILDIAAGSV QIAATPKTML
AVVSGVRRTV NYGIRPVGAL IGGALGATIG VRPALWIASL GALLGAFWVV FSPLRTMRTL
PEEWHPPAAG PVGSN
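
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below is an illustration, not part of the original record. It uses the standard codon assignments, which translation table 11 shares for amino acids, and treats an initial GTG or TTG as methionine, as is usual for bacterial start codons. The names `translate` and `START_CODONS` are hypothetical, and the start-codon set is a deliberately small subset chosen for this example.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a small subset of bacterial start codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    # An alternative start codon is read as methionine at position 1.
    if protein and seq[:3] in START_CODONS:
        protein[0] = "M"
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGACTAAAC CCCTGGTCCG GGCAGATCCG"))  # MTKPLVRADP
```

The output matches the first ten residues of the protein sequence; the final codons of the gene, GGC AGC AAC TAG, likewise give the terminal residues GSN followed by a stop.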