Gene Strop_0659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_0659 
Symbol 
ID5057100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp740924 
End bp742195 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content60% 
IMG OID640472926 
Productmajor facilitator transporter 
Protein accessionYP_001157514 
Protein GI145593217 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.823087 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAGACA GGAATCGAAT CCCCCTTCTT TCCGCGTATC GCCCGGTCCT CACCGACAAC 
CGGTACTCCA GACTGATCTC CGCTTCAGCC GTTTCCTTCC TTGGCGATGG GATGGCACTT
GTAGGAATAC CCTGGTTGGC GATACAGCTA GCAAAGCCGA ATCAAGGGGG AGTTGTCGCT
GGACTGTCGC TGGCGGCGTA TTCGCTGCCC GGCGCGGTGG GAGCTCTTGC GCTAGGCCGA
TGGTTCGGCA GCATGAGCTC ACAGAAGTTG GTCGTCGCCA ACACCGGTCT GCGAGCTCTC
GGGCTGGCAG CGATTGCCGC ACTGGCGGCC ACCGACCGAC TCAACACCTT TACCTACATT
GCCCTCCTCG CCGCTTCGTC TCTCTTTTCG GCTTGGGGGA ACGCCGGTCT CTACGCTATT
GTCAGTCGGT TGTTCGCCGA TGGCCGTCAG CTTCCCGCAA ATGCCCTGCT GAGCGTTTCC
CAGCAGATTA GCATCATCAT CGGCCCGGCG ATTGCCGGGA TAATCTTGCT GGAACTAGAC
GGCTCGGCAA TCCTCGCAGT CAATGCGATC TCGTACCTGA TTCTTCTCAC TACGGTCGCC
CGCCTGGAGC TTCCACAACA ACTGGCGACC ACTCGGCGGA CTGGCGTCAT GGCCGGCTTC
CGGATACTCG CGCAGCGACC CCATCTTGTT GCCCTGTTGC TAGTCACGGC CGCCTTCTTC
TTCCTTTACG GTCCGGTCGA GGTTGGTCTT CCGCTCTATG TGGCCGGCAG CTTGGCTGGT
AGCGGCCAGC TACTCGGAGC ATTCTGGACC GCGTTCGGAG TCGGCGCAGT GATTGGCGGC
TTCGCAGGTG GCACCTTGGC CCGCGCGCCT AGATGGCCGG CTGTAATTGC CATAATCTTC
GGCTGGGGGC TAACCCTGCT TCCGTTTGGC ATGACCAACA GCACTTTAGC AACCATGACT
TCTTTCGCCC TCGGCGGACT GATCTACGGC CCATTTCCAG CGTTCACTAT TTCTCTTTTC
CAGCAGGCCG CAGACGCTTC TGAACTTACG TCCGTCCTGG CAGCCCGGTC AGCCGTCACT
ACCACGGCAA CACCTTTGGG TGCGGCGCTC GGCGGCCCAC TGGTCGCACT TTGGGGCGCC
GCAGACACAT TACTGTATTC CGGCATTGCC ACGATCACGC TCGCGGTGCT CGTCACCGCA
TCGATGCCAC TTCTCAAGAG ATCCACTCTG GGCGGTGAGG TCAAGGATCA GTTCACGGTC
ATCGGGCGCT AG
 
Protein sequence
MVDRNRIPLL SAYRPVLTDN RYSRLISASA VSFLGDGMAL VGIPWLAIQL AKPNQGGVVA 
GLSLAAYSLP GAVGALALGR WFGSMSSQKL VVANTGLRAL GLAAIAALAA TDRLNTFTYI
ALLAASSLFS AWGNAGLYAI VSRLFADGRQ LPANALLSVS QQISIIIGPA IAGIILLELD
GSAILAVNAI SYLILLTTVA RLELPQQLAT TRRTGVMAGF RILAQRPHLV ALLLVTAAFF
FLYGPVEVGL PLYVAGSLAG SGQLLGAFWT AFGVGAVIGG FAGGTLARAP RWPAVIAIIF
GWGLTLLPFG MTNSTLATMT SFALGGLIYG PFPAFTISLF QQAADASELT SVLAARSAVT
TTATPLGAAL GGPLVALWGA ADTLLYSGIA TITLAVLVTA SMPLLKRSTL GGEVKDQFTV
IGR