Gene Strop_1492 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_1492 
Symbol 
ID5057945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp1702965 
End bp1704692 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content68% 
IMG OID640473760 
Productmajor facilitator transporter 
Protein accessionYP_001158336 
Protein GI145594039 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.926089 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.297712 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGCTC AGGAACCGCA GCGGGCGGGG CGGCGAGAGT GGTGGGGTCT TGCTGCCCTG 
GCCCTGCCGA CGCTGTTGGT GGCGATGGAC CTCACGGTGC TGCATCTGGC GGTGCCGGCG
CTCAGTGAAG CGTTCAAGCC GAGCAGTTCG GAACTGCTCT GGATCACGGA CATCTACGGC
TTCCTCATCG CCGGCTTCCT GATCACGATG GGGTCGCTCG GCGACCGGAT CGGCCGACGT
CGGCTGTTGT TGATCGGCGC GTTCGGGTTC GGCGTCGCGT CGTTGCTGGC GGCTTTCTCC
ACCACACCGG AGATGCTGAT CCTCACCCGG GCGCTGCTCG GCGTCGCCGG GGCCACGCTG
ATGCCCTCGA CGCTGGCGTT GATCCGCAGC ATGTTTCAGA GCCCGGCGCA GCGGCCGATG
GCGATCGGTG TCTGGATGAC CAGCTTTGTC ACCGGGACCG CCATCGGACC GCTGGTCGGT
GGGGTTCTCC TTGGGTCCTA CTGGTGGGGG TCGGTGTTCC TTCTCGGCGT GCCGGTGATG
GTGCTGTTGT TGATCATTGG TCCGATGCTG CTTCCAGAGG AGGCGGATCC TGATGCTGGC
CGCCCGGACC TGCACAGCGC TCTCCTCTCG CTGGTCACCA TTCTGGCCGT AGTGTTCGGC
TTCAAGCAGT TCGCCGGGGG GTCCAGCGGT CCGCTGCCGT GGACGTGCGT GGCCCTGGGC
GCCGGTCTCG GCTGGATCTT TGTTCGTCGG CAGAGGCTTT TGCGGGACCC CCTGCTCGAC
CTGGCGCTAT TCACCCGCCG TACGTTCGGG GTGGCCGTCG CCACGCAGAC CCTGACGGTC
TTTGCGATGG CGGGCGTGCA GTTCTTCGCG GGCCAGTACC TTCAGCTCGT CCTCGGCATG
TCCCCGCTGC GAGCCGGACT CTGGTCCCTG CCGTGGACGC TCATGGGCGT CGCCGGGGCG
ATGCTGGCGC CGTTTGTCGT CCGCCGTCTG TCGCGGTCGA CAGTGATGGC GGGCGGCCTG
GCGCTCGGCG CTGCGGGGTT CGGCGCTCTG GCCGGCATCG ACTCGACTAC CGGACTCACC
TTGCTGGTGG TCAGCATGGT CATCGCATCC CTCGGAATCT CCGGCACGAT GACCCTGAGC
ACGGACCTAG CCGTCGGGGC GGCTCCTGAG GAACGCGCCG GGTCGGCGTC GGCTATCTCC
GAGACCGGAA GCGAACTCGG CCTCGCCCTG GGCATCGCGG TACTGGGCAG TGTCGGCAGC
GTCGTCTATC GCCAGGAACT GGCCAGGTCC GCGCCCGTGT CAACCCTGCC CGAGTCGATG
GAGGTCGCCA CCGACACGCT GGGCGGGGCG GTCAGCGTGG CAAACGAGCT GCCCGGCGAG
GCTAGAGACG CACTACTCGC CTCTGCCCGG GAGGCGTTCA CCACCGGGAT GGCGGTGACA
TCGGCGATCA GCAGTGCGGT GCTGATCGCC CTGGCGATCG CCGTCGGCAT CCTGCTGCGC
ACCGTGGTCC ATCGACGAGT CCGGCCTCCT GGACGAAGGC GCATGACCGC CGCGAATCAG
CCCGGTCGGC CGGGACCGGC CGACCGGGCT CCGGCCTCAG GCCGTCGGCG CCGGTTCCAG
CAACGCGCGA ACGTCCCATA CCCAGACGTC CAGTTCACGC TTGCCGGGGC CGACCAGTTG
TTCGACGGTC AGCCGCAGCG GCTCCGCATG GTGCTGGTCC ACCGGTAG
 
Protein sequence
MDAQEPQRAG RREWWGLAAL ALPTLLVAMD LTVLHLAVPA LSEAFKPSSS ELLWITDIYG 
FLIAGFLITM GSLGDRIGRR RLLLIGAFGF GVASLLAAFS TTPEMLILTR ALLGVAGATL
MPSTLALIRS MFQSPAQRPM AIGVWMTSFV TGTAIGPLVG GVLLGSYWWG SVFLLGVPVM
VLLLIIGPML LPEEADPDAG RPDLHSALLS LVTILAVVFG FKQFAGGSSG PLPWTCVALG
AGLGWIFVRR QRLLRDPLLD LALFTRRTFG VAVATQTLTV FAMAGVQFFA GQYLQLVLGM
SPLRAGLWSL PWTLMGVAGA MLAPFVVRRL SRSTVMAGGL ALGAAGFGAL AGIDSTTGLT
LLVVSMVIAS LGISGTMTLS TDLAVGAAPE ERAGSASAIS ETGSELGLAL GIAVLGSVGS
VVYRQELARS APVSTLPESM EVATDTLGGA VSVANELPGE ARDALLASAR EAFTTGMAVT
SAISSAVLIA LAIAVGILLR TVVHRRVRPP GRRRMTAANQ PGRPGPADRA PASGRRRRFQ
QRANVPYPDV QFTLAGADQL FDGQPQRLRM VLVHR