Gene Strop_3945 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_3945 
Symbol 
ID5060426 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4498504 
End bp4499769 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content69% 
IMG OID640476205 
Productmajor facilitator transporter 
Protein accessionYP_001160753 
Protein GI145596456 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.736514 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATCGCCC TCGAAGACCT ACCTCCGGGT GGTGTGGGCG TAACGGCGGG GCGGCGACTG 
CCCACCTCGT ACCTGCTCTG GCTCGCCGGG ATCCTCGCCT CGCTGCTGGG CAACTCGCTC
TTCTACTTCG CCCTCGGTTG GGAGGCGAGT GCGCACGGCG GTGCCGTCGC CGGTCTTGTT
CTCACCGCCA TCACGCTGCC GCGGGTGCTG CTGCTCCTGA TAGGTGGAGC GGTCGGGGAC
CGGGTGAGTG CCCGTCGAGT TCTCATCATC GGCGATGCCG TGATGCTTGT CTTCTCCGTC
GCCCTGGCGG CGTCGGCCTA CCACCTCGGG GCGCCGCCCT GGCTGCTGAT CGCCGCCGGC
GTCGCCGTGG GGGTCGCGGA CGCCTTCTAT CTGCCAGCGT CCGGATCGAT GCCGCGACGG
CTGGTAAGCC AGGACCAACT TTCGCAGGCC CTGGCGTTGC GCCAGGTTGG CGGTCAGCTG
GTCGCCATGG GCGGCGGTCC GCTCGGCGGC GTTCTCGTCG GGTTGGCCGG GTTGGCGGGG
GCCGCCCTGG TCAACGCGGT GACCTTCGCC GCGGTGCTGA CGCTTCTGAT CATCATCCGG
CCTCGGTACA ACGGGCCCGC CACCGCGCGC AGCGGAGGGG TTGTGCGCGA CGCGGTCGAC
AGCATCCGCG TCGGCTTCCG TGATCCGGTC CTGCGTCCCG GGCTAACGCT GACCGGGGCC
GCGGCAGGTT TCCTGCTTCC GGTGCTTCCG TTGCTGGTCC CACTGCTTGC GCGGGCGGAG
GACTGGGGGG CGGCGGCTGG TGGTCTGATC TTTGGGGCGC AGGGCGTTGG TATGGCCATC
GTCACCCTGG CTGTCGTGCG TCGCGGCCCG CTCGGCCGGC CCGGTCTACT CGCTGCCTGC
GGCCTATTGA TCGCTGGTGC TGGAGTTGCT GGGCTGGCGC TCTCCTCCAC TGTGGGGATC
GCCGTCGGCG TGGGGCTGAT CATGGGGTTC GGGAGTGGGC TCTTCGCCTC GCACCTGGGT
CCGCTGATCC TCGGCGTGAC TCCGGACACT CACCTCTCTC GCATTCAGGC CCTGCTGACA
CTGGTGCAGA GCCTGGCTTC GTTGATCATG GTTAATGTGC TCGGCCTCAT CGTCGATCAC
CGCGGAGCGG CGGTGGCGAT CCTGATCTGC GCAGCGGCCA CGAGCTGCGT TGGGCTGCTG
GGCCTACGGT CCGCGCCGCT GCGTACCAGT CGCTTCGGAC TGAACACCAC CTCCGTCGAC
CGATGA
 
Protein sequence
MIALEDLPPG GVGVTAGRRL PTSYLLWLAG ILASLLGNSL FYFALGWEAS AHGGAVAGLV 
LTAITLPRVL LLLIGGAVGD RVSARRVLII GDAVMLVFSV ALAASAYHLG APPWLLIAAG
VAVGVADAFY LPASGSMPRR LVSQDQLSQA LALRQVGGQL VAMGGGPLGG VLVGLAGLAG
AALVNAVTFA AVLTLLIIIR PRYNGPATAR SGGVVRDAVD SIRVGFRDPV LRPGLTLTGA
AAGFLLPVLP LLVPLLARAE DWGAAAGGLI FGAQGVGMAI VTLAVVRRGP LGRPGLLAAC
GLLIAGAGVA GLALSSTVGI AVGVGLIMGF GSGLFASHLG PLILGVTPDT HLSRIQALLT
LVQSLASLIM VNVLGLIVDH RGAAVAILIC AAATSCVGLL GLRSAPLRTS RFGLNTTSVD
R