Gene Strop_4112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_4112 
Symbol 
ID5060594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4675370 
End bp4676851 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content69% 
IMG OID640476373 
Productmajor facilitator transporter 
Protein accessionYP_001160920 
Protein GI145596623 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.965802 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACGGCCC GCGAGCCCCG TTCCGAGCCG GCGCCGGTGG GCGCGGGGTC GCGCGGACGG 
GCCCGGCGGT GGCTGGTAGC CGCGGTCTGC CAGGTGGTCG TATTCCTCGG GACGGCCACC
ACCGCGATGT TGGCCATCGC GGTACCTCAG ATCACGTCGA CCCTGGGCCT CGGCGAGGAC
GGCCAGCAGT GGATGGTTGC CGGCTACGCG CTGGCGTACG CCCTGATGTT GGTACCCGGC
GGTCGGCTCG GTGACGTGTG GCGACGGCGC GTCGTCTTCG TCTCCAGCCT GGTGCTCAAC
GGGCTGGCGG CCATGTCCGC CGCCCTTGCG CAGACGCCGG TCTGGCTGGT GCTCTCCGTG
TTGGTGCAGG GCGGTGCCCT CGGGGTGGTC AGCCCTCAGG TCCTTGGGCT TTTCCAGCAG
TTGTTCAACA AGGAGGAGCG GGGCCGTCCA TACGGTCTGC TCGGGGTGGC GTTCGCCGTC
GCCCTCGGAG CCGGACCGGT CCTGGGCGGG GCGCTGGTCG ACGTCAGCCC GGAGAACGGC
TGGCGGCTGG TCTTCGTCGT GAGCGCCTCG ATCGCTTTCC TGGCAGCTGC CCTGGGGTTT
CTCCTGATAC CCGCGTCGAC GACACCCCCG GATCGTTTCT GGTGGCGACG GACGGACCCG
GTGGGCATCG TCTTGTTCGT CGTCGGGATG GTGGCGCTCT GGATCCCGAT GGTGCAGGAG
GCGGCATGGG GCCCTGTTCT GGGGGTGCTG GCGCCGGTCG GCGTGGTCGT TCTCGCCGGC
TTCGTGTTCT GGGAGCGGCG TCAGGTGAAA CGGGGGTCGC CGCTGGTTGA CCTGAGCCTG
CTGCGGGTCC GGTCGTACGG TCTGGGCGCG GTCATCGCGG TGTTGTTCGG CGCCTATGAC
GCGCTGTACT ACGTATTCGC GCTGTATCTG CAGGATGGGG TGGGCCACAG CCCGCTCACC
ACCGGTCTCG TGATGGTGCC GATTGCCGGG GGCACCGCGG CGGGGGCGGT CGTCGGGGGT
CGGCTGGCCT GGCGGGCAGG TCGCCGAATA GTAGCCGTCG GACTGCTGAC GTCCTTGGTC
GGACTGGCGG CGGTCATGGT TGGTGACCTC TTCCTGCCGA CCTTCGGCAG CCCGCACTCG
GCGGCCCTGC CACTGCTGCT TGCCGGTCTC GGCGCGGGCT TCGTCCTCAG CGGCATGGGC
AGCGGACTGA CCAACATCCC GAACCAGGCC GTGACCATGT CACAGGTATC CAGTACGCGG
GCCGGCAGCG CTGCCGGGAT GTTGCAGACC GGGCACCGGC TTGGAATCTC GGCCGGAACG
GTCGGCGTCA GCACCGCACT GTTCGCAACG TTGGACCGTA CCGGCGGTAA CTGGTTGGCG
GCCTTCCGGA CCACCTTGTT GATCATCGTG GCGTGCGTCC TCGTCGCACT CCTGATCGCC
CTGATGGACA TCTTCACCAG AAAGGAGGGG GCAGCTCGGT GA
 
Protein sequence
MTAREPRSEP APVGAGSRGR ARRWLVAAVC QVVVFLGTAT TAMLAIAVPQ ITSTLGLGED 
GQQWMVAGYA LAYALMLVPG GRLGDVWRRR VVFVSSLVLN GLAAMSAALA QTPVWLVLSV
LVQGGALGVV SPQVLGLFQQ LFNKEERGRP YGLLGVAFAV ALGAGPVLGG ALVDVSPENG
WRLVFVVSAS IAFLAAALGF LLIPASTTPP DRFWWRRTDP VGIVLFVVGM VALWIPMVQE
AAWGPVLGVL APVGVVVLAG FVFWERRQVK RGSPLVDLSL LRVRSYGLGA VIAVLFGAYD
ALYYVFALYL QDGVGHSPLT TGLVMVPIAG GTAAGAVVGG RLAWRAGRRI VAVGLLTSLV
GLAAVMVGDL FLPTFGSPHS AALPLLLAGL GAGFVLSGMG SGLTNIPNQA VTMSQVSSTR
AGSAAGMLQT GHRLGISAGT VGVSTALFAT LDRTGGNWLA AFRTTLLIIV ACVLVALLIA
LMDIFTRKEG AAR