Gene Sros_5051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5051 
Symbol 
ID8668345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5574208 
End bp5575716 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content73% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003340584 
Protein GI271966388 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.261408 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGCTC TGCCCACATT GCTGTTGGCC ATGGACGTCA CAGTGCTCTA CCTCGCAGTG 
CCGCGTTTGG CGGCCGATCT ACGACCCAGC GGCGAGCAGA TGTTGTGGAT CACCGACGTC
TACGGGTTCA TGATCGCCGG ATTCCTCGTG ACGATGGGGG CGCTGGGCGA CCGGATCGGG
CGACGCAGGC TGCTGATGTG GGGAGCCGGG GCGTTCGGCA TGGCCTCGGT GGCCGCCGCG
TACGCCCCCA GCGCGGAGGC GCTGATCGCC GCCCGGGCGC TGCTCGGGAT TGCGGGGGCG
ACGCTGATGC CCTCCACGCT GGCGCTGATC AGCAACATGT TCCGGGACGC GCGGCAGCGG
GGAACCGCCA TCGGCATCTG GGCGGCGAGC ATGTCCGGCG GAGTCGCCCT GGGGCCGGTG
GTCGGCGGGG CGCTGCTGGA ATCGTTCGGG TGGGGGGCGG CGTTCCTGAT CGCCGTGCCG
GTGATGGCGC TGCTGCTGGT GGGCGGGCCG CTGCTGCTCC CGGAACACCG TGACACCGCC
GCCGGGCGGC CCGACCTGGT CAGCGTCGCG CTTTCCCTGA TCGCGATGCT GACGATCGTG
TACGGCGTCA AGCTGCTGTC CCACGGGGGC GACCCGGTCC TGAGCGGCGG GATCGTTCTG
GCCGGTCTGG CCGCCGGAGC GGTGTTCTGG CAGCGGCAGC GCGGACTCGC CGACCCGGTG
CTCGACGTGG CGCTCTTCCG GAATCGCGCC CTCACCGGCG CCCTGCTCGT CCTGCTCCTG
GGCCTGGCCG CCACCGCCGG CACGTACCTG TTCGTCACGC GGTTCCTCCA GGGAGTCGAG
GGCCTGTCCC CGCTGGCGGC GGGCCTGTGG CTGGTGCCCT CGTCGGTCGC TATGATCCTC
ACCTCCCTGG TCGCCCCCAT CCTGGTACGG CGGCTGCCCG AGCGGGTCGT GGTCGCCGGG
TCCCTGGCGG TATCGGCGGC CGGTTTCCTC CTGCTGGCCC TGCTCGACCA GGCGGCCGGG
CTCCCCCTCC TGATCGCCGG TATCGTCGTC GTCTACGTCG GCCAGGGCCC GATCATGACG
CTGGGCACCG ACCTCGTTGT CGGCTCCGCC CCGCCCGAGA AGGCCGGATC CGCCGCGGCC
ATGTCCGAAA CGAGCACCGA ACTCGGCCTG GCCCTGGGCG TGGCCGTCCT GGGCAGCGTC
GGAGCCGCCG TCTACCGTCA GGCCGTGCCC GCCGCGCTTC CCGCCGACCT CCCGCCGGAG
GCGACCTCCG CCGCCCGCGA CTCGATCGAG GCCGCGGTCA CCGCCGTCGC CCATCTTCCG
CCGGCCCAGG CGGGAGCCGT ACTCGCCCTC GCCCGGGAGG CGTTCACCGC AGGCCTCAAC
CTCGTCGCCG GGATCGGCGC AGTCGCCACC CTGGCCCTGG CCGTACTCGC CTTGGTGGCC
CTCCGCAGAC GGCCACCGGT CGCCGAACCC GCCCCCGCCG AGACGGCGGA GGTTCTCGGC
CGGGCCTGA
 
Protein sequence
MLALPTLLLA MDVTVLYLAV PRLAADLRPS GEQMLWITDV YGFMIAGFLV TMGALGDRIG 
RRRLLMWGAG AFGMASVAAA YAPSAEALIA ARALLGIAGA TLMPSTLALI SNMFRDARQR
GTAIGIWAAS MSGGVALGPV VGGALLESFG WGAAFLIAVP VMALLLVGGP LLLPEHRDTA
AGRPDLVSVA LSLIAMLTIV YGVKLLSHGG DPVLSGGIVL AGLAAGAVFW QRQRGLADPV
LDVALFRNRA LTGALLVLLL GLAATAGTYL FVTRFLQGVE GLSPLAAGLW LVPSSVAMIL
TSLVAPILVR RLPERVVVAG SLAVSAAGFL LLALLDQAAG LPLLIAGIVV VYVGQGPIMT
LGTDLVVGSA PPEKAGSAAA MSETSTELGL ALGVAVLGSV GAAVYRQAVP AALPADLPPE
ATSAARDSIE AAVTAVAHLP PAQAGAVLAL AREAFTAGLN LVAGIGAVAT LALAVLALVA
LRRRPPVAEP APAETAEVLG RA