Gene Sros_5250 details


Gene Information

Locus tag: Sros_5250
Symbol:
ID: 8668544
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 5762681
End bp: 5764114
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003340762
Protein GI: 271966566
COG category:
COG ID:
TIGRFAM ID:
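
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5762681
end_bp = 5764114

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the Gene Length field
print(protein_length, "aa")  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the Gene Length (1434 bp) and Protein Length (477 aa) fields of the record.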


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.411045
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCACA AGCCCCCTGC CAGAGCCGGA GTCCGGGAGT GGACCGCCTT GGCGGTCTTG 
GGTGTTCCCG CCGTGCTGGT CATGATGAAC ATGTCCGTGC TCTATCTGGC GCTGCCAAGC
CTGAGCGCCG ATCTGGAGCC CGACGGCCCT CAACTGCTGT GGATCACCGA CATCTACGGC
TTCATGGTTG CCGGATCGCT GGTCACGATG GGCACGCTCG GCGACCGTCT CGGGCACCGC
AGGATCCTGC TGGTCGGCGC CGTCGCGTTC ACCGGCGCAT CCGTTATCGC CGCATATACC
CCCAGCGCCG GCCTGCTCAT CGCAGCACGG GCGGTCCAGG GAGTCGCCGC CGCCGCTCTG
GCGCCTTCCT CGCTGGCGCT GATCCGCACC CTGTTCGTCG ACGTTCGGCA GCGCACCCTG
GCCATCACGA TCTGGATGAT GGCTTTCATG GGCGGCGGTG CGCTCGGCCC GCTGGTCGGT
GGTGTGCTGC TGGAGTACTT CTGGTGGGGG GCGGTGTTCC TGCTCGCCGT TCCGACTATG
GCCCTGCTGC TGGTCACCGG CCCTTTCCTG ATCCCTGAAT CCCGCGCTTC GGCTTCGGGG
CGACTGGATG CGGTCAGTGT GGTCCTGTCG CTGCTGACTC CTGTGACGAT CGTGTTCGGC
ATCAAGGATC TCGCCGTCCA CGGCTTTGCC CTGCCGTCCG CCGGTGTCCT GGTTGCGGGC
CTGGGGATCG GCGCGGTCTT CGTACGCCGC CAGCGGCGGC TGTCCAATCC GCTCCTGGAC
CTGGGACTGT TCCGTATCCC CGCCTTCGCC GTATCCGTCA CAGGCATGGT GCTGGTCGGC
ATCGTGCTGT TCGGGACCAG CCTGCTCACC TCGCAATACC TGCAACTGGT GCTCGGCCTC
TCTCCGTTGA AGGCCGGCCT GTGGCAGCTG CCCACCGCGG TCACCGGTAC CGTCGTGGCC
CTGGTGGTCT CCGGCCTCAC CGGCCGAATC CGCCCGGCCG TCCTCATGAG TGCCGGAGCC
ACGTTCGCCG TCATCGGTCC CATCCTGCTC ACCCAGGTGG ACAGAGACCC CGTCATCCTG
GTGTCCGGCT CCGTTCTTCT CTTCGCCGGC CTGACCCCGT TCATGGCGCT GGGAACCAAC
CTGGTCCTCG GTGCGGCACC ACCCGAACGC GCCGGGGCCG CCTCAGCGAT CTCCGAGACC
GGCGCCGAAC TCGGCGGCGC GCTCGGCGTC GAGGCCGGAC TTGATCCCGC CCAGGTAGGT
GAACTTTTGA CCGGCGACGC GTTCGCCGCT GAAGTCCGAG CCGACGAACG CCGCGCCGCC
CAGAGGGGAA TCCGAGGCGT GCCCGCACTC GTCATTGACG GAGCCCCGCC AGTCTCGGCC
GTTCAGGAAC CTGCCGCTCT GGCAAGCCTC CTCGAACGCG CGACACGCTT CTGA
 
Protein sequence
MAHKPPARAG VREWTALAVL GVPAVLVMMN MSVLYLALPS LSADLEPDGP QLLWITDIYG 
FMVAGSLVTM GTLGDRLGHR RILLVGAVAF TGASVIAAYT PSAGLLIAAR AVQGVAAAAL
APSSLALIRT LFVDVRQRTL AITIWMMAFM GGGALGPLVG GVLLEYFWWG AVFLLAVPTM
ALLLVTGPFL IPESRASASG RLDAVSVVLS LLTPVTIVFG IKDLAVHGFA LPSAGVLVAG
LGIGAVFVRR QRRLSNPLLD LGLFRIPAFA VSVTGMVLVG IVLFGTSLLT SQYLQLVLGL
SPLKAGLWQL PTAVTGTVVA LVVSGLTGRI RPAVLMSAGA TFAVIGPILL TQVDRDPVIL
VSGSVLLFAG LTPFMALGTN LVLGAAPPER AGAASAISET GAELGGALGV EAGLDPAQVG
ELLTGDAFAA EVRADERRAA QRGIRGVPAL VIDGAPPVSA VQEPAALASL LERATRF
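
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that formatting, translate the DNA, and compute a GC fraction. It is an illustration under stated assumptions, not the method the database used. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares with table 1; table 11 differs in which codons may act as start codons, and that does not matter here because the gene begins with ATG.

```python
# Codon table in the conventional TCAG ordering (amino acids as in table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence, copied from the record above.
first_blocks = "ATGGCTCACA AGCCCCCTGC CAGAGCCGGA"
print(translate(first_blocks))  # compare with the start of the protein sequence
```

Pasting the full gene sequence into `translate` and `gc_fraction` allows the result to be compared with the protein sequence and the GC content field of the record.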