Gene Sros_4089 details


Gene Information

Locus tag: Sros_4089
Symbol:
ID: 8667383
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4547771
End bp: 4549090
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: major facilitator transporter
Protein accession: YP_003339740
Protein GI: 271965544
COG category:
COG ID:
TIGRFAM ID:
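
The gene and protein lengths above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the last codon is a stop codon (the variable names are illustrative, not part of the record):

```python
# Coordinates copied from the record above.
start_bp = 4547771
end_bp = 4549090

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# One residue per codon; the final codon is assumed to be a stop codon
# and therefore contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1320 439
```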


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGGGA CCGCGCGCGG TGGCCTGCTG CGCCGTCACC GCGACTTCCG GCTGCTGTGG 
TGCGGTGAGG TCGCGGGCAA GTTCGGCGCC GCCGTCACCG GTGTGGCGAT GCCGCTGATC
GCCGTCTCCA CCCTGCACGC CGGCACCTTC GAGGTCAGCC TCATTTCCGC CGCCACCTGG
CTGCCCTGGC TCCTCATCGC CCTGCCGGCC GGTGTCTGGG TGGACCGGCT GCGCCGCAGG
CCGATCATGC TGGGCGCCGC GGCCGTCTCC CTCCTGCTGT TCGCCGGCGT CCCGGTCGCC
GCCTGGTGCG GTCTCCTGAG CATCGGCCTG CTGCTGGCCG TCGCCCTGCT GGCGGGCACG
GCGGCGGTGT TCTTCCAGAC CGCCTACAGC GCCTATCTCC CCTCCATCCT GGAGCCCGCC
GATCAGCCCG AAGGCAACGC CAAGCTGCAC GGAAGCGCCT CCGCCGCGCA GATCGCCGGG
ATCGGCTCCG GCGGTCTGAT CGCGCAGCTG GCCGGGGCGG TGAACGGGAT GTTCGCCAAC
GCCGCGACGT TCCTGGTGTC CCTGCTGTGC CTGGCGGGCA TCCGGCATCG CGAGCCGCGT
CCCGCCCAGG CCGAACGCCC GCCCGGGGCG CTGGTCAAGG AGGTCGGCGA AGGTCTGCGG
CTGGTCGCCG GCGACCCGTG GTTCCGCACG TTGACGCTCT TCGGGGCCAC CTCCAACATC
GCCCTGGTCG GTTACCAGTC GATCCTGGTG GTCTTCCTGG TCCGCGATGT CGGCCTGGCC
CCCGGGGCCG TCGGCGGGCT GATCGCGGCG GCGAGCACCG GAGGGGTCGC CGGGGCCTTC
GCCGCCCGCC GGGTCGCCGC GAGGATCGGC ACCGCCCGCG CGACGCTGCT GTTCGAGCTG
GGGCTCGCCC TCTTCGCCGT GCTCATCCCG CTCACCTTCG GCGGCGCGGG GCTGCTGCTG
TACGTCGCGG GCGGTTTCTG CGTCTCCGCC GGCGTGGTGG CCGGCAACGT CATCAAGGCG
AGCTTCCAGC AGAGCTACTG CCCGCCCGGG CTGCTCGGCC GGCTCACCGC GAGCACCGCG
TTCCTCAACT ACGGCACCAT CCCGCTCGGC GCGCTGCTCG GCGGCGCGCT CGGCGCCGCA
CTGGGGGTCC GCCCGGCGAT GTGGATCATG ACGGCGGGCG TCCCGCTGGC CGCGCTGTTC
CTGTGGTTCT CCCCGATCCG GCGATGCCGT GACCTGCCGT CGTCGCCGTC CCCGGGCATG
CCGCGAGCCG GCGACGACGG AGCGGTCGCC GGGTCGGTGA CGTCTGCCCT TCTCCCGTGA
 
Protein sequence
MSGTARGGLL RRHRDFRLLW CGEVAGKFGA AVTGVAMPLI AVSTLHAGTF EVSLISAATW 
LPWLLIALPA GVWVDRLRRR PIMLGAAAVS LLLFAGVPVA AWCGLLSIGL LLAVALLAGT
AAVFFQTAYS AYLPSILEPA DQPEGNAKLH GSASAAQIAG IGSGGLIAQL AGAVNGMFAN
AATFLVSLLC LAGIRHREPR PAQAERPPGA LVKEVGEGLR LVAGDPWFRT LTLFGATSNI
ALVGYQSILV VFLVRDVGLA PGAVGGLIAA ASTGGVAGAF AARRVAARIG TARATLLFEL
GLALFAVLIP LTFGGAGLLL YVAGGFCVSA GVVAGNVIKA SFQQSYCPPG LLGRLTASTA
FLNYGTIPLG ALLGGALGAA LGVRPAMWIM TAGVPLAALF LWFSPIRRCR DLPSSPSPGM
PRAGDDGAVA GSVTSALLP
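
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates this on the first 30 bases only. It is a simplified illustration, not the pipeline that produced this record: `translate_cds` is a hypothetical helper, it assumes the input starts at a genuine start codon (GTG here, read as Met at initiation), and it does not validate the sequence.

```python
from itertools import product

# Codon-to-residue assignments in the usual T, C, A, G ordering.
# Table 11 uses these same assignments; it differs in which codons
# may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    if residues:
        # Assumption: the first codon is a real start codon, so it is
        # read as Met even when it is GTG or TTG.
        residues[0] = "M"
    return "".join(residues).split("*")[0]


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGAGCGGGA CCGCGCGCGG TGGCCTGCTG"))  # MSGTARGGLL
```

The output matches the first ten residues of the protein sequence, including the Met produced from the GTG start codon.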