Gene Sros_5012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5012 
Symbol 
ID8668306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5537958 
End bp5539430 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content70% 
IMG OID 
Productmajor facilitator transporter 
Protein accessionYP_003340550 
Protein GI271966354 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.675946 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0652477 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAC AGAGCACGCA CGTGGCGACA GCGGGATCCG GGCCCGCCGG TGCCGCCCGA 
CGGCAGCGCC GCCCCGGCCT GGTGCTCGCC ATCGTGCTGA TCTGTCAGAC GATGTTCATC
CTGGACACCA ACGTGGTGAA CATCGCCCTC TCCGACATCC AGCACGACCT GGGTTTCTCC
GCGACCGGCC TGTCCTGGGT GCTCAATGCC TACATGCTGG CCTTCGGCGG TCTGCTGCTG
CTCGGCGGTC GCGCGGGCGA CATCGTCGGC CAACGGCGGG CTCTCATCGG CGGGGTCGTC
ATCTTCACCG TCGCCTCACT GGCCGGAGGG CTGGCCACAT CGGCCGAGAT GCTGCTGGCG
GCCCGTGCCG GTCAGGGGAT CGGCGCCGCC ATTGCCGCAC CGACCGTGCT CGCCCTCATC
ACCGTAGGCT TTACGGACCC TGCACGCCGG GCGCGGGCGC TGGGGGCTTA CGCGGCGGTC
TCCGGCTCCG GCGCCGCGAT CGGCCTCATC GCCGGTGGCA TGCTCACCGA CTGGATCTCC
TGGCGGTGGG TGCTGTTCAT CAACGTGCCG ATCGGCATCG TCCTGCTGAT CCTGGCCCCG
CTGTTCATCA CCGAGACCGA TCGCCACCCA GGCCGCTTCG ATCTGGCGGG TTCGCTGACC
TCGACGCTCG GCATGACCGC GCTCGTCTAC GGCTTCATCC GCGCCGCCGA GAAGGGCTTC
GGCGATGTGG TCACCCTCGG CGCGCTCAGC GCGGCCGCCG TCCTGCTGGT CCTGTTCGTG
GCCGTCGAGT CCCGTGCCCG GCAGCCGATC GTCCCGCTAC GGCTGTTCGC CGACCGCAAC
CGCGCCGGCG GCTACGTCGT GCTGCTGTTC GTCCTGGCCG CGGGCAACGG CATGTTCTTC
TTCCTCACCC AGTTCCTGCA GCAGGTCCTC GGCTACAGCC CTCTGCAGGC GGGATTCGCC
TTCGTGCCGC TGGCCCTCGT GATCCTGGCC TCTTCCGGCG TGGCGGCCCG GCTGCTGCCG
CGCCTGGGCG CCAGGACGCT GATCGCCGTC GGCGCCGCGG CGATCAGCCT GGCCCTGCTG
TGGATGACAC AGCTGACGCC CCAGGCCGGC TACGCCACGG CACTGCTCGG CCCCATGATG
ATCGCCGGTC TCGGCATGGG CATCCTGCTG GTGGGAGCCA CCACCGTGTT GTCCTCCGGG
ATCCGAGCCG AGGATGCCGG TGCCGCCTCC GGCCTGCTCA ATGTCATGCA GCAGATCGGC
GGTGCGCTGG GGCTGGCCAT CCTCGTCACG GTGTTCGGCG CCGCCACCCG CGCTTCGCGC
GGACCGGCCC ACCAGGTCCT CACCGAGGGA GTCACCACGG CCTTTACCGT CGCGGCCGCG
TTCACCGCCT GCACAATCCT CCTGGCCCTG ACCATGAAGA ACAGGCAGGC TCCCACAGCC
GGATCCGCGG CCGCGGAGGA GCTCACCCGG TGA
 
Protein sequence
MTEQSTHVAT AGSGPAGAAR RQRRPGLVLA IVLICQTMFI LDTNVVNIAL SDIQHDLGFS 
ATGLSWVLNA YMLAFGGLLL LGGRAGDIVG QRRALIGGVV IFTVASLAGG LATSAEMLLA
ARAGQGIGAA IAAPTVLALI TVGFTDPARR ARALGAYAAV SGSGAAIGLI AGGMLTDWIS
WRWVLFINVP IGIVLLILAP LFITETDRHP GRFDLAGSLT STLGMTALVY GFIRAAEKGF
GDVVTLGALS AAAVLLVLFV AVESRARQPI VPLRLFADRN RAGGYVVLLF VLAAGNGMFF
FLTQFLQQVL GYSPLQAGFA FVPLALVILA SSGVAARLLP RLGARTLIAV GAAAISLALL
WMTQLTPQAG YATALLGPMM IAGLGMGILL VGATTVLSSG IRAEDAGAAS GLLNVMQQIG
GALGLAILVT VFGAATRASR GPAHQVLTEG VTTAFTVAAA FTACTILLAL TMKNRQAPTA
GSAAAEELTR