Gene Sros_9246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_9246 
Symbol 
ID8672594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp10196097 
End bp10197764 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content72% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003344607 
Protein GI271970411 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.566604 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCCG CCCCCGCGCA CCCACAGCCC GGACCGGACC GGGCCGGACC CGCCGCACCG 
TCAGCGTTCG GTCAGCGCCC CGTCAACACG GTCGTCGAAG CTCCGTGCAT GACTGTTCCC
TCACCCCGCA AGTGGGGCAC GCTGGTGATC GCCTGCCTCG CCATGCTGCT GCTCGCCATC
GACCTGACGG TGCTGCACCA GGCCGCACCC AAACTGGTCG AGGAGATGCG TCCCTCGGCC
CCCCAGTTCC TGTGGATCGT CGACGTGTAC GGCTTCGCGC TGGCCGGGCT CCTGGTCACC
ATGGGCAACG TCGGCGACCG GATCGGGCGG AGACGGCTGC TGCTCATCGG CATGACGGCC
TTCGGGGCCG TCTCCGCCCT GACGGCCTAC GCGCCGACGC CGGAGTGGCT CATCGTGGCA
CGGGCGCTGC TCGGCGTCGC CGGCGCCACC ATCATGCCGT CGACCCTGTC GATGATCAGA
AACGCCTTCA CCGACCCCGG GGAGCGGACC ACCGCGGTCG GCATCTGGAG CAGCGTCTCC
GCGCTGGGCT TCGCCCTCGG TCCCGTGGTC GGCGGGGCGC TTCTCAACTC CTTCTGGTGG
GGCTCCGTCT TCCTCGTCAA CGTGCCGGTC GCCGTGCTGA TCGTCGTCGT CGGATCCGTC
GTGCTGCCCG AGTCGCGCAA CCCCCGGCCG GGGCGGCTCG ACCTGGTGAG CGTGCCGCTC
TCCGTCGTCG GCGTCATCGC CGTCATCTAC GCCCTCAAGA CGGCGGCCCA CGACGGCGTC
GCCGGGGCCG GCGTCTGGGT GGCGGCCGTC GCCGGCCTCG TCTCGCTCGT CCTGTTCACC
CGGCGGCAGA CCCGGCTGGC GGAGCCCCTC ATCGACGTCC GGCTCTTCGG CCACCGGGCC
TTCTCCGGGG CGGTCGGGGC CAACGTCGTG TGCATCTTCT CGATGCTCGC CGCGTCGCTC
GCCTTCGCCC AGTACTTCCA GCTCGTCCTG GGCTGGTCGC CTCTCGTGTC GGGACTCGCC
GGCCTGCCCG GCGGGCTGGG CGCGGCGGTG GGCGGGGCAC TGGCGTCCCC GCTGGTCACC
GCCCTCGGCC GGGCACGGGT CGTCGCCTTC GGGCTGGGGC TGAGCGCCGC CGGATTCGTC
ATGTACGGCC AGGTGGACAT GGACACGGGC TACGCGTACA TGATGACCGC GATGATCATC
ACCTCCATGG GGACCGGATT CACCTTCGCC GTCACCAACG ACACCATCCT CGCCTCGGTC
CCGAGGGAGC GCGCGGGCGC GGCGTCCGCG ATCGCGGAGA CCGCCCAGGA GATGGGCGGA
GCGCTGGGCA TCGCCGTACT CGGGAGCGTG CTGAACGGCG CCTACCGCAA CAACCTGCGG
CTCCCGGCCG AGGTGCCCGC CGATGCGGCG GACCAGATCA GGGAGTCGCT CGGCGGAGCA
CTGGAGACCG CCGCCGCCCT GCCCGCGCGG CTGGCGGGGA CGGTCACCGA GACCGCCCGG
CAGACGTTCG TCGACAGCAT GCAGATGACC GTGATGACCG GCGCGGTCCT GCTGGCCCTC
CTCGCGGTGG CCGCGCTGCC CACCCTGCGC GGCGTCCCCA AGGTGATCGC CGAGGTGGAC
CTGGACGACC AGGGCAGAGC CCAGGACGGG CCCGTCGCGG CCCGATAG
 
Protein sequence
MAAAPAHPQP GPDRAGPAAP SAFGQRPVNT VVEAPCMTVP SPRKWGTLVI ACLAMLLLAI 
DLTVLHQAAP KLVEEMRPSA PQFLWIVDVY GFALAGLLVT MGNVGDRIGR RRLLLIGMTA
FGAVSALTAY APTPEWLIVA RALLGVAGAT IMPSTLSMIR NAFTDPGERT TAVGIWSSVS
ALGFALGPVV GGALLNSFWW GSVFLVNVPV AVLIVVVGSV VLPESRNPRP GRLDLVSVPL
SVVGVIAVIY ALKTAAHDGV AGAGVWVAAV AGLVSLVLFT RRQTRLAEPL IDVRLFGHRA
FSGAVGANVV CIFSMLAASL AFAQYFQLVL GWSPLVSGLA GLPGGLGAAV GGALASPLVT
ALGRARVVAF GLGLSAAGFV MYGQVDMDTG YAYMMTAMII TSMGTGFTFA VTNDTILASV
PRERAGAASA IAETAQEMGG ALGIAVLGSV LNGAYRNNLR LPAEVPADAA DQIRESLGGA
LETAAALPAR LAGTVTETAR QTFVDSMQMT VMTGAVLLAL LAVAALPTLR GVPKVIAEVD
LDDQGRAQDG PVAAR