Gene Sros_6118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_6118 
Symbol 
ID8669416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp6709947 
End bp6711155 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content72% 
IMG OID 
Productmajor facilitator superfamily protein MFS_1 
Protein accessionYP_003341592 
Protein GI271967396 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.27425 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.39659 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGCAC AGAGCGGCGG CAGCATCCTG CGCCAGCCGA TGTCCGTCTG GGCCACCGCG 
TTCGCCGCGG TCGTCGCCTT CATGGGCATC GGCCTGGTCG ACCCGATCCT GCCCTCCATC
GCCCAGGGGC TGCACGCCGG CCCGAGCCAG GTCTCCCTGC TGTTCACCAG CTACTTCCTG
GTCACCGCCG TGGCCATGCT CGTCACCGGG TGGGTCTCCA GCCGGATCGG CGGCAAGCGC
ACCCTGCTGC TCGGCCTGGC CCTGGTCGTG GTCTTCGCCG CCCTGGCCGG AACCTCCGGC
AGCGTCGGCG AGCTGATCGG CTTCCGGGCC GGCTGGGGCC TGGGCAACGC CCTGTTCGTG
GCCACCGCCC TGGCGGTCAT CGTCGGCGCG GCCAGCGGCG GCGCCGAATC CGCCATCATC
CTCTACGAGG CGGCCCTCGG CCTGGGCATC TCCATGGGCC CGCTGGTCGG CGCCGCCCTC
GGCGACTGGA ACTGGCGCGC GCCCTTCTTC GGCACAGCCT CCCTCATGGC CGTCGGCTTC
GTCCTCATCG CCACCCTGCT GAGGACCACC CCCACGCCGG CCGTGAAGAT CCGCCTGGGC
GACCCGCTCC GCGCCCTGGC CCACGGCGGC CTGTCGACCA CCGCCTTCAC CGCGTTCTTC
TACAACTTCG CCTTCTTCAC GATCCTGGCC TTCACCCCCT TCATCCTGGG CATGTCGCCC
TACGGCATCG GCGCGGTCTT CTTCGGATGG GGCGCCTGCG TCGCGCTCGC CTCGGTCTTC
GGCGCTCCCG CCCTGCAGCG GCGGCTCGGC TCGGTCAAGG TGCTCCACCT GTCCCTGGGG
GTGCTGGCCG CCCTGCAGGT CGGCATCGCG CTGTCGGGCC ACGCCGGCAT CGTCGTCTTC
GTCATCCTGT CGGGCATCCC GATCGGCCTG AACAACACCG TGTTCACCGA GGCGGCCATG
GAGGTCTCCG ACGCGCCGCG GCCGGTCGCC TCCGCCGGAT ACAACTTCGT GCGCTGGATG
GGCGGGGCCC TCGCGCCGTT CATCGCCACC AAGCTGGGGG AGGACGTCGC CCCCGCCCTG
CCGTACGTCC TCGGCGCGCT CTGCTGCCTG GCGGGCATGG CCGTCCTGTA CGGCAGGCGC
CACCACATGC GGAGCCTGGA GCGCGTCGAC GCCGACCACG CCTTCCGGGA CGCCCCCGTG
GCGGCCTGA
 
Protein sequence
MSAQSGGSIL RQPMSVWATA FAAVVAFMGI GLVDPILPSI AQGLHAGPSQ VSLLFTSYFL 
VTAVAMLVTG WVSSRIGGKR TLLLGLALVV VFAALAGTSG SVGELIGFRA GWGLGNALFV
ATALAVIVGA ASGGAESAII LYEAALGLGI SMGPLVGAAL GDWNWRAPFF GTASLMAVGF
VLIATLLRTT PTPAVKIRLG DPLRALAHGG LSTTAFTAFF YNFAFFTILA FTPFILGMSP
YGIGAVFFGW GACVALASVF GAPALQRRLG SVKVLHLSLG VLAALQVGIA LSGHAGIVVF
VILSGIPIGL NNTVFTEAAM EVSDAPRPVA SAGYNFVRWM GGALAPFIAT KLGEDVAPAL
PYVLGALCCL AGMAVLYGRR HHMRSLERVD ADHAFRDAPV AA