Gene Sros_4500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4500 
Symbol 
ID8667794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5017507 
End bp5018724 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content69% 
IMG OID 
Productmajor facilitator superfamily transporter 
Protein accessionYP_003340109 
Protein GI271965913 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTCGTG CCGTACACCT GATGGCCCTC GGCGTCTTCG CCATGGTGAC CAGCGAGTTC 
GCCGTCGCCG GACTGATGCC GCAGATGGCC GACGGGCTGA GCGCCACCAT CCCGCAGATC
GGCTACCTCA TCACCGCCTT CGCGGTCGCC ATGGCCGCCG GCGGGCCCTT CCTCACCATC
GCGGTGCTGA AACTGCGCCC CAAACCGGCA CTCATGGTGC TGTTCGCGAT CTTCCTGGTG
GGCAACCTGC TGGCCGCCAC CGCCCCCTCC TATCCGATCA TGATGGCGGC ACGCATCATC
ACCGGCATCG CCGCCCAGGC GTTCTTCGGA GTCGCCATCT CACTGGCTGT CCGGCTGACG
CCTCCGCAGA GTCGCGGCCG GGCCATCGCG GTGGTCATGA ACGGGCTGAT GGTGGGCACC
CTGCTGGGGC TGCCGCTGTC CACCCTGATC GGAGAACACC TGGGCTGGCG CGCCGCGTTC
TGGGCCGTCA GCGCCGTGGC CGCCCTTGCC GCGCTCGCCA CGATGATCGG TGTGCCCCGG
CTGGAGCGCG CCGAAGGCGA CGGTGGCGAC TTCCGGCAGG AGATGCGCGT CTTCACGAAA
CCCAAGCTGT GGCTGGTGTT CGCGACCAGC ACCCTCATCA TCGGCGCGAC CTTCTCCGCC
TTCAGCTACC TAAACCCGAT CCTCACCCAG GTCACCGGAT TCAGCGCCGG AACCGTCCCG
CTTCTGCTCA TCGCCTACGG CGCCGCCACC GTGGTCGGCA ACAACATCGT CGGACGCCTC
GCCGACCGGC ACACCGTCAG CGTCCAGCTG TGGGGACTGG CCCTGAACCT GATCTTCCTG
ACCGGGTTCG CGCTGCTGGC CCACCTCAGC ATCCCCGCCG TGGTGCTCAT GCTCGGCATC
GGCCTGGTCG GCGTCACCAT GAACCCGGCG ATGGTCACCC GCGTCCAGCG AACCGGCAAC
GCCCGGCCAC TGGTCAACAC CGTCCACTCC TCCTTCATCA CCCTCGGCGT CATCATCGGC
TCCTTCGCCG GAGGAATGGC CATCGACGAC TTCGGCCTGC GAGCCCCGCT GTGGCTCGGC
GCCGGGTTGG CCGCACTCGG GATCCTCACC CTTGTCCCCG ACCTCATCCG CCGCTCCGCC
GCCAAACCCG CACCCGCGCC CAACGCGCCC GCCGCCGGGG GAGGGCCCGC CGTGGACGCC
CGAAAGCACA TGGTCTAG
 
Protein sequence
MPRAVHLMAL GVFAMVTSEF AVAGLMPQMA DGLSATIPQI GYLITAFAVA MAAGGPFLTI 
AVLKLRPKPA LMVLFAIFLV GNLLAATAPS YPIMMAARII TGIAAQAFFG VAISLAVRLT
PPQSRGRAIA VVMNGLMVGT LLGLPLSTLI GEHLGWRAAF WAVSAVAALA ALATMIGVPR
LERAEGDGGD FRQEMRVFTK PKLWLVFATS TLIIGATFSA FSYLNPILTQ VTGFSAGTVP
LLLIAYGAAT VVGNNIVGRL ADRHTVSVQL WGLALNLIFL TGFALLAHLS IPAVVLMLGI
GLVGVTMNPA MVTRVQRTGN ARPLVNTVHS SFITLGVIIG SFAGGMAIDD FGLRAPLWLG
AGLAALGILT LVPDLIRRSA AKPAPAPNAP AAGGGPAVDA RKHMV