Gene Sros_8490 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8490 
Symbol 
ID8671824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp9369081 
End bp9370271 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content71% 
IMG OID 
Productmajor facilitator superfamily permease 
Protein accessionYP_003343877 
Protein GI271969681 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.249724 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.30566 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGACCC CACGCGACAG ACAAGCACGA GTGGCGACCT GGGCCGCCTT CTTCGTCCAG 
GGTCTCTGCT TCGCCACCCT GCTCACCCAT GTGATCGACC TGCAGCACAG GTTCGGGCTG
AGCGACGGCG ACCTCACCCT CGTCCTGCTG CTCGTTCCGG TGATCGCCGG CATCGGGAGC
GTCGTGGCCG CCCCCCTGGC CGCCAGGTAC GGCAGCGGCC CGGTCCTGCG GATCTCCCAG
CTGGGCGTCT GCGCCGTCGT GGCCCTGTCC GGATGGAACA CCGAGCTCGC CGGGCTGTAC
GTGCTCAGCG CCGTGTTCGG CCTGTTCGTC GGGGCGGTGG ACGCGGCGAT GAACATGCAG
GCCGTCGCGG TCGAGCGCCG CTACGGCATG AGCGTGCTGA CCGGTTTCCA CGCCGTGTGG
AGCGTCGGCT CGATGCTGGG CGCCGGGTTC AACTCCGCCT TCACCGCGCT CGGGATGGAC
CTGGGCTGGT CGTTCTCCAT CCCGGTGGCC ATCGGTGCCG CGATCTCAGC GATCATGCTT
CCCCGGCTCT ACGACCGGGA CGGGGAGCGG GCTACGGCGC AGGCCGCGCG GACGGCGGCC
AGGGTGCCGT GGCGGCCGAT CATCCCGCTC TGCCTGGCGA TGGCGTTCCT CTACGTAGGC
GACGCCGCGG TCTCCAACTA CGGCACCGTC TACATGGAGA GCGCGCTGTC GGCGAGCGGC
TGGCTGGTGC CCTTCGCCTA TCTCGTCTAC CAGGCGGCCA TGCTCCTCGC GCGGGTCCCG
GGGGACTTCG CCGTGCGCAG GTACGGCCCT GCCCCGGTCG TCCGCGTGGG AGCGGTGATC
GCGGCCGTGG GCACGCTCGG CGTCGTCGCC GCCCCCGGTG TCCTGGTGGC CGTCCTGTCG
TTCGGCCTGA TCGGCATCGG CCTGTCGGTC ATCGCGCCGC AGTCGTTCTC GGCGGCCGGC
CGTCTCGGCG CCGGAGCCGA GACGGCGATC GCGCGCGTCA ACATGTTCAA CTACGTCGGC
TTCCTCGTCG GCGCGGCCGT GGTCGGCACC ATCAACGACA CCGTGGACGC GCGGATGGCG
TTCGTGCCCG CCGCCGTCGT GGTGGCCCTG ATCGTGCCGC TGGCCAAGGG CTTCCAGCCG
GATCCGGAGC TCACCGCTCA GAAGGTCAGA GGTTCAGAAG GTCAGAGGTA A
 
Protein sequence
MPTPRDRQAR VATWAAFFVQ GLCFATLLTH VIDLQHRFGL SDGDLTLVLL LVPVIAGIGS 
VVAAPLAARY GSGPVLRISQ LGVCAVVALS GWNTELAGLY VLSAVFGLFV GAVDAAMNMQ
AVAVERRYGM SVLTGFHAVW SVGSMLGAGF NSAFTALGMD LGWSFSIPVA IGAAISAIML
PRLYDRDGER ATAQAARTAA RVPWRPIIPL CLAMAFLYVG DAAVSNYGTV YMESALSASG
WLVPFAYLVY QAAMLLARVP GDFAVRRYGP APVVRVGAVI AAVGTLGVVA APGVLVAVLS
FGLIGIGLSV IAPQSFSAAG RLGAGAETAI ARVNMFNYVG FLVGAAVVGT INDTVDARMA
FVPAAVVVAL IVPLAKGFQP DPELTAQKVR GSEGQR