Gene Sros_4966 details


Gene Information

Locus tag: Sros_4966
Symbol:
ID: 8668260
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 5487956
End bp: 5489203
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: major facilitator superfamily transporter transmembrane protein
Protein accession: YP_003340509
Protein GI: 271966313
COG category:
COG ID:
TIGRFAM ID:
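
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Amino acid count for a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(5487956, 5489203)
print(length)                     # 1248
print(protein_length_aa(length))  # 415
```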


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.42222
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCTCCA CCGCCCCGGC CGCCGCCCCC AAGACCGGGG GCGGCGGCCA GTCGCGTTTC 
ACCAGGGGCT GGCGGATCGT CGCCGCGCTG GCCGTCACCC AGACCATCGG CTACGGCGTG
CTCTACTACG CCTTCTCCGT CTTCCTCACC CCCATGGCCC GCGACCTGAA TGCGAGCGGC
GCCCAGATCG CCGCCGCGCT CACCGGCTCG ATCCTGATCG CCGCGCTGTG CGCGCCGCTG
GTGGGCCGCC GGCTGGACGC CCACGGCGGC CGGGGCCTGA TGACCGCCGG GTCGGTCCTC
GGCACGGGCG CCGTGCTGGC CTGGTCACGG GTGGAGAGCC TGCCGCAGCT GTATGCGGTG
TTCGCCGCGA TCGGCATCGC GTGTGCGATG GTGCTGTACG AGAGCGCCTT CGCCGTCATC
GTGAGCTGGT TCGACGGCCC CGTCCACGGG CGCGGCCGGG CCAATGCGCT GCTCGCGCTC
ACCGTCGTCG CCGGGTTCGC TTCCTCGATC TTCCTTCCGC TGACCGGGCT GCTGGTGGAC
TCCTACGGCT GGCGCCACGC CTTGGTGGTC CTGGCCCTGA TCTACGGGGT GGCGGCCATC
CCGCTGCACG CGCTCGTCGT GCGCCGCCCC GCCCGCACCG GCCGCCAGGA CACCACGACC
GAAGAGCGGG CCGGGATCGT CAGAGCCGCC ACCCGCCGGC GGCCGTTCTG GCTACTGGTG
ATCGCCTTTA CCGCCAATGG CGGCGCGGCG GCCGTGATGG CGGTCCTGCT GATCACCTAC
CTCATCCACC TGGGCCACTC CCCCGTCCTG GCCGCCACCC TGGCCGGGCT GCTGGGCGTG
CTGTCGGTGA CCGGGCGCCT GCTCACCACC GGCCTGCAAC GCCGTATGCC CGCCGCGCTC
ATCGCGGCGG CGATCTTCAC CCTGCAGGGC GTCGCCGCCG CGCTGCTACC GCTGGCCGGG
CGGACGGTGC CGGGCGCCGT CGGCTGCGTG CTGGGATTCG GGCTCGGATT CGGCATCGCC
TCCATCACCC TGCCGCATCT GCTGGTCGAC AGGTACGGCA CGGCGGCGTA TGCCTCGCTG
TCGGGCCGCA TCGCCGCCTT CTCCGTCGCC GACAAGGCCC TGGCCCCGCT GGGAGCGGTC
GCGCTCGCGC AGGCGGCCGG ATACGCGTGG GTCATGGCAG CAGTGGCCCT CGCCTGTGTG
GTCGCCGCGG TCGCGTTGCT GGCCTACCAT CGCGTATCGT TTAGATAG
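
The GC content reported in the record can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks shown above; `gc_content` is a hypothetical helper name, and the way the database rounds the percentage is not stated in the record.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence.

    Whitespace (the spaces and line breaks of the block layout) is ignored.
    """
    dna = "".join(sequence.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = dna.count("G") + dna.count("C")
    return gc / len(dna)


# Example on the first line of the gene sequence only; paste the full
# sequence to compare against the GC content listed in the record.
first_line = "GTGAGCTCCA CCGCCCCGGC CGCCGCCCCC AAGACCGGGG GCGGCGGCCA GTCGCGTTTC"
print(f"{gc_content(first_line):.0%}")
```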
 
Protein sequence
MSSTAPAAAP KTGGGGQSRF TRGWRIVAAL AVTQTIGYGV LYYAFSVFLT PMARDLNASG 
AQIAAALTGS ILIAALCAPL VGRRLDAHGG RGLMTAGSVL GTGAVLAWSR VESLPQLYAV
FAAIGIACAM VLYESAFAVI VSWFDGPVHG RGRANALLAL TVVAGFASSI FLPLTGLLVD
SYGWRHALVV LALIYGVAAI PLHALVVRRP ARTGRQDTTT EERAGIVRAA TRRRPFWLLV
IAFTANGGAA AVMAVLLITY LIHLGHSPVL AATLAGLLGV LSVTGRLLTT GLQRRMPAAL
IAAAIFTLQG VAAALLPLAG RTVPGAVGCV LGFGLGFGIA SITLPHLLVD RYGTAAYASL
SGRIAAFSVA DKALAPLGAV ALAQAAGYAW VMAAVALACV VAAVALLAYH RVSFR
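
The protein sequence can be cross-checked against the gene sequence by translating it with translation table 11, as listed in the record. The sketch below builds the codon table from the standard NCBI base ordering (TCAG) and treats an alternative initiation codon such as GTG as methionine when it occurs at the first position. The names `translate_cds`, `CODON_TABLE`, and `START_CODONS` are illustrative and not part of any library.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first, second, third base over TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate_cds("GTGAGCTCCA CCGCCCCGGC CGCCGCCCCC"))  # MSSTAPAAAP
```

Passing the full gene sequence to `translate_cds` should allow a residue-by-residue comparison with the protein sequence listed above.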