Gene Sros_0831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_0831 
Symbol 
ID8664103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp854339 
End bp855919 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content70% 
IMG OID 
Productmajor facilitator transporter 
Protein accessionYP_003336588 
Protein GI271962392 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTCGA GTAGTGCCGT GAGCGGCACG CGGGCCGGTC GGCGGGAGTG GGCCGGGCTC 
GCGATGCTTG CCCTGCCCAG CATCCTGCTG TCGCTGGACG TGACCCTCCT GCATCTGGCG
GTGCCGCACC TGGGAGCGGC GCTCGCACCC AGCAGTACCC AGATGCTGTG GATCATCGAT
ATCTACGCCT TCATGATCGC CGGGTTCCTG GTCACCGCGG GCACCCTGGG CGACCGTATC
GGCCGGCGCA AGCTGCTGCT CGGCGGTGGG CTGGCCTTCG GTGCCGCCTC GCTGCTCGCC
GCCTACGCCG GCAGCGCCGA GATGCTGATC GTCGCCCGGG CTCTGCTCGG CATCGCGGGC
GCGACCCTCA TGCCCTCCAC GCTCGCTCTG ATCAGCAACA TGTTCCAGGA TCCCAAGCAG
CGGGGCACCG CCATCGGGAT CTGGGCCGCC AGCTTCTCCG TGGGCATCGC GCTCGGCCCG
GTGGTCGGCG GGGCCATGCT GGAGGCGTTC TGGTGGGGCT CGGTCTTCCT CCTCGCCGTC
CCCGTGATGG CTCTGCTGCT GATCACCGGA CCGCTGCTGC TGCCCGAGTA CAAGGACGAG
AACGCGGGCC GGATCGATCT ACCCAGCGTC GCCCTGTCCC TCGCCGCCAT CCTGCCCGCC
GTCTACGGCG TCAAGGAGAT CGCCAAGCAC GGTATGCAGA GCGCGCCCCT GATCGCACTG
GTGGTCGGCC TGGTCTTCGG CATCGTCTTC ACCCGCCGCC AGCTACGGCT TGAGAACCCG
ATGCTGGATC TGAGCCTGTT CCGCAGCCGG GCCTTCAGCG TCGCCCTCGG CGTGATGCTC
TTCGCCGCCG TCGCCATGGG CGGCATCTAC CTCTTCGTCA CCCAGTACCT GCAGATGGTC
GCGGGACTCT CGCCGCTGAG GGCGGGCCTG TGGCTGCTGC CCGCCGCGGG GCTCCTGATC
GCCTCGTCGA TGCTCGCCCC GATCGCGGCC CGCCGGATCC GCCCCGGCAT CGTCACCGCT
GTCGGACTGA TTCTTTCCGC CATCGGCTAC TTCATCCTCA CCCAGGCCGA CGCCGGCGAC
AACGGGCTGG CCTACGTCGT CATCGGCTTC AGCTTCATCT ACACCGGCAT CGGCCCCGTG
ATGGGCCTGA GCGTCTCGCT CATCGTCGGC TCCGCGCCCC CGGAGAAGGC CGGTGCCGCC
TCCGCCCTGC AGCAGACCAG CAGCGACCTG GGCCTCGCGG TCGGCATCGC GGCCCTGGGC
AGCCTCGGCA CCGCCGTCTA CCGCAACGGC GTCGCCGGCG AACTCCCCGC CGACATCCCG
GCCGAGGTGG CCGACACCAG CCGCGACACC CTCGCCCGGG CCCTGGACGC CACCCGTGAC
CTGCCCGGAT CGATGGTCGA CCAGATCGTC ACACCCGCCC GCGATGCCTT CACATCCGGC
CTTAATGTCG TCGCCCTCAT CGGAGCGATT CTGGTCACCG CACTCACGAT CCTCGCCCTC
ACCATGCTCC GCCACGTGGC CCCCACCAAC GCGGCCGCGG CGCAGCCCGA GGAGATCCCC
CTCAAGAAGG CACTCAGATG A
 
Protein sequence
MTSSSAVSGT RAGRREWAGL AMLALPSILL SLDVTLLHLA VPHLGAALAP SSTQMLWIID 
IYAFMIAGFL VTAGTLGDRI GRRKLLLGGG LAFGAASLLA AYAGSAEMLI VARALLGIAG
ATLMPSTLAL ISNMFQDPKQ RGTAIGIWAA SFSVGIALGP VVGGAMLEAF WWGSVFLLAV
PVMALLLITG PLLLPEYKDE NAGRIDLPSV ALSLAAILPA VYGVKEIAKH GMQSAPLIAL
VVGLVFGIVF TRRQLRLENP MLDLSLFRSR AFSVALGVML FAAVAMGGIY LFVTQYLQMV
AGLSPLRAGL WLLPAAGLLI ASSMLAPIAA RRIRPGIVTA VGLILSAIGY FILTQADAGD
NGLAYVVIGF SFIYTGIGPV MGLSVSLIVG SAPPEKAGAA SALQQTSSDL GLAVGIAALG
SLGTAVYRNG VAGELPADIP AEVADTSRDT LARALDATRD LPGSMVDQIV TPARDAFTSG
LNVVALIGAI LVTALTILAL TMLRHVAPTN AAAAQPEEIP LKKALR