Gene Sros_0166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_0166 
Symbol 
ID8663432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp169609 
End bp171033 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content67% 
IMG OID 
Productintegrase domain-containing protein 
Protein accessionYP_003335961 
Protein GI271961765 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGATC CGCTGCTGGG ATCGCTCGGC GACTTGGACG AGTGGCTGGA CCGGCGGGAT 
GTGCTGGATG GGCAGCCGTT CCTGCTCAGC CCCCAGGGCG AGTACGACGT GGCGTTGAAC
CGGTACTTCG AGCAGATCGG GATGGCGACG GCGCCGTGGA ACACACAGGC GGCGCACGCC
CGGGACCTGC GGAACTTCCT GGATTTTCTG TGGGCGAACC GCGGTGGAAG GCCGTGGCGC
GAGGCTACGC CGGAGGATCG GGCTGCCTAT GAGCGGTGGC GACGGAAGGA TCCTGCAGGG
CCGCGGGTAG AGCCCACCAC CTGGGATCGT GAGGTCGCAA CGGTCAACGC GTTCTTCGCC
TGGGTGGTTC GGCAGGGCTA CATCGAAGTC AGTCCGATCG TGCAGCGGGA GAGCCGGGAT
CGTCGCTCCC GCCCAGGACG GAGATCCACG CAGACGACCC CGGCGGAGGC CTCGCATACC
GGAGCGCGCC GTCATGTCGA GTGGCTGACG CCGGGCATGT ATCGGCGATG GCGGGACATC
GGGATCCGCG GCCACACCCC CGACGGGGCC TTGGACCCAT CGTTTAGAGG CAGGTTCGCC
TCGCGGAACG CCGCGTTCAC CGACCTGATG ATCCGCACCG GGCTGAGGAT CAGCGAGCAG
ATCGGGTTGT CGCTCTATGA ACTGCCCCGC ACGCAGGCCG GGATACTGAA CAGCCGGACC
TGGCTGCCCG CGCCGATCGC GAAATGGGGT TCAGCCCGCT ACGTCTACAT CCCGACCGGG
GTCTTGCGCG ATATCTGGGA CTACGTGGAG ATCGAGCGCG CGGACGCGGT GGAGCGGGCT
CGAGATCTGG GCCTCTACGA GCGGATCGTG GAGCCGCTGC TGATCGAGGA TCCCTCCCAA
CCGGTGGTGC GGATCGGTGG CCGCCGCCTG CCACTGACCA AACTCAGGCA AGCCGAACGC
GCCCGGGTCC TGGTCCGCAC CGATCACGGG TGGGAGCCGG CAGCGTTGTG GTTGAACGAG
TCTGGGCTGC CCGGATCGGC CGCAGGTTAC CGCGAGCTCT TCAAGGACGC CAACCGACGC
TGCCGTCGAC ACGGCCTGAC CGTGTCAACG CACCCGCACG GGCTGCGACA CAGCTTCGCG
GTGATCGAGC TGGAACACCT CTGGCGGGGC CATCTGGAGC AGCTCCAGGA GACCAACCCA
CAGGCACGGA TGACCTACCA GCGTGTCTAT GGCGATCCGC TGCTCTGGGT CAGCTGCAGG
CTCGGGCACC GGTCGATCGA GACCTCCGCG ATCTATCTGC ACACGCTGCA GGAGCTGGAG
ATGGAAACAC GCATGGCGCT GATCCCGGAC TGGTGGGAGC GCACGGGCGT CGATCCTGCC
CAACTCGACG ATCCGCCCAC TGACGGTGCT GAGGAGCACG CGTGA
 
Protein sequence
MDDPLLGSLG DLDEWLDRRD VLDGQPFLLS PQGEYDVALN RYFEQIGMAT APWNTQAAHA 
RDLRNFLDFL WANRGGRPWR EATPEDRAAY ERWRRKDPAG PRVEPTTWDR EVATVNAFFA
WVVRQGYIEV SPIVQRESRD RRSRPGRRST QTTPAEASHT GARRHVEWLT PGMYRRWRDI
GIRGHTPDGA LDPSFRGRFA SRNAAFTDLM IRTGLRISEQ IGLSLYELPR TQAGILNSRT
WLPAPIAKWG SARYVYIPTG VLRDIWDYVE IERADAVERA RDLGLYERIV EPLLIEDPSQ
PVVRIGGRRL PLTKLRQAER ARVLVRTDHG WEPAALWLNE SGLPGSAAGY RELFKDANRR
CRRHGLTVST HPHGLRHSFA VIELEHLWRG HLEQLQETNP QARMTYQRVY GDPLLWVSCR
LGHRSIETSA IYLHTLQELE METRMALIPD WWERTGVDPA QLDDPPTDGA EEHA