Gene Sros_0119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_0119 
Symbol 
ID8663384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp120615 
End bp122075 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content73% 
IMG OID 
Productprotein of unknown function DUF21 
Protein accessionYP_003335917 
Protein GI271961721 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.458192 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACACGG CTCTCGGCCT GCTGGCCGTA CTGCTCCTCA CTCTTGCCAC CGGCTACTTC 
GTCGCCCAGG AGTTCGCCTT CGTCGCCGCC GACCGCGGCG TGCTGCGCGA ACAGGCCGAA
GCCGGGGACG CGGCGGCCAA GCGGGCGCTG GAGGTGACGG GCCGCCTGTC GTTCATGCTG
TCGGGCGCGC AGCTCGGCAT CACCGTGACC GCGCTGCTCG TCGGCTTCAT CGCCGAACCC
GCCATCGCCA CGGTCATCCG TCCGGGGCTG GAGGCCGCGG GCGTGCCGGA GGCGGCGGTC
CCGGGCATCG CGGTGGCGCT GGCCATCGCG GTCGCCACCG TCGTCCAGAT GGTGCTCGGC
GAGCTCGCCC CCAAGAACCT GGGCATCGCC CGTCCCGAGC CGGTGGCCAA ATTCCTCTCC
GGCTCGACCA TCGTCTACCT CAAAATCGCC GGGCCCGTCA TCCGGCTGTT CGACTCCGCC
GCCACGGGCC TGCTGCGCCG GGTGGGAGTG GAGCCGGTGG AGGAGGTCGA GCACGGCGCC
AGCCCCGAGG AGCTGTCGCG GATCATCTCC GAGTCGGCCA CGGCCGGCGA CCTTCCCCCG
CGGCTCTCGG AGCTGCTGGA GCGGGCGCTG GAGTTCGGCG ACCGCACCGC CGAGGACGTC
ATGGTGCCGC GCCCCCGCGT GGTGCTGCTC CGGGCCGAGC GCCCGATCTC CGACCTGCTC
GACGCCGTCC GCGAGCACGG CCACTCCCGG TATCCGGTGC TCTGCAAGGA CAACGGCGAG
GACGTGGTCG GCGTCACCGG CGTACGGGAG CTGCTCAAGT CCGGCCTGAC CGACGGGTCG
CTGGAGGAGA TCACCCGTCC CGCGCTGCTG GTCCCCGACT CGCTGCCGCT CCCGGTCGTG
CTGGAGCGCA TGCGCGCCGC CGGCGACGAC CTGGCCTGCG TCATCGACGA GTACGGCGGG
CTCGCCGGTG TGGTCACCGT CGAGGACCTG GCCGAGGAGC TAGTGGGCGA GCTGATCGAC
GAGAACGACC CCGAGCCCGC CGGCGTCGTC GCCAACGACG ACGGCACCTG GGACCTGCCC
GGCACGCTCC GGCTCGACGA GGTCGAGCGG GCGACCAAGC TGGAGCTCCC CGAGAGCGAC
GGCTACGAGA CCATCGCCGG CCTGGTGCTC GCCACCCTCG GCCGGATGGC CGAGCCCGGC
GACCAGGTCA CCGTCACGCT GACCCTTGAG ACCGACCTGC TGGAGAACGA CAGCGCCGAC
GAGGCCGACG CGGTGCTCAC CGTGCTGTCG GTGCACCGCA GGGTCCCCGA GTGGGTACGG
CTGGCACCGG CCGCGGCGGA GGTCCCCGGG AAGGCGGAGG TCCCCGGGAA GAACGGCGAG
CACGCGATGG CCGCGCCGCG GATGATCGAC CGCGGCGCCG ACAGACCCAT GATCGAGCGA
AGCGTGGTGG ACGGACGATG A
 
Protein sequence
MNTALGLLAV LLLTLATGYF VAQEFAFVAA DRGVLREQAE AGDAAAKRAL EVTGRLSFML 
SGAQLGITVT ALLVGFIAEP AIATVIRPGL EAAGVPEAAV PGIAVALAIA VATVVQMVLG
ELAPKNLGIA RPEPVAKFLS GSTIVYLKIA GPVIRLFDSA ATGLLRRVGV EPVEEVEHGA
SPEELSRIIS ESATAGDLPP RLSELLERAL EFGDRTAEDV MVPRPRVVLL RAERPISDLL
DAVREHGHSR YPVLCKDNGE DVVGVTGVRE LLKSGLTDGS LEEITRPALL VPDSLPLPVV
LERMRAAGDD LACVIDEYGG LAGVVTVEDL AEELVGELID ENDPEPAGVV ANDDGTWDLP
GTLRLDEVER ATKLELPESD GYETIAGLVL ATLGRMAEPG DQVTVTLTLE TDLLENDSAD
EADAVLTVLS VHRRVPEWVR LAPAAAEVPG KAEVPGKNGE HAMAAPRMID RGADRPMIER
SVVDGR