Gene Sros_5100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5100 
Symbol 
ID8668394 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5615118 
End bp5616224 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content69% 
IMG OID 
Productputative hydrolase 
Protein accessionYP_003340628 
Protein GI271966432 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0429658 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAATC CGTTCCGCAT CGACATCCCC CAGGCCGACC TCGACGACCT GACCGACCGG 
CTCTCCCGCA CCCGCTGGCC CAACGAGGTC GCCGACGCCG GATGGGACTA CGGCTTCCCG
CTCGCGCGGC TCAAGGAACT GGCCGAATAC TGGCGCACCG GCTACGACTG GCGCGAGCAC
GAGGCCAAGC TCAACGAGCT CCCGCACTTC ACCACCGAGA TCGACGGCCA GAACATCCAC
TTCGTCCACG TCCGGTCTTC GAACCCGGAC GCGCTCGCGC TGATCCTCAC CCACGGCTGG
CCCGGTTCGT TCCTGGAGTT CCTCGATGTG ATCGAGCCGC TGTCGCGCGA CTTCCACCTG
GTGATTCCGT CCATCCCGGG TTTCGGCTTC TCCGGGCCGA CCCACGAGCG CGGCTGGGAC
ATCGTCCGGG TCGCGCGGGC CTGGGCTGAG CTGATGCGCC GTCTCGGGTA CGAGCGCTAT
GGCGCGCAGG GTGGCGACTT CGGCTCGGGC ATCTCGATGG CGCTCGGCGC GGTGGCACCC
GAGCAGGTCG TCGGGGTGCA CGTCAACTAC CTGCCGACCC GGCCGGTCCC GGACGCCGAC
ATCGAACTGT CCGAAACGGA TGAAGCCCGG CTGGACAAGG TCAGGCAGCT GATGGCGAAC
CGTCCTCCGT ACCAGGCTCT GCAGGCCAGC ACCCCGCAGA CCATCGGTTA CGCGCTGACC
GACTCGCCGG TCGGCCAGCT GGCCTGGATC GCCGAGCGCT TCGCACAGTG GACGGACCCT
CGCTCGCCGA TCAGTGACGA GCGGATGCTC ACCGACATCT CGCTGTACTG GCTGACCGCC
ACCGCGGCTT CCTCGGCGCG GCTGTCCCGA GAGGCTCCGC GGCGGATCGA GCCGTGCCCG
GTACCGGTCG GCGTGGCGGT GTTCGCGCAC GACATCACGC AGTCGGTGCG ACCGCTGGCC
GAGCGGCTGT ACGACATCAG GCACTGGTCG GAGTTCGAGC GCGGCGGCCA CTTCGCCGCG
ATGGAGGTGC CCGAGCTGCT CGCCGAGGAC GTCCGGGACT TCTTCCGTAC CCACATCAAG
GACGACGACC GGGTCACCAC CCGCTAG
 
Protein sequence
MINPFRIDIP QADLDDLTDR LSRTRWPNEV ADAGWDYGFP LARLKELAEY WRTGYDWREH 
EAKLNELPHF TTEIDGQNIH FVHVRSSNPD ALALILTHGW PGSFLEFLDV IEPLSRDFHL
VIPSIPGFGF SGPTHERGWD IVRVARAWAE LMRRLGYERY GAQGGDFGSG ISMALGAVAP
EQVVGVHVNY LPTRPVPDAD IELSETDEAR LDKVRQLMAN RPPYQALQAS TPQTIGYALT
DSPVGQLAWI AERFAQWTDP RSPISDERML TDISLYWLTA TAASSARLSR EAPRRIEPCP
VPVGVAVFAH DITQSVRPLA ERLYDIRHWS EFERGGHFAA MEVPELLAED VRDFFRTHIK
DDDRVTTR