Gene Sros_2973 details

Gene Information

Locus tag: Sros_2973
Symbol:
ID: 8666260
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3236175
End bp: 3237443
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: Microsomal epoxide hydrolase
Protein accession: YP_003338670
Protein GI: 271964474
COG category:
COG ID:
TIGRFAM ID:
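
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_length = span_length(3236175, 3237443)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # prints "1269 422"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.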


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAACA CCCGATCTTT TCGTATCGAT ATCCCGCAGG CCCAGCTCGA CGATTTGGGT 
GCACGGCTGG CCAACACCCG CTGGCCGGAG GAACTGCCGG GGGTGGGCTG GAGCCGGGGG
GTGCCGCTGG GCTACCTGAA GGATCTCGCC GAGTACTGGC GCACCGGCTA CGACTGGCGG
GCGCGGGAGG CGGCGCTCAA CGCGTACCCG CAGTACCTCA CCACCATCGA TGAGCAGAAC
CTCCATTTCC TGCACGTGCC CTCACCCGAG CCCGACGCCA CGCCCTTGCT GCTGCTGCAC
GGCTGGCCTG GCGGGTTCAC CGATTTCCTC GACGTGATCG GCCCGCTGTC CGATCCCCGC
GCGCACGGCG GCGACCCGGC CGACGCCTTC CATCTGGTGA TCCCGTCGTT GCCGGGGTTC
GGGTTCTCCA CGCCGCTGGC CGGGCCGGGT ATGAACGCGG CCAGGATGGC CGGGGTGCTC
GTGCGGCTGA TGGCCCAGCT CGGGTTCCAG CGGTACGGCG TGCACGGCTA TGACACCGGC
TCATGGGTCG CCCCCCAGAT GGGCAGGCAG GATCCGGACC GCGTCGTAGG CGTCCACGTC
AACGCCATGA TCACCTTCCC GATCGGGGCG GAGGGGGAGA TGGAGGGCCT GTCCGAGGTC
GAGCAGCGGC GCTGGCAGGC GATGCAGAAT TTCAACGACG GCTATCTGCA GTGCAACTCC
AAGCGGCCGC AGACGGTGAC CTACGGTCTG CACGATTCAC CCGTCGGCCA GCTCGCCTGG
ATCGTGGAGA AGTTCAAGGA ACTCACCGAC CCCGAGGACG GGCTGCCCGA GGACAGCATC
GATCGCGATC GCATCCTGAC CGACGTCTGC CTGTACTGGC TGACCGGCAC AGCCGGATCT
GCGGCGCAGA TCTACTACGA GGAGATCTCC GCGAACGCCT GGGACGCCGA GGCCGCCGGC
GACTGGAGTG CCGACTCCGG TGCCAGCGCC GGAGCCGACG CCGGTGACTG GAACGCCGGC
TCCGGTGACA GCACCGGAGC CGGTGCGGGC TGGGGCGAAG GCGCAGAGGA GTGGGCGGCA
TCCCAGCCCG GAACGGTGCC GACCGGTGTG CTGGTCTCCA ACCATGACGT GACCATCCGC
CGCTGGGCCG AGCGCGACCA CCATGTCGTG CACTGGACCG AGCTCGGCAA GGGCGGGCAC
TTCCTCGCGA TGGAGGCGCC GGACCTGCTG GTCGGCGACG TTCGCGAGTT CTTCCGTAAG
GTGCGCTGA
 
Protein sequence
MTNTRSFRID IPQAQLDDLG ARLANTRWPE ELPGVGWSRG VPLGYLKDLA EYWRTGYDWR 
AREAALNAYP QYLTTIDEQN LHFLHVPSPE PDATPLLLLH GWPGGFTDFL DVIGPLSDPR
AHGGDPADAF HLVIPSLPGF GFSTPLAGPG MNAARMAGVL VRLMAQLGFQ RYGVHGYDTG
SWVAPQMGRQ DPDRVVGVHV NAMITFPIGA EGEMEGLSEV EQRRWQAMQN FNDGYLQCNS
KRPQTVTYGL HDSPVGQLAW IVEKFKELTD PEDGLPEDSI DRDRILTDVC LYWLTGTAGS
AAQIYYEEIS ANAWDAEAAG DWSADSGASA GADAGDWNAG SGDSTGAGAG WGEGAEEWAA
SQPGTVPTGV LVSNHDVTIR RWAERDHHVV HWTELGKGGH FLAMEAPDLL VGDVREFFRK
VR
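
Both sequences above are printed in space-separated blocks of ten characters, so they have to be joined before their lengths or base composition can be compared with the record fields. The following is a minimal sketch of that step; `join_blocks` and `gc_percent` are hypothetical helper names, and the sketch assumes the pasted text contains only sequence letters and whitespace.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# The first two blocks of the gene sequence, used as a small example.
fragment = join_blocks("ATGACGAACA CCCGATCTTT")
print(len(fragment), gc_percent(fragment))
```

Applying `join_blocks` to the full gene and protein sequences and taking `len` of each result gives values to compare against the Gene Length and Protein Length fields, and `gc_percent` on the joined gene sequence gives a value to compare against the GC content field.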