Gene Sros_4618 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4618 
Symbol 
ID8667912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5142216 
End bp5143415 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content69% 
IMG OID 
Productputative epoxide hydrolase 
Protein accessionYP_003340222 
Protein GI271966026 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.121576 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGGAA ACATGGCGAC AACGCCTGAG ATCGAGCCCT ACCCCATGCA CATTCCTGAG 
CGCGACTTGG CGGACCTACG GGCCCGTCTT GACGGGGTTC GGCTTCCCGA GCCAGAAACG
GTGCCGGACG CCGCGCAGGG TATCGAGTTG GATCAGCTGA GGATGCTGCT GGACACCTGG
CGACAGCACG ACTGGCGGGC CAGGGAAGAG CAGTGGAACA CGATTCCGCA TTACCGCGTG
CGCCTCGACG GCCTGCGTAT CGCGTTCTGG CACGTGCGCT CGCCCGAGCC CACAGCGCTC
CCCTTGTTGC TGACCCACGG CTGGCCGGGC TCCGTCCTCG AATTTGAGAA CGTCCTCGGG
CCGCTCAGTG ATCCGGTCGC CCATGGCGGT TCCGCGAGTG ACGCCTTCCA CGTCGTCGTC
CCGGCCCTGC CAGGGTTCGG GTTCAGCGAC CGTCCACGCG AGCGCGGCTG GCACCCCGCC
CGTACGGCGC GGGCGTGGGC CGAGCTGATG ACCGTGCTCG GCTACGAGCG GTTCGGCGCG
CACGGCGGAG ACTGGGGCGC GTTCGTCAGC ACCGAACTGG CGCGCCTGGT GCCCGAGCGC
GTCGCCGGCC TGCACCTGAC GATGCCGATC GCCTCTCCCC TGCCGGACGA CCGGATCTCG
CCGGACCCGG CGGAACAACG CATGCTGGAG CGGCGTGACA TTCATCTCGC GGACGGATAC
GGCTTCGGGA TGATCATGGG CACGCGCCCG CAGACGCTCG GCTACTCCCT GCTCGACTCG
CCTGCCGGCC TGGCGGCGTG GCTCGGTGAG AAGTTCGCCG CCTACTCCGA CACCCGCCCG
GAGGCCGGCG GCGGGGTGAG CGTGTCGCAA CAAGTGGACA ACATCGCCCT GTACTGGCTG
ACCAGGACCG GTGCCTCCAG CGCTCGCTGG TACTGGGAGA CCATGCGGTG GGTACCACGC
AGCGCCGAAG AAGAGAACGC ACAACCGGTG ACGGTGCCCA CCGCCTGTTC GCTGTTCCCG
GCCGAGCCGT GGCCGACCGC CCGGCGCTGG GCCGAACGCC GCTACCACGA CCTCCGGTCA
TGGCACGAGC TGGACCGCGG CGGCCACTTC CCCGGTCTGG AACAGCCAGA TCTTCTGGTC
TCCGAAATCA GGGCCGCATT CCATCACGTA CGTCACCATC CCGGCGATCT CACACGGTGA
 
Protein sequence
MSGNMATTPE IEPYPMHIPE RDLADLRARL DGVRLPEPET VPDAAQGIEL DQLRMLLDTW 
RQHDWRAREE QWNTIPHYRV RLDGLRIAFW HVRSPEPTAL PLLLTHGWPG SVLEFENVLG
PLSDPVAHGG SASDAFHVVV PALPGFGFSD RPRERGWHPA RTARAWAELM TVLGYERFGA
HGGDWGAFVS TELARLVPER VAGLHLTMPI ASPLPDDRIS PDPAEQRMLE RRDIHLADGY
GFGMIMGTRP QTLGYSLLDS PAGLAAWLGE KFAAYSDTRP EAGGGVSVSQ QVDNIALYWL
TRTGASSARW YWETMRWVPR SAEEENAQPV TVPTACSLFP AEPWPTARRW AERRYHDLRS
WHELDRGGHF PGLEQPDLLV SEIRAAFHHV RHHPGDLTR