Gene Sros_4853 details

Gene Information

Locus tag: Sros_4853
Symbol:
ID: 8668147
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 5375559
End bp: 5377040
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: N-ethylammeline chlorohydrolase
Protein accession: YP_003340414
Protein GI: 271966218
COG category:
COG ID:
TIGRFAM ID:
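
The coordinate and length fields in this record agree with one another, which a short check can confirm. The sketch below assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS that ends in a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(5375559, 5377040)
print(length, protein_length(length))  # prints "1482 493"
```

Both values match the Gene Length and Protein Length fields reported for Sros_4853.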


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.308799
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.90716
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGCCG ACCCGGAGGA ACACACAGTG AGCACTCTCG TCAGGGCATC GTTCGTCATC 
GGCTTCGACG GCGACGACCA TGTCATCCAC CGCGACGCGT GCGTCGTCTA CGACCGCGAC
CGGATCGTCT ACGTCGGCCG CTCCTACGAC GGCCCGGTCG ACGAGGTGAT CGACGCCGGT
GAGGCGATCG TCGGGCCGGG TTTCATCGAC CTGGACGCGC TGGCCGACAT CGACCACGCC
ATCCTCGACA CCTGGCACGC CGATTCCGGC GGGCTGGGGT GGTCACAGGA CTACGCCGTC
AACCGGCGTC GTGCCGTTTT CCCGCTCGAG GACACGCTCT TCATGCGGGA GTACGCGCTC
ACCCAGCTCA TCCGCAACGG CATCACCACC GCGATGCCGA TCGCCGCCGA GACGCACAGC
GCCTGGGCCG AGTCCTACGA GGAGCTCGCC GGCGTCGTGG AGATCGCCGG GCGGCTCGGC
CTGCGCATGT ACCTGGGTCC GTCCTACCGC TCCGGCGTGC CGGTGCTGCG TGCCGACGGC
AGCAGGGACG TGCACTGGGA GCCCGAGCTC GGCGACAAGG GGCTCGCCGA CGCGATCCGC
TTCGTCCGGG ACGTCGACGG CGCCTACGAC GGCCGGATCC GAGGGGCGCT GCTGCCCTGC
CGCATCGAGA CCGTCACCCT CGACCTGCTG CGCGCCACCG CCCGAGCCGC CGGGGAACTC
GACTGCCTGG TCCGCCTGCA CTGCATGCAG GGGCTGACGG AGCTGCGGTT ACTGCGCGAA
TGGTACGGCC GGCACCCGCT GGACGTGCTG GCAGAGGTTG GCCTGCTCGG TCCCCGGCTG
CTGATCCCGC ACGCCCTGTA CCTCGGTGAC CCCGAGACGC CGTTCGAGGG ATCACCGGAC
CGGCTCGCGG CCCTGGCCGG GATCGTGCAC TGCCCCCTCA CCTTCGTCAG GTACGGCGAC
GCGCTCCGCG ACTTCGACCG TTACCGGGAG GCCGGTGTGA ACGTGGCGCT GGGCACCGAC
TCCTTCCCGC CCGACATGAT CCGCAACATG GACTACGGCA ACAACCTGGC CAAGCTGGTC
ACCGGACGGC TGGAGGCGGG CTCGGCCGCC GACTATTACC GGGCCGCCAC GCTGGGCGGT
GCCCGCGCGC TGGGCCGGGA CGATCTGGGC CGACTCGCCC CCGGCGCCAA GGCCGACCTG
GTCGTCGTGG ACCTCTCCGG CCCGCGCACC GGCCCCGTCG ACGATCCGGT CAGGACGTTG
ATGATGAACT GCACGGGCGC CGACGTGTCC ACCGTGGTGA TCGACGGCCG GCCGGTGATG
CGCGACCGGA CGATTCCGGG GGTGGATGAG GAGTCGATGC GGCTCCGCGC ACAGACCTAC
TTCGAGACGA TGAAGGCCGC CTACTCCGAG CGTGACCACA TGCGCCGCGA CCCGGCACTG
TTGTTCCCCG CGTCCTTCCG CATCGTCGAG GCCGGCTCAT GA
 
Protein sequence
MEADPEEHTV STLVRASFVI GFDGDDHVIH RDACVVYDRD RIVYVGRSYD GPVDEVIDAG 
EAIVGPGFID LDALADIDHA ILDTWHADSG GLGWSQDYAV NRRRAVFPLE DTLFMREYAL
TQLIRNGITT AMPIAAETHS AWAESYEELA GVVEIAGRLG LRMYLGPSYR SGVPVLRADG
SRDVHWEPEL GDKGLADAIR FVRDVDGAYD GRIRGALLPC RIETVTLDLL RATARAAGEL
DCLVRLHCMQ GLTELRLLRE WYGRHPLDVL AEVGLLGPRL LIPHALYLGD PETPFEGSPD
RLAALAGIVH CPLTFVRYGD ALRDFDRYRE AGVNVALGTD SFPPDMIRNM DYGNNLAKLV
TGRLEAGSAA DYYRAATLGG ARALGRDDLG RLAPGAKADL VVVDLSGPRT GPVDDPVRTL
MMNCTGADVS TVVIDGRPVM RDRTIPGVDE ESMRLRAQTY FETMKAAYSE RDHMRRDPAL
LFPASFRIVE AGS
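
The sequences above are printed in space-separated blocks of 10 characters, 60 per line. A minimal sketch for joining such blocks into one string and computing a GC fraction follows. The helper names are hypothetical, and the example uses only the first line of the gene sequence, so its result is not the gene-wide GC content.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence only (illustrative input).
first_line = "ATGGAAGCCG ACCCGGAGGA ACACACAGTG AGCACTCTCG TCAGGGCATC GTTCGTCATC"
seq = join_blocks(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same two helpers would give a value to compare with the GC content field.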