Gene Sros_1781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1781 
Symbol 
ID8665059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1897265 
End bp1898413 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content73% 
IMG OID 
Productimidazolonepropionase 
Protein accessionYP_003337514 
Protein GI271963318 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.381762 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCACGC TTCTCGACCG GATCGGACTG CTCTACACCG GTGATCCGGA GCGGGAGGAG 
ATCGAGGACG CCGCCATCGT GGTCGAGGAC GGCCGGGTGG TATGGACCGG GACCGCCGGT
GCCGACCCCG GCGCCGACGA GCGCGTGGAC GTGGGCGGGC GCTGCGTGAT CCCCGGGTTC
GTGGACAGCC ACGCCCATCT GGTCTTCGCC GGCGACCGCA CCGCGGAGTT CACCGCCCGC
ATGTCGGGGG AGCCCTACAC CGCCGGCGGG ATCCGCACCA CGGTCGCGGC CACCCGCGCG
GCCGGCGACG CCCTTCTCGC GGAGCGGACC GCCGCCCTGG TCACCGAGAT GCTCGCCCAG
GGGACCACCA CCGTCGAGAT CAAGAGCGGC TACGGCCTCA CGGTCGAGGA CGAGCGCCGG
TCCCTGGAGA TCGCCCGGCG GTTCACCCGG GAGACCACTT ATCTGGGCGC GCACGTCGTC
CCGCCGGACG CCCCCTCCGC CGACGACTAC GTCCGCATGG TCACCGGGGA GATGCTGGAG
GCCTGCGCGC CGTACGCCAG GTGGGTGGAC GTGTTCTGCG AGCGCGGGGC GTTCGACGCC
GACCAGACCA GGGAGATCCT GCTCGCCGGG ACCAAGGCCG GGCTGCTGCC CCGGATCCAC
GCCAACCAGC TGGGCAACGG GCCGGGCGCG CAGATCGCCG CCGAGATGGG CGCCGCCTCC
GCCGACCACT GCACCCACCT GACCGACGAG GACGTCTCCG CGCTGTCCTC GGCCGGAGTG
GTGGCCACCC TGCTGCCCGG CGCGGAGTTC TCCACCCGCT CGCCGTACCC GGACGCGCGC
CGGCTGCTGG ACGCCGGGGT GACCGTCGCG CTGGCCACCG ACTGCAACCC CGGCTCCTCC
TTCACCTCGT CCATGCCGTT CTGCCTGGCG CTGGCCGTCC GGGAGATGCG GATGACACCG
CTGGAGGCGG TCAGGGCCGC CACGTACGGC GGAGCCATGG CGTTGCGCCG CGACGACGTC
GGCACGCTGA GGGTGGGGGC CCGCGCCGAT CTGGTGATCC TGGACGCCCC GTCCTACGTC
CATCTGGCTT ACCGGCCGGG GGTACCGCTG GCGGCGCAGG TGTGGAAGGA GGGCCACCGC
CTGGTTTGA
 
Protein sequence
MSTLLDRIGL LYTGDPEREE IEDAAIVVED GRVVWTGTAG ADPGADERVD VGGRCVIPGF 
VDSHAHLVFA GDRTAEFTAR MSGEPYTAGG IRTTVAATRA AGDALLAERT AALVTEMLAQ
GTTTVEIKSG YGLTVEDERR SLEIARRFTR ETTYLGAHVV PPDAPSADDY VRMVTGEMLE
ACAPYARWVD VFCERGAFDA DQTREILLAG TKAGLLPRIH ANQLGNGPGA QIAAEMGAAS
ADHCTHLTDE DVSALSSAGV VATLLPGAEF STRSPYPDAR RLLDAGVTVA LATDCNPGSS
FTSSMPFCLA LAVREMRMTP LEAVRAATYG GAMALRRDDV GTLRVGARAD LVILDAPSYV
HLAYRPGVPL AAQVWKEGHR LV