Gene Sros_2271 details

Gene Information

Locus tag: Sros_2271
Symbol:
ID: 8665553
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 2454855
End bp: 2456063
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: imidazolonepropionase (imidazolone-5-propionate hydrolase)
Protein accession: YP_003337996
Protein GI: 271963800
COG category:
COG ID:
TIGRFAM ID:
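
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch in Python. It assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends in a single stop codon that is not counted in the protein. Neither assumption is stated on this page, and the variable names are illustrative only.

```python
# Values copied from the Gene Information block.
start_bp = 2454855
end_bp = 2456063

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the results agree with the listed Gene Length (1209 bp) and Protein Length (402 aa).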


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.282195
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACGGTCC GCCTTCTGAC CAACATTGGC CGGCTCTGGA CCGGCAATGA CGTGTGCAGC 
AACGCGGCGA TCCTGGTCCA CAACGACCGG ATCGCGTGGG TCGGCCGTGC GGCGGACCTG
CCGCAGAGCG TTCCAGGCGT GGTGGACGAC ATCGTCGATG TCGACCATGT CGAGAACCTG
GGCGGCGCGC TGGTCACGCC CGGTCTGATC GACGCTCACA CCCACCCGGT CTACGCGGGA
AACCGCTACG CCGAGATGGC GATGCGCTCC GGCGGCTCCA CGCCGTCCGC GATCACCGCC
GCCGGCGGCG GCATCGGCTC CACCGTCACG GTGACCCGAG GCACCGACCC GTGGACCCTG
TGCAACGGTG TCCGGGAGCG CCTTCGCGAG TGGCTGCTCA GCGGCACCAC CACCGTGGAG
GCCAAGACCG GCTACCACCT CACCCGCGAC GGCGAGCTGG CCGACGTGCG ACTCCTGCGC
GAGCTCGAAA AAGAGCCGAT GATGCCGCGC GTGCACGTCA CCTTCATGGC CGCGCACGTC
GTCCCGCCGG AATACTTCGG CCGTCAGCGC GACTACGTCG AAGCCGTGGG CGCGTGGTGC
GCCGACGCGG CCGCGGCGGG AGCCGACAGC GTCGACGTCT ACTGCGACGA GGGGCACTTC
ACCACCGAAG AGGCCCGCTG GGTCCTCGCC TCCGGCCGCA ACGTCGGCCT GCTGCCCCGC
GTGCACGCCG GCGCCTACAG CCGCCGCGGC GCCGTCCAGC TCGCGGCCGA GCTCGGCTGC
GCCTCCGCCG ACCTGCTCCA CCACACCTCC GACGAGGACA TCTCGATCCT GGCCCGCTAC
GGCGTCCCCG CCGTGGTCTG CCCGGGAACC GCCCTCCAGC GCGGCAGCCT GCCACCGGTC
CGCCGCATGC TCGCCCAGGG CGTCACGGTG GCACTCGGCA GCGACCACAA CCCCGGTCAC
TGCGGAATCA CCTCGATGTC CCTGGTCATC AGCCTCGCCG TGGCCGCCTT CGGCATGAGC
GTCGGCGACG CGCTCCGTGC CGCGACGCTC GGCGGAGCCA CCGTCCTCGG CGTTCCCGAC
CGGGGCGTCC TCGCTCCCGG CCGCCTGGCC GACATCGTCC AGTGGGACGC CGACCACGAA
GGCGCCTTCG CGTGGGCCTT CGGCCTCAAG CCCCGCCGGG TCTGGCGCGG CGGCAACCCC
GTCCAGTAG
 
Protein sequence
MTVRLLTNIG RLWTGNDVCS NAAILVHNDR IAWVGRAADL PQSVPGVVDD IVDVDHVENL 
GGALVTPGLI DAHTHPVYAG NRYAEMAMRS GGSTPSAITA AGGGIGSTVT VTRGTDPWTL
CNGVRERLRE WLLSGTTTVE AKTGYHLTRD GELADVRLLR ELEKEPMMPR VHVTFMAAHV
VPPEYFGRQR DYVEAVGAWC ADAAAAGADS VDVYCDEGHF TTEEARWVLA SGRNVGLLPR
VHAGAYSRRG AVQLAAELGC ASADLLHHTS DEDISILARY GVPAVVCPGT ALQRGSLPPV
RRMLAQGVTV ALGSDHNPGH CGITSMSLVI SLAVAAFGMS VGDALRAATL GGATVLGVPD
RGVLAPGRLA DIVQWDADHE GAFAWAFGLK PRRVWRGGNP VQ
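
The gene and protein sequences above can be related by translating the gene sequence codon by codon. The sketch below is one way to do that in Python. It assumes the standard codon assignments, which translation table 11 shares with table 1 for elongation (the two tables differ in which start codons are permitted). The function names `clean`, `translate`, and `gc_percent` are illustrative and do not belong to any database tool.

```python
from itertools import product

# Codons in TCAG order paired with the standard amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Drop the spaces and line breaks used for display; upper-case the bases."""
    return "".join(seq.split()).upper()

def translate(seq):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_percent(seq):
    """Percentage of G and C bases in a non-empty DNA string."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)

# First 30 bases and last 9 bases of the gene sequence listed above.
print(translate("ATGACGGTCC GCCTTCTGAC CAACATTGGC"))  # MTVRLLTNIG
print(translate("GTCCAGTAG"))                          # VQ
```

Applied to the full gene sequence, `translate` can be compared against the listed protein sequence and `gc_percent` against the listed GC content.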