Gene Strop_1107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_1107 
Symbol 
ID5057554 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp1252012 
End bp1253190 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content74% 
IMG OID640473374 
Productimidazolonepropionase 
Protein accessionYP_001157956 
Protein GI145593659 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID[TIGR01224] imidazolonepropionase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.792195 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGTC TGCTGCTGGA CAACATCGGA GAACTGGTCA CGAACACCGC TGCCGGCGAG 
GGGCCGCTGG GCATCCACCG CGACGCCGCC GTGCTCGTCG AGGACGGCGT GGTGGCCTGG
GTCGGCCCGA ACGGGAGTGC ACCCGCCGCC GACCGACGCA TCGACGCCGA GTCGGCGGCC
GTGCTACCCG GCTTCGTGGA CAGCCACGCC CACCTGGTCT TCGCCGGGGA CCGGGCCGTC
GAGTTCGCCG CCCGGATGGC CGGCCAGCCG TACACCGGCG GCGGTATCCG CACCACGGTC
GGCGCGACCC GGGCCGCCAC CGACGACGAG CTACGCGCCA CGGCCCACCG GCTACACGCG
GAGGCGCTGC GGCAGGGCAC CACCACCATC GAGATCAAGA GCGGGTACGG CCTCACCGTC
GTCGACGAGG CCCGCTCGCT GCGGATCGCC GCCGAGGTGA GCTCGGAGAC CACGTTCCTC
GGGGCGCACC TCGTCCCCGC CGAGTACGCC GACCGGCCGG ACGACTACGT CGACCTGGTC
TGCGGCCCGA TGCTGGCCGC CGCCGCGCCG TATGCCCGCT GGGTGGACGT CTTCTGTGAG
CGGGGCGCCT TCGACGCCGA CCACACTCGC GCGATCCTGA CCCGTGGGCA GGCCGCCGGG
CTGGGCACCC GGCTGCACGC CAACCAGCTC GGCCCGGGCC CGGGGGTCCA GCTCGGAGTG
GAGCTGGGGG TGGCCAGTGT CGACCACTGC ACCCACCTCA CCGACGCCGA CGTCGACGCG
CTCGCCGGGG CCGACGGCGC GACCGTCGCC ACGCTGCTGC CGGGGGCGGA GTTCTCCACC
CGCTCGCCCT ACCCGGATGC CCGGCGGCTG CTCGACGCGG GCGTGACCGT GGCGCTGGCC
ACCGACTGCA ACCCCGGATC GTCGTACACC TCCTCGGTGC CGTTCTGCAT CGCGCTCGCC
GTACGGGAGA TGCGGATGAG CCCCTCCGAG GCGGTCTGGG CGGCGACCGC CGGCGGCGCC
GCGGCGCTAC GCCGCACCGA CGTGGGCCGG CTGGCGCCCG GCTCCCAGGC CGACCTGATG
ATCCTCGACG CCCCGTCCCA CCTGCACCTG GCCTACCGGC CGGGGATCCC ACTGATCCGT
CAGGTCCTGC ACAACGGAGT ACCGCAATGT CGACCGTAG
 
Protein sequence
MSSLLLDNIG ELVTNTAAGE GPLGIHRDAA VLVEDGVVAW VGPNGSAPAA DRRIDAESAA 
VLPGFVDSHA HLVFAGDRAV EFAARMAGQP YTGGGIRTTV GATRAATDDE LRATAHRLHA
EALRQGTTTI EIKSGYGLTV VDEARSLRIA AEVSSETTFL GAHLVPAEYA DRPDDYVDLV
CGPMLAAAAP YARWVDVFCE RGAFDADHTR AILTRGQAAG LGTRLHANQL GPGPGVQLGV
ELGVASVDHC THLTDADVDA LAGADGATVA TLLPGAEFST RSPYPDARRL LDAGVTVALA
TDCNPGSSYT SSVPFCIALA VREMRMSPSE AVWAATAGGA AALRRTDVGR LAPGSQADLM
ILDAPSHLHL AYRPGIPLIR QVLHNGVPQC RP