Gene Strop_1854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_1854 
SymbolpyrC 
ID5058313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp2119179 
End bp2120456 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content73% 
IMG OID640474124 
Productdihydroorotase 
Protein accessionYP_001158694 
Protein GI145594397 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.513427 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGCGT ATCTGATCAC CAACGTGAGC GTCCTCGGTG CCGCGCCGAC CGACCTGCTC 
ATCCGCGACG GTGTCGTGGC CGAGACCGGC GTGGGCCTGA CGGCCTCCGA CGCGGTCGTG
GTCGACGGCA CCGGCCTGGT CGCCCTGCCC GGCCTGGTGG ACCTGCACAC CCATCTGCGT
GAGCCCGGCC GGGAAGACGC CGAGACCGTG GCGACCGGCT CCCGCGCCGC GGCGCTCGGC
GGTTTCACCG CCGTCTGCGC GATGGCGAAC ACCTCCCCGG TGGCCGACAC CGCCGGTGTG
GTCGAGCAGG TCTGGCGGCT GGGCCGGGAG GCCGGGCTGG TCGACGTGCA GCCGATCGGC
GCGGTCACGG TCGGGCTGGC CGGCCAGCGC CTGGCCGAGT TGGGCGCGAT GGCCGACTCC
GCCGCCCGGG TGCGGATCTT CTCCGACGAC GGACACTGCG TCGCCGACCC GCGGTTGATG
CGCCGGGCCC TGGAGTACGT GAAGGCGTTC GACGGGATCG TTGCCCAGCA CGCCGAGGAG
CCACGGCTGA CCGAAGGCGC TCAGATGCAC GAGGGTGAGA TCTCCACCCG CCTTGGCCTG
ACTGGCTGGC CGGCGGTCGC CGAGGAGGCG ATCATCGCCC GGGACGTGCT GCTCGCCGAG
CACGTGGGTA GCCGCCTGCA CATCTGCCAC GTCTCCACGG CCGGCAGCGT CGGGGTGCTG
CGGCAGGCCA AGGCCCGCGG CGTTCAGGTC ACTGCCGAGG TCACTCCGCA CCACCTGTTG
TTGACCGACG AGAAGGCGGT TACCTACGAC CCGGTCTACA AGGTCAACCC GCCGCTGCGG
ACCGCCGCCG ATGTCGCCGC ACTGCGCACC GCGCTGGCCG AGGGGGTCGT GGACATCGTC
GCCACCGACC ACGCCCCGCA CTCCGTGGAG GACAAGGAGT GCGAGTGGGC GTATGCCCGG
CCGGGCATGC TCGGCCTGGA GACGGCGCTC TCCATCACGC TGGACGTGCT CGGCCCGCGG
TGGGACCTCA TCGCCGAGCG GATGTCCCGC ACCCCCGCCC GGATCGCTGG CCTCACCGAG
CACGGCCACG ACCCCGCGCC GGGCGCGCCG GCGAACCTGA CCCTGGTGGA TCCGGCGGCG
CGGCGCGTCG TCGAGCCGAC CGAGTTGGCC AGCCGCAGCC GCAACACCCC GTACGCCCGC
ATGACGCTGC CGGGTCGCAT CGTGGCGACC TTCCTGCGCG GCGAGGCGAC GGTCCTGGAC
GGAAAGGCAG TGAAGTGA
 
Protein sequence
MTAYLITNVS VLGAAPTDLL IRDGVVAETG VGLTASDAVV VDGTGLVALP GLVDLHTHLR 
EPGREDAETV ATGSRAAALG GFTAVCAMAN TSPVADTAGV VEQVWRLGRE AGLVDVQPIG
AVTVGLAGQR LAELGAMADS AARVRIFSDD GHCVADPRLM RRALEYVKAF DGIVAQHAEE
PRLTEGAQMH EGEISTRLGL TGWPAVAEEA IIARDVLLAE HVGSRLHICH VSTAGSVGVL
RQAKARGVQV TAEVTPHHLL LTDEKAVTYD PVYKVNPPLR TAADVAALRT ALAEGVVDIV
ATDHAPHSVE DKECEWAYAR PGMLGLETAL SITLDVLGPR WDLIAERMSR TPARIAGLTE
HGHDPAPGAP ANLTLVDPAA RRVVEPTELA SRSRNTPYAR MTLPGRIVAT FLRGEATVLD
GKAVK