Gene Sare_1847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1847 
SymbolpyrC 
ID5704710 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2130028 
End bp2131305 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content73% 
IMG OID641271348 
Productdihydroorotase 
Protein accessionYP_001536723 
Protein GI159037470 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.140759 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGCGT ATCTGATCAC CAATGTGAGT GTCCTCGGTG CCGCGCCGAC CGACCTGCTG 
ATCCGCGACG GCGTCGTGGC CGAGGCCGGC ACGGGCCTGG CCGCCCCGGA TGCCGTCGTG
GTCAACGGTA CCGGGCTGGT CGCCCTGCCC GGCCTGGTCG ACCTGCACAC TCACCTGCGC
GAGCCCGGCC GGGAGGACGC CGAAACCGTC GAGACCGGCT CCCGGGCGGC GGCGCTCGGC
GGCTACACGG CGGTCTGCGC GATGGCGAAC ACCTCCCCGG TCGCCGACAC CGCCGGTGTG
GTCGAGCAGG TCTGGCGGCT GGGCCGGGAG GCCGGGCTGG TGGACGTGCA GCCGATCGGC
GCGGTCACGG TCGGCCTGGC CGGTGAGCGG CTGGCCGAGT TGGGTGCGAT GGCCGACTCC
GCCGCCCGGG TACGGGTCTT CTCCGACGAC GGGCACTGCG TCGCCGATCC CCGGTTGATG
CGTCGGGCCC TGGAGTACGT GAAGGCGTTC GACGGAATCG TCGCCCAGCA CGCCGAGGAG
CCACGGCTGA CCGAGGGTGC CCAGATGCAC GAGGGTGAGG TCGCCACCCG CCTCGGCCTG
ACCGGTTGGC CGGCGGTCGC CGAGGAGGCG ATCATCGCCC GGGACGTGTT GCTCGCCGAG
CATGTGGGCA GCCGCCTGCA CGTCTGCCAC GTCTCCACGG CGGGCAGCGT CGGGGTGCTG
CGGCAGGCCA AGGCCCGAGG CGTCCAGGTC ACCGCGGAGG TCACCCCGCA CCACCTGCTG
CTGACCGATG AGAAGGCGGC CACGTACGAC CCGGTCTACA AGGTCAACCC ACCGCTGCGG
ACCGCCGCCG ACATCGCCGC GCTGCGCACC GCACTGGCCG AGGGCATCAT CGACATCGTC
GCCACCGACC ACGCCCCGCA CGCGGTGGAG GACAAGGAGT GCGAGTGGGC GTACGCCCGG
CCGGGCATGC TCGGCCTGGA GACGGCGCTG TCCATCGCGC TGGACGTGCT CGGCCCGCAG
TGGGACCTCA TCGCCGAGCG GATGTCCCGT GCCCCCGCCC GGATCGCGGG CCTGGCCGAG
CACGGCCACG ACCCGGCACC GGGCGCACCG GCGAACCTGA CGCTGGTGGA TCCGGCGGCC
CGCCGTACGG TCGAGCCGAC CGAGTTGGCC AGCCGTAGCC GCAACACCCC GTACGCCCGC
ATGACGCTGC CGGGTCGCAT CGTGGCGACC TTCCTGCGCG GTGTGGCGAC GGTTCTGGAC
GGAAAGGCAG TGAAGTGA
 
Protein sequence
MTAYLITNVS VLGAAPTDLL IRDGVVAEAG TGLAAPDAVV VNGTGLVALP GLVDLHTHLR 
EPGREDAETV ETGSRAAALG GYTAVCAMAN TSPVADTAGV VEQVWRLGRE AGLVDVQPIG
AVTVGLAGER LAELGAMADS AARVRVFSDD GHCVADPRLM RRALEYVKAF DGIVAQHAEE
PRLTEGAQMH EGEVATRLGL TGWPAVAEEA IIARDVLLAE HVGSRLHVCH VSTAGSVGVL
RQAKARGVQV TAEVTPHHLL LTDEKAATYD PVYKVNPPLR TAADIAALRT ALAEGIIDIV
ATDHAPHAVE DKECEWAYAR PGMLGLETAL SIALDVLGPQ WDLIAERMSR APARIAGLAE
HGHDPAPGAP ANLTLVDPAA RRTVEPTELA SRSRNTPYAR MTLPGRIVAT FLRGVATVLD
GKAVK