Gene YpAngola_A0402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A0402 
SymbolpyrC 
ID5798866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp414568 
End bp415614 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content50% 
IMG OID641338409 
Productdihydroorotase 
Protein accessionYP_001605008 
Protein GI162421294 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0418] Dihydroorotase 
TIGRFAM ID[TIGR00856] dihydroorotase, homodimeric type 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.0202499 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCAC AACCCCAAAC CCTAAAAATT CGCCGCCCGG ATGACTGGCA CATTCATCTA 
CGTGATGATG AAATGCTCAG TACCGTGTTG CCCTATACCT CCGAAGTATT CGCCCGCGCT
ATTGTTATGC CAAATCTAGC CCAGCCAATT ACAACGGTTG CCAGTGCTAT TGCTTATCGG
GAGCGTATTT TAGCAGCGGT TCCTGCGGGC CATAAATTCA CCCCGTTGAT GACGTGTTAC
CTGACTAATA GCCTTGATGC TAAAGAGTTG ACCACGGGTT TTGAGCAAGG CGTTTTTACC
GCGGCCAAAC TGTATCCGGC CAATGCCACC ACCAACTCCA CTCACGGTGT ATCTGACATC
CCGGCAATTT ACCCGTTGTT TGAACAAATG CAAAAGATAG GCATGCCCCT GCTTATTCAC
GGTGAGGTAA CAGATGCGGC CGTTGACATC TTTGATCGTG AAGCCCGTTT TATTGACCAA
ATTTTAGAGC CCATTCGCCA AAAGTTTCCC GAACTAAAAA TTGTCTTTGA GCATATCACG
ACCAAAGATG CGGCAGATTA TGTGCTGGCA GGCAATCGTT TCCTTGGGGC AACCGTCACG
CCACAACACT TGATGTTTAA CCGCAATCAC ATGCTGGTAG GCGGTATTCG CCCCCACTTG
TTCTGCCTGC CAATATTGAA GCGCAGCACC CATCAGCAAG CATTGCGCGC AGCCGTCGCC
AGTGGTTCTG ATCGCTTCTT CCTTGGGACC GATTCAGCTC CCCATGCCAA ACATCGTAAA
GAGTCATCTT GCGGCTGTGC GGGTGTATTC AACGCCCCAG CGGCATTGCC TGCTTATGCT
TCCGTGTTTG AGGAACTGAA TGCATTGCAA CATCTGGAAG CGTTTTGCGC CTTAAATGGC
CCACGATTTT ATGGCTTGCC TGTTAATGAT GACGTTGTTG AATTGGTTCG CACTCCATTC
CTGCAGCCAG AAGAGATCCC ATTAGGCAAT GAATCGGTTA TTCCTTTCCT TGCGGGTCAA
ACGCTTAATT GGTCAGTGAA ACGCTAA
 
Protein sequence
MTAQPQTLKI RRPDDWHIHL RDDEMLSTVL PYTSEVFARA IVMPNLAQPI TTVASAIAYR 
ERILAAVPAG HKFTPLMTCY LTNSLDAKEL TTGFEQGVFT AAKLYPANAT TNSTHGVSDI
PAIYPLFEQM QKIGMPLLIH GEVTDAAVDI FDREARFIDQ ILEPIRQKFP ELKIVFEHIT
TKDAADYVLA GNRFLGATVT PQHLMFNRNH MLVGGIRPHL FCLPILKRST HQQALRAAVA
SGSDRFFLGT DSAPHAKHRK ESSCGCAGVF NAPAALPAYA SVFEELNALQ HLEAFCALNG
PRFYGLPVND DVVELVRTPF LQPEEIPLGN ESVIPFLAGQ TLNWSVKR