Gene Arth_2265 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2265 
SymbolpyrC 
ID4445256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2551276 
End bp2552664 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content69% 
IMG OID639690074 
Productdihydroorotase 
Protein accessionYP_831745 
Protein GI116670812 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.360956 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACACG ATCAACAGGA TGCAGCCACC ACCGGGGCAT ACCTCATCCG CGGCGCAGCG 
ATTCTTGGCG GCGCAGCCGA AGACCTCCTG ATCCGCGACG GCATTATCGC CGCGCGCGGC
ACCGACCTGG CCGGCACCGA CGCAGCTGAC GGCGCCACCG TCATCGAGGC CGCCGGCCTG
GTGGCACTGC CCGGCATGGT GGACATCCAC ACCCACCTCC GCGAACCCGG ACGGGAAGAC
GCCGAAACAG TGGAAACCGG CACCCGCGCC GCCGCGCTTG GCGGCTACAC GGCCGTGCAC
GCCATGGCCA ACAGTAATCC CGTTGCGGAC ACCGCAGGCG TCGTGGAACA GGTGCACACC
CTGGGACGTG CCGCCGGCTG GGTGGACGTC CGCCCAGTTG GCGCGGTGAC CGTGGGCCTC
GCAGGCGAAC AACTGGCCGA GCTCGGCGCC ATGGCTGATT CCCGTGCCAA GGTCCGGGTC
TTTTCCGACG ACGGCATCTG CGTCCACGAC CCCGTGATCA TGCGCCGTGC GCTGGAATAT
GTGAAGGCGT TCGACGGCGT GGTGGCACAG CACGCGCAGG AACCGCGGCT CACCGCGGGC
GCCCAGATGA ACGAAGGCGA CGTCTCGGCA GTTCTCGGAC TGACGGGCTG GCCCGCCGTG
GCCGAGGAAA GCATCATTGC CCGGGACGTG CTGCTCGCCC AGCACGTCGG GTCCAGGCTG
CACGTCTGCC ACGTTTCCAC GGCAGGCTCG GTGGAAATCA TCCGCTGGGC CAAGGCCCGC
GGGATCAATG TGACGGCCGA AGTGACGCCG CACCACCTGC TCCTGACCGA TGAACTGGTC
CGCAGCTACG ACCCCGTCTA CAAGGTCAAC CCGCCGCTGC GCACGGACTC CGACGTCCAG
GCCCTGCGTG CCGCCCTGGC CGACGGCACG ATCGACGTCG TCGGAACCGA CCACGCCCCG
CACCCGAGCG AACACAAGGA ATGCGAGTGG GCGCAGGCGG CCATGGGCAT GACGGGGCTG
GAAACGGCGC TGTCCGTCGT CCAGGAAACC ATGATCGAGA CCGGCCTGAT GGGCTGGGCC
GACTTTGCCC GGGTGACGTC CACCGCTCCC GCCGTGATCG GACGGGTGGC GGACCAGGGA
CGTCCGCTGG AGGCCGGCGA ACCCGCAAAC GTCACACTGG TGGACCCGGC TGCGCGCTGG
ACCGTGGACC CTTCTAAGAT GGCAACCATG GGCCGTAACT CTCCGTTCGC CGGCAGGGAA
CTCCCCGGCA AGGTAGTGGC GACGTTCTTC AAGGGCCACC CCACCGTCCT TAACGGCGAG
CTCAACACCC CGTACCGCCA CGGGCCCCAC CAGGAAACAA CCGCCGCCGC ACCGGCGGGC
GGGTACTGA
 
Protein sequence
MAHDQQDAAT TGAYLIRGAA ILGGAAEDLL IRDGIIAARG TDLAGTDAAD GATVIEAAGL 
VALPGMVDIH THLREPGRED AETVETGTRA AALGGYTAVH AMANSNPVAD TAGVVEQVHT
LGRAAGWVDV RPVGAVTVGL AGEQLAELGA MADSRAKVRV FSDDGICVHD PVIMRRALEY
VKAFDGVVAQ HAQEPRLTAG AQMNEGDVSA VLGLTGWPAV AEESIIARDV LLAQHVGSRL
HVCHVSTAGS VEIIRWAKAR GINVTAEVTP HHLLLTDELV RSYDPVYKVN PPLRTDSDVQ
ALRAALADGT IDVVGTDHAP HPSEHKECEW AQAAMGMTGL ETALSVVQET MIETGLMGWA
DFARVTSTAP AVIGRVADQG RPLEAGEPAN VTLVDPAARW TVDPSKMATM GRNSPFAGRE
LPGKVVATFF KGHPTVLNGE LNTPYRHGPH QETTAAAPAG GY