Gene Arth_4066 details

Gene Information

Locus tag: Arth_4066
Symbol:
ID: 4447708
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 4589906
End bp: 4591345
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 68%
IMG OID: 639691897
Product: dihydropyrimidinase
Protein accession: YP_833541
Protein GI: 116672608
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR02033] D-hydantoinase
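
The start and end coordinates, the gene length, and the protein length in this record agree with one another. The short check below is a minimal sketch of that arithmetic. It assumes 1-based inclusive coordinates (the record does not state the convention) and a single terminal stop codon that is not counted in the protein length; the function name cds_lengths is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(4589906, 4591345)
print(gene_len, protein_len)  # 1440 479
```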


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCCGCA TTCCCGACCT CGTCATCGCC CACGGCACCG TGGTCAACAG CTTCGGCCGC 
CGGAACGCCC ACGTGGTGGT CCGCGACGGC CGCATAGAAC AGCTCATCGA TGCCGCGGAA
CCCGTCCCCG CCGCTAACCG CACCATCGAC GCCACCGGAC AGCTGGTGAT TCCGGGCGGC
GTGGACGGCC ACTGCCACGT GGCCCAGGTG ACCGGACGCT TCCGCACCTT GGACGACTAC
CGCACCACCT CCACCGCGGC ACTGTGGGGC GGCACCACCA CCATCATCGA CTTCGGCATT
CCCCGCGACG CGCAGGAAAC CCCGCTGGCG GCCGTCCTGC ACAAGAAGGA ACTGGCGCTG
GAATCCCGCT GCGACGTCGC CCTCCACGGC TCCGTGGTCA GCTGGGACGA GACCGTGCCC
TGGCAGTTGG AGCAGCTCGC CGCCGAGGGG GTGCGTTCAG TAAAGATGTA CACCACCAAC
CGCGGCACCA CCATGGCGGA CGGCGACACC ATCCTGAAAG TGATGCGAGA GATGGTCCGG
CTGGACGGCC TCACCTACAT CCACGCCGAA CACGACCCCA TCATCAGCGA CTGCACCCGG
CAGCACGCCG ACGACGGCAG GATCGGCATC GAACACCTGC ACCGGACCCG CCCCGAGCTC
GCCGAGGAAA TCTCGGTCAA GGAAACCCTG GCCATGGCCG AGTACACCAA GGCACCGGTG
TACTTCGTGC ACCAATCCAC GCCGGGCGCC GTCGACCTGG TCACCGAGGC ACGGGCCCGC
GGCCAGGAAG CCTTCTCCGA GACGTGCCCG CACTACCTCA CCCTCGATGA CACGGTGTAC
GGCTCCGCTT TCCCCGAATG GTACGCCTGC TGCCCGCCCA TGCGCAGCCC CGAAACCGTT
GCCGCGCTCA AGGAGCGGCT GGCCGACGGC GCCATCCACA CGGTTTCCTC GGACCACTCC
TGCTACGACC TCTCCCAGAA GCGCGAGCGC ACCGATGACA TCCGCGCCAT GCCGCACGGC
CTGCCTGGCG TCGAAACCCG GATGCCCGTC ACCTTTACGG CCATGGCGTC CGCGGGCTCG
TCAGTGGAGG ACTTTGTGGA GGTCTTCGCC GCCGGCCCCG CCCGCATCAA CGCGGTCCCC
GGCAAGGGAA CCATCGCCGA GGGCTTCGAC GCCGACCTGG TGATCTTCGA TCCTGCCGAG
GAACGGACGG TGGACGGCGG TGCGCTGCAC ATGGGCACTG ATTTCTCACC GTTCGACGGC
CGCACGCTGA CCGGCTGGCC CGCCGTCGTG GTTTCCGCGG GAAGGGTGGT GCTCGACGGC
GCCGGCTTCC ATGACCCCGG AGCTGTGGGA CGTTTCGTGG CCCGGAACGG CTTCCGCGAA
CACCTCTCGT CCACCACGGC GTCCGCCGCC ACTCCCGCGA CCCTCTCCGC AGCTAAGTAG
 
Protein sequence
MSRIPDLVIA HGTVVNSFGR RNAHVVVRDG RIEQLIDAAE PVPAANRTID ATGQLVIPGG 
VDGHCHVAQV TGRFRTLDDY RTTSTAALWG GTTTIIDFGI PRDAQETPLA AVLHKKELAL
ESRCDVALHG SVVSWDETVP WQLEQLAAEG VRSVKMYTTN RGTTMADGDT ILKVMREMVR
LDGLTYIHAE HDPIISDCTR QHADDGRIGI EHLHRTRPEL AEEISVKETL AMAEYTKAPV
YFVHQSTPGA VDLVTEARAR GQEAFSETCP HYLTLDDTVY GSAFPEWYAC CPPMRSPETV
AALKERLADG AIHTVSSDHS CYDLSQKRER TDDIRAMPHG LPGVETRMPV TFTAMASAGS
SVEDFVEVFA AGPARINAVP GKGTIAEGFD ADLVIFDPAE ERTVDGGALH MGTDFSPFDG
RTLTGWPAVV VSAGRVVLDG AGFHDPGAVG RFVARNGFRE HLSSTTASAA TPATLSAAK
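
The sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before any analysis. The sketch below shows one way to do that, to compute a GC fraction, and to translate a coding sequence. It uses the codon-to-amino-acid assignments of NCBI translation table 11, which match the standard code; the special handling of alternative start codons is not modeled, which does not matter here because this gene starts with ATG. The helper names clean, gc_fraction, and translate are hypothetical, and only the first 30 bases of the gene are used as the example input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ..., GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_30 = "ATGTCCCGCA TTCCCGACCT CGTCATCGCC"
print(translate(first_30))  # MSRIPDLVIA
```

The printed residues match the first block of the protein sequence listed above. Running gc_fraction on the full gene sequence would give a value to compare against the GC content reported in the record.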