Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_4066 |
Symbol | |
ID | 4447708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 4589906 |
End bp | 4591345 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639691897 |
Product | dihydropyrimidinase |
Protein accession | YP_833541 |
Protein GI | 116672608 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR02033] D-hydantoinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCGCA TTCCCGACCT CGTCATCGCC CACGGCACCG TGGTCAACAG CTTCGGCCGC CGGAACGCCC ACGTGGTGGT CCGCGACGGC CGCATAGAAC AGCTCATCGA TGCCGCGGAA CCCGTCCCCG CCGCTAACCG CACCATCGAC GCCACCGGAC AGCTGGTGAT TCCGGGCGGC GTGGACGGCC ACTGCCACGT GGCCCAGGTG ACCGGACGCT TCCGCACCTT GGACGACTAC CGCACCACCT CCACCGCGGC ACTGTGGGGC GGCACCACCA CCATCATCGA CTTCGGCATT CCCCGCGACG CGCAGGAAAC CCCGCTGGCG GCCGTCCTGC ACAAGAAGGA ACTGGCGCTG GAATCCCGCT GCGACGTCGC CCTCCACGGC TCCGTGGTCA GCTGGGACGA GACCGTGCCC TGGCAGTTGG AGCAGCTCGC CGCCGAGGGG GTGCGTTCAG TAAAGATGTA CACCACCAAC CGCGGCACCA CCATGGCGGA CGGCGACACC ATCCTGAAAG TGATGCGAGA GATGGTCCGG CTGGACGGCC TCACCTACAT CCACGCCGAA CACGACCCCA TCATCAGCGA CTGCACCCGG CAGCACGCCG ACGACGGCAG GATCGGCATC GAACACCTGC ACCGGACCCG CCCCGAGCTC GCCGAGGAAA TCTCGGTCAA GGAAACCCTG GCCATGGCCG AGTACACCAA GGCACCGGTG TACTTCGTGC ACCAATCCAC GCCGGGCGCC GTCGACCTGG TCACCGAGGC ACGGGCCCGC GGCCAGGAAG CCTTCTCCGA GACGTGCCCG CACTACCTCA CCCTCGATGA CACGGTGTAC GGCTCCGCTT TCCCCGAATG GTACGCCTGC TGCCCGCCCA TGCGCAGCCC CGAAACCGTT GCCGCGCTCA AGGAGCGGCT GGCCGACGGC GCCATCCACA CGGTTTCCTC GGACCACTCC TGCTACGACC TCTCCCAGAA GCGCGAGCGC ACCGATGACA TCCGCGCCAT GCCGCACGGC CTGCCTGGCG TCGAAACCCG GATGCCCGTC ACCTTTACGG CCATGGCGTC CGCGGGCTCG TCAGTGGAGG ACTTTGTGGA GGTCTTCGCC GCCGGCCCCG CCCGCATCAA CGCGGTCCCC GGCAAGGGAA CCATCGCCGA GGGCTTCGAC GCCGACCTGG TGATCTTCGA TCCTGCCGAG GAACGGACGG TGGACGGCGG TGCGCTGCAC ATGGGCACTG ATTTCTCACC GTTCGACGGC CGCACGCTGA CCGGCTGGCC CGCCGTCGTG GTTTCCGCGG GAAGGGTGGT GCTCGACGGC GCCGGCTTCC ATGACCCCGG AGCTGTGGGA CGTTTCGTGG CCCGGAACGG CTTCCGCGAA CACCTCTCGT CCACCACGGC GTCCGCCGCC ACTCCCGCGA CCCTCTCCGC AGCTAAGTAG
|
Protein sequence | MSRIPDLVIA HGTVVNSFGR RNAHVVVRDG RIEQLIDAAE PVPAANRTID ATGQLVIPGG VDGHCHVAQV TGRFRTLDDY RTTSTAALWG GTTTIIDFGI PRDAQETPLA AVLHKKELAL ESRCDVALHG SVVSWDETVP WQLEQLAAEG VRSVKMYTTN RGTTMADGDT ILKVMREMVR LDGLTYIHAE HDPIISDCTR QHADDGRIGI EHLHRTRPEL AEEISVKETL AMAEYTKAPV YFVHQSTPGA VDLVTEARAR GQEAFSETCP HYLTLDDTVY GSAFPEWYAC CPPMRSPETV AALKERLADG AIHTVSSDHS CYDLSQKRER TDDIRAMPHG LPGVETRMPV TFTAMASAGS SVEDFVEVFA AGPARINAVP GKGTIAEGFD ADLVIFDPAE ERTVDGGALH MGTDFSPFDG RTLTGWPAVV VSAGRVVLDG AGFHDPGAVG RFVARNGFRE HLSSTTASAA TPATLSAAK
|
| |