Gene Information |
Locus tag | Arth_0095 |
Symbol | |
ID | 4447463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 97771 |
End bp | 99438 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639687890 |
Product | urea amidolyase related protein |
Protein accession | YP_829596 |
Protein GI | 116668663 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1984] Allophanate hydrolase subunit 2; [COG2049] Allophanate hydrolase subunit 1 |
TIGRFAM ID | [TIGR00370] conserved hypothetical protein TIGR00370; [TIGR00724] biotin-dependent carboxylase uncharacterized domain |
|
|
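The coordinate and length fields above are consistent with one another. The sketch below shows the arithmetic, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS is unspliced and ends in a stop codon. The function names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 97771
END_BP = 99438


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of codons minus the terminal stop codon.

    Assumes an unspliced, complete CDS whose length is a multiple of 3.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1668, matching "Gene Length"
print(protein_length(length_bp))  # 555, matching "Protein Length"
```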
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCAG CAGCCATGGA AACCCTGACC GTCCACAAGG TCCTTTCGGT GCGGCCCGTA GGAACGCGCG CGGTGCTCGC GGAGCTTTCC GGAACGCAGG AGGTCCTCGC CCTGCAGGCC CTGCTCGTGG AAGCGCCGCT GCCGGGCCAG GTGGACGTAC TTGCGGCAGC GCAGACCGTC ATGGTCCGCG CGGATTCGCC GGCGGCTGCC CGCCGCATGG GCGGGCTCCT GCTGGAGCTG GACCTCACTG CCCGGGCCAA ACAGGACGGC GGCCTGGTGT TCATCGACAC CGTGTACGAC GGCGAGGACC TCGCCGAAGT CGGGCGGCTC ACCGGCCTGG GGACGGACGG CGTCATCGAA GCCCACACCG GCCAGGTCTG GACGGTGGCT TTCGCCGGGT TCGCGCCCGG CTTCGGCTAC ATGGTGGGGG AGAACCAGAC GCTGGAAGTA CCGCGCCGGA GCTCACCGCG CACAGCCGTG CCCGCCGGAT CCGTGGCCCT GGCCGGCAAC TATTCGGCGG TGTATCCGCG CAGGTCCCCG GGCGGCTGGC AGCTGATCGG CCGCACCGGC GCCCGCATGT GGGACCTCAA CCGGGCCGAG CCGGCCCTGG CCAGCCCGGG CCACCGGGTC CAGTTCCGCG CCGTCCGCGA CGTTGTGACC ATGGCAACGG AGCCCGCAGC GTCCGACGCT GCACCCGCTG CAGGGATGCC GGAGGAGCTG CCGGGCCAGG AAACACCTTC CGGGCTGCGG GTCGTTTCGC CCGGGCTGCA AAGCCTCCTC CAGGACCTTG GACGGCACGG CCACTCTGCC TTGGGCGTCT CTGCCGCCGG CGCGCTGGAC CGGGCTTCGC TGCGCAGGGC CAACCGTTTG GTGGGCAACC GTTCTTCCGC CGCAGCCATC GAAAGCGTGG CCGGCGGTCT TCGGGTCCAG GCCGTCGGGG ACCAGGTCCT TGCCGTCGCC GGGGCGCCAT CGGCACTGAC AATTGATTCA CCGTCGGACG CTGGATCCGA TTCTGACGCC GGCAAGCCTG CGCGGCAGCG CACCGTCCCC ATGGCCACCC CGTTTGCCCT GCTCGACGGC GAAATCCTTA CGCTCGGTGC ACCGGAGTCG GGGTTCCGCA GCTACCTGGC CGTCCGTGGC GGGGTGGACG CCGCGCCGGT GCTGGGCAGC CGTTCCACCG ACACGATGTC CGGCATCGGG CCTGCACCGC TGGCCGCCGG GCAGCTCCTG GCATCCGGCG ACGTCACCGA ATCCGGCGTG GTGGGCAGTC CCGAACTCCA GCCGGACTTC CCCGGGGACG GCGTCACCGT CCTGGACATT GTGCTGGGCC CGCGGGCCGA CTGGTTCGAC CAGTCCGCCA TTGACTCCCT GTGTGGGCAG GACTGGGTGG TGAAGCCGCA GTCCAACCGG GTGGGCATGA GGCTGGACGG CACGCCCCTG CAACGCAGCC GGCAAGGCGA ACTGCCCAGC GAAGGCACCG TGGCCGGGGC CATCCAGGTC CCGCCCGAGG GCCTTCCCGT CCTCTTCCTG GCAGACCACC CGATCACGGG CGGCTACCCG GTGATCGGCG TGGTGGTGGA CCACCAGCTG GACCTTGCCG CCCAGGTGCC GATCGGCGGC AGCATCCGCT TCCGCATTGT TCCCGAACAG ACTCCCCAAG AAAAGTGA
|
Protein sequence | MKAAAMETLT VHKVLSVRPV GTRAVLAELS GTQEVLALQA LLVEAPLPGQ VDVLAAAQTV MVRADSPAAA RRMGGLLLEL DLTARAKQDG GLVFIDTVYD GEDLAEVGRL TGLGTDGVIE AHTGQVWTVA FAGFAPGFGY MVGENQTLEV PRRSSPRTAV PAGSVALAGN YSAVYPRRSP GGWQLIGRTG ARMWDLNRAE PALASPGHRV QFRAVRDVVT MATEPAASDA APAAGMPEEL PGQETPSGLR VVSPGLQSLL QDLGRHGHSA LGVSAAGALD RASLRRANRL VGNRSSAAAI ESVAGGLRVQ AVGDQVLAVA GAPSALTIDS PSDAGSDSDA GKPARQRTVP MATPFALLDG EILTLGAPES GFRSYLAVRG GVDAAPVLGS RSTDTMSGIG PAPLAAGQLL ASGDVTESGV VGSPELQPDF PGDGVTVLDI VLGPRADWFD QSAIDSLCGQ DWVVKPQSNR VGMRLDGTPL QRSRQGELPS EGTVAGAIQV PPEGLPVLFL ADHPITGGYP VIGVVVDHQL DLAAQVPIGG SIRFRIVPEQ TPQEK
|
| |
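The gene and protein sequences above can be related with a small translation sketch. It builds the codon table in NCBI order; translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as starts, which does not matter here because this gene begins with ATG. The helper names `translate` and `gc_percent` are hypothetical, and only short excerpts from the start and end of the gene sequence are used.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
# Table 11 uses these same amino-acid assignments; only start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Excerpts copied from the record: first 30 and last 18 bases of the gene.
HEAD = "ATGAAAGCAG CAGCCATGGA AACCCTGACC"
TAIL = "ACTCCCCAAG AAAAGTGA"

print(translate(HEAD))  # MKAAAMETLT
print(translate(TAIL))  # TPQEK
```

Passing the full gene sequence to `translate` and `gc_percent` would let the protein sequence and GC content fields be checked the same way.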