Gene Arth_0095 details

Gene Information

Locus tag: Arth_0095
Symbol:
ID: 4447463
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 97771
End bp: 99438
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 11
GC content: 71%
IMG OID: 639687890
Product: urea amidolyase related protein
Protein accession: YP_829596
Protein GI: 116668663
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1984] Allophanate hydrolase subunit 2; [COG2049] Allophanate hydrolase subunit 1
TIGRFAM ID: [TIGR00370] conserved hypothetical protein TIGR00370; [TIGR00724] biotin-dependent carboxylase uncharacterized domain
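The listed lengths can be cross-checked against the coordinates. The following is a minimal sketch, assuming that the start and end positions are inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_length // 3 - 1


# Coordinates copied from the record above.
length = gene_length_bp(97771, 99438)
print(length)                     # 1668
print(protein_length_aa(length))  # 555
```

Both values agree with the Gene Length and Protein Length fields of the record.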


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGCAG CAGCCATGGA AACCCTGACC GTCCACAAGG TCCTTTCGGT GCGGCCCGTA 
GGAACGCGCG CGGTGCTCGC GGAGCTTTCC GGAACGCAGG AGGTCCTCGC CCTGCAGGCC
CTGCTCGTGG AAGCGCCGCT GCCGGGCCAG GTGGACGTAC TTGCGGCAGC GCAGACCGTC
ATGGTCCGCG CGGATTCGCC GGCGGCTGCC CGCCGCATGG GCGGGCTCCT GCTGGAGCTG
GACCTCACTG CCCGGGCCAA ACAGGACGGC GGCCTGGTGT TCATCGACAC CGTGTACGAC
GGCGAGGACC TCGCCGAAGT CGGGCGGCTC ACCGGCCTGG GGACGGACGG CGTCATCGAA
GCCCACACCG GCCAGGTCTG GACGGTGGCT TTCGCCGGGT TCGCGCCCGG CTTCGGCTAC
ATGGTGGGGG AGAACCAGAC GCTGGAAGTA CCGCGCCGGA GCTCACCGCG CACAGCCGTG
CCCGCCGGAT CCGTGGCCCT GGCCGGCAAC TATTCGGCGG TGTATCCGCG CAGGTCCCCG
GGCGGCTGGC AGCTGATCGG CCGCACCGGC GCCCGCATGT GGGACCTCAA CCGGGCCGAG
CCGGCCCTGG CCAGCCCGGG CCACCGGGTC CAGTTCCGCG CCGTCCGCGA CGTTGTGACC
ATGGCAACGG AGCCCGCAGC GTCCGACGCT GCACCCGCTG CAGGGATGCC GGAGGAGCTG
CCGGGCCAGG AAACACCTTC CGGGCTGCGG GTCGTTTCGC CCGGGCTGCA AAGCCTCCTC
CAGGACCTTG GACGGCACGG CCACTCTGCC TTGGGCGTCT CTGCCGCCGG CGCGCTGGAC
CGGGCTTCGC TGCGCAGGGC CAACCGTTTG GTGGGCAACC GTTCTTCCGC CGCAGCCATC
GAAAGCGTGG CCGGCGGTCT TCGGGTCCAG GCCGTCGGGG ACCAGGTCCT TGCCGTCGCC
GGGGCGCCAT CGGCACTGAC AATTGATTCA CCGTCGGACG CTGGATCCGA TTCTGACGCC
GGCAAGCCTG CGCGGCAGCG CACCGTCCCC ATGGCCACCC CGTTTGCCCT GCTCGACGGC
GAAATCCTTA CGCTCGGTGC ACCGGAGTCG GGGTTCCGCA GCTACCTGGC CGTCCGTGGC
GGGGTGGACG CCGCGCCGGT GCTGGGCAGC CGTTCCACCG ACACGATGTC CGGCATCGGG
CCTGCACCGC TGGCCGCCGG GCAGCTCCTG GCATCCGGCG ACGTCACCGA ATCCGGCGTG
GTGGGCAGTC CCGAACTCCA GCCGGACTTC CCCGGGGACG GCGTCACCGT CCTGGACATT
GTGCTGGGCC CGCGGGCCGA CTGGTTCGAC CAGTCCGCCA TTGACTCCCT GTGTGGGCAG
GACTGGGTGG TGAAGCCGCA GTCCAACCGG GTGGGCATGA GGCTGGACGG CACGCCCCTG
CAACGCAGCC GGCAAGGCGA ACTGCCCAGC GAAGGCACCG TGGCCGGGGC CATCCAGGTC
CCGCCCGAGG GCCTTCCCGT CCTCTTCCTG GCAGACCACC CGATCACGGG CGGCTACCCG
GTGATCGGCG TGGTGGTGGA CCACCAGCTG GACCTTGCCG CCCAGGTGCC GATCGGCGGC
AGCATCCGCT TCCGCATTGT TCCCGAACAG ACTCCCCAAG AAAAGTGA
 
Protein sequence
MKAAAMETLT VHKVLSVRPV GTRAVLAELS GTQEVLALQA LLVEAPLPGQ VDVLAAAQTV 
MVRADSPAAA RRMGGLLLEL DLTARAKQDG GLVFIDTVYD GEDLAEVGRL TGLGTDGVIE
AHTGQVWTVA FAGFAPGFGY MVGENQTLEV PRRSSPRTAV PAGSVALAGN YSAVYPRRSP
GGWQLIGRTG ARMWDLNRAE PALASPGHRV QFRAVRDVVT MATEPAASDA APAAGMPEEL
PGQETPSGLR VVSPGLQSLL QDLGRHGHSA LGVSAAGALD RASLRRANRL VGNRSSAAAI
ESVAGGLRVQ AVGDQVLAVA GAPSALTIDS PSDAGSDSDA GKPARQRTVP MATPFALLDG
EILTLGAPES GFRSYLAVRG GVDAAPVLGS RSTDTMSGIG PAPLAAGQLL ASGDVTESGV
VGSPELQPDF PGDGVTVLDI VLGPRADWFD QSAIDSLCGQ DWVVKPQSNR VGMRLDGTPL
QRSRQGELPS EGTVAGAIQV PPEGLPVLFL ADHPITGGYP VIGVVVDHQL DLAAQVPIGG
SIRFRIVPEQ TPQEK
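
The sequences above are printed in space-separated blocks of ten characters, so they need to be normalized before any computation. The sketch below shows one way to strip that layout, compute a GC fraction, and translate a coding sequence. It makes several assumptions: the codon-to-amino-acid string is the one published for NCBI translation table 11, whose elongation codons match the standard code; alternative start codons are not handled specially, which does not matter for this gene because it begins with ATG; and the helper names are illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, in TCAG codon order ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip(("".join(codon) for codon in product(BASES, repeat=3)), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence listed above.
first_line = "ATGAAAGCAG CAGCCATGGA AACCCTGACC GTCCACAAGG TCCTTTCGGT GCGGCCCGTA"
print(translate(first_line))  # MKAAAMETLTVHKVLSVRPV
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full gene sequence to `gc_fraction` and `translate` in the same way allows the GC content and protein sequence fields to be checked against the record.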