Gene Information |
Locus tag | Arth_3693 |
Symbol | |
ID | 4443694 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 4154940 |
End bp | 4156283 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639691517 |
Product | allantoinase |
Protein accession | YP_833168 |
Protein GI | 116672235 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR00857] dihydroorotase, multifunctional complex type; [TIGR03178] allantoinase |
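The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table in this record.
START_BP = 4154940
END_BP = 4156283


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints: 1344 447
```

Under those assumptions the result agrees with the listed Gene Length (1344 bp) and Protein Length (447 aa).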
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGAAG AAAGCTTTGA CCTCGTTATC CGGGGGCAGC GTATCCTCAC CACGGCCGGC ATCGCACCCC GGGAAGTGGG CGTGCGCGGC GGCAAGATCG TGGCCATCGA ACCGCTCGGC AACGGCCTGG CCGGCGCCGA AGTGATCGAA CTCGCCGACG ACGAAACCTT GATCCCCGGC CTGGTGGACA CCCACGTCCA CGTCAACGAG CCCGGCCGCA CCGAATGGGA GGGCTTCGCG TCCGCCACCC GGGCCGCGGC AGCCGGCGGC GTCACCACCA TCATCGACAT GCCGCTGAAC TCCATCCCGC CCACCACCAC CGTTGAAGGC CTTAAGCTCA AGCGCGAAGT GGCCGAGGAC CAGGCGTTCG TGGACGTCGG CTTCTGGGGC GGCGCCGTGC CCGGCAACAA GGCCGACCTG CGCCCGCTGC ACGACGAAGG TGTGTTCGGT TTCAAGTGCT TCCTGCTGCA CTCCGGCGTG GACGAGTTCC CGCACCTGGA GGCGGACGAG ATGGAAGAGG ACATGGCCGA GCTCAAGTCC TTCGACTCGC TCATGATCGT CCACGCCGAG GACTCGCACG CCATTGACCG CGCACCGCAT CCGGGCGGCG ACCACTACTC CACCTTCCTG GCATCCCGCC CCCGCGGCGC AGAGAACAAG GCCATCGCCG AGGTGATCGA GCGTGCCCGC TGGACGGGTG CCCGCGCCCA CATCCTGCAC CTCTCCTCTT CCGATGCGCT GCCGATGATC GCCAGCGCCA AGCGCGACGG CGTGCACCTC ACTGTGGAGA CCTGCCCGCA CTACCTCACC CTGATGGCCG AGGAGATCCC CGACGGCGCC ACCGCCTACA AGTGCTGCCC GCCCATCCGC GAGGCCTCCA ACCGCGAGCT CCTCTGGAAG GGACTGCAAG ACGGCACCAT CGACTGCATC GTCTCCGACC ACTCCCCGTC CACGCTTGAC CTGAAGGATC TGGAAAACGG CGACTTCGCT GTGGCCTGGG GCGGCGTCTC CTCGCTGCAG CTTGGCCTGT CGCTGATCTG GACCGAGGCC CGGCACCGCA ACATCCCGCT GGAGCAGGTT GTTTCGTGGA TGGCAGAGAA GCCGGCCGCC CTGGCACGAC TCTCAAACAA GGGCCAGCTG GCGCTCGGTT TCGACGCCGA CTTCTCGGTC TTCGCGCCCG ATGAGGCCTT CGTGGTGGAC GTTTCCAAGC TCAAGCACAA GAACCCCATC ACGCCCTACG ACGGCAAGGC ACTCTCCGGC GTGGTCCGGA AGACATTCCT GCGCGGACAT GAAATCGATG GCCAGACCCC CGGCGGCAAG CTGATCCGCC GCGGCGGCGT CTGA
|
Protein sequence | MSEESFDLVI RGQRILTTAG IAPREVGVRG GKIVAIEPLG NGLAGAEVIE LADDETLIPG LVDTHVHVNE PGRTEWEGFA SATRAAAAGG VTTIIDMPLN SIPPTTTVEG LKLKREVAED QAFVDVGFWG GAVPGNKADL RPLHDEGVFG FKCFLLHSGV DEFPHLEADE MEEDMAELKS FDSLMIVHAE DSHAIDRAPH PGGDHYSTFL ASRPRGAENK AIAEVIERAR WTGARAHILH LSSSDALPMI ASAKRDGVHL TVETCPHYLT LMAEEIPDGA TAYKCCPPIR EASNRELLWK GLQDGTIDCI VSDHSPSTLD LKDLENGDFA VAWGGVSSLQ LGLSLIWTEA RHRNIPLEQV VSWMAEKPAA LARLSNKGQL ALGFDADFSV FAPDEAFVVD VSKLKHKNPI TPYDGKALSG VVRKTFLRGH EIDGQTPGGK LIRRGGV
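The relationship between the gene sequence and the protein sequence can be checked with a short script. This is a minimal sketch, not the pipeline used to produce this record: the helper names are hypothetical, the sequences are assumed to be pasted in as strings with the 10-character spacing intact, and the codon table is the standard one. Translation table 11 assigns the same amino acids to codons as the standard table (it differs in permitted start codons), and this gene begins with ATG.

```python
# Standard codon table, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate("ATGTCTGAAG AAAGCTTTGA CCTCGTTATC"))  # prints: MSEESFDLVI
```

Passing the full gene sequence to `translate` and comparing the result with `clean()` applied to the protein sequence gives an end-to-end check; `gc_content` on the full gene sequence can be compared with the GC content field.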