Gene Arth_3693 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3693 
Symbol 
ID4443694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4154940 
End bp4156283 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content67% 
IMG OID639691517 
Productallantoinase 
Protein accessionYP_833168 
Protein GI116672235 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type
[TIGR03178] allantoinase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGAAG AAAGCTTTGA CCTCGTTATC CGGGGGCAGC GTATCCTCAC CACGGCCGGC 
ATCGCACCCC GGGAAGTGGG CGTGCGCGGC GGCAAGATCG TGGCCATCGA ACCGCTCGGC
AACGGCCTGG CCGGCGCCGA AGTGATCGAA CTCGCCGACG ACGAAACCTT GATCCCCGGC
CTGGTGGACA CCCACGTCCA CGTCAACGAG CCCGGCCGCA CCGAATGGGA GGGCTTCGCG
TCCGCCACCC GGGCCGCGGC AGCCGGCGGC GTCACCACCA TCATCGACAT GCCGCTGAAC
TCCATCCCGC CCACCACCAC CGTTGAAGGC CTTAAGCTCA AGCGCGAAGT GGCCGAGGAC
CAGGCGTTCG TGGACGTCGG CTTCTGGGGC GGCGCCGTGC CCGGCAACAA GGCCGACCTG
CGCCCGCTGC ACGACGAAGG TGTGTTCGGT TTCAAGTGCT TCCTGCTGCA CTCCGGCGTG
GACGAGTTCC CGCACCTGGA GGCGGACGAG ATGGAAGAGG ACATGGCCGA GCTCAAGTCC
TTCGACTCGC TCATGATCGT CCACGCCGAG GACTCGCACG CCATTGACCG CGCACCGCAT
CCGGGCGGCG ACCACTACTC CACCTTCCTG GCATCCCGCC CCCGCGGCGC AGAGAACAAG
GCCATCGCCG AGGTGATCGA GCGTGCCCGC TGGACGGGTG CCCGCGCCCA CATCCTGCAC
CTCTCCTCTT CCGATGCGCT GCCGATGATC GCCAGCGCCA AGCGCGACGG CGTGCACCTC
ACTGTGGAGA CCTGCCCGCA CTACCTCACC CTGATGGCCG AGGAGATCCC CGACGGCGCC
ACCGCCTACA AGTGCTGCCC GCCCATCCGC GAGGCCTCCA ACCGCGAGCT CCTCTGGAAG
GGACTGCAAG ACGGCACCAT CGACTGCATC GTCTCCGACC ACTCCCCGTC CACGCTTGAC
CTGAAGGATC TGGAAAACGG CGACTTCGCT GTGGCCTGGG GCGGCGTCTC CTCGCTGCAG
CTTGGCCTGT CGCTGATCTG GACCGAGGCC CGGCACCGCA ACATCCCGCT GGAGCAGGTT
GTTTCGTGGA TGGCAGAGAA GCCGGCCGCC CTGGCACGAC TCTCAAACAA GGGCCAGCTG
GCGCTCGGTT TCGACGCCGA CTTCTCGGTC TTCGCGCCCG ATGAGGCCTT CGTGGTGGAC
GTTTCCAAGC TCAAGCACAA GAACCCCATC ACGCCCTACG ACGGCAAGGC ACTCTCCGGC
GTGGTCCGGA AGACATTCCT GCGCGGACAT GAAATCGATG GCCAGACCCC CGGCGGCAAG
CTGATCCGCC GCGGCGGCGT CTGA
 
Protein sequence
MSEESFDLVI RGQRILTTAG IAPREVGVRG GKIVAIEPLG NGLAGAEVIE LADDETLIPG 
LVDTHVHVNE PGRTEWEGFA SATRAAAAGG VTTIIDMPLN SIPPTTTVEG LKLKREVAED
QAFVDVGFWG GAVPGNKADL RPLHDEGVFG FKCFLLHSGV DEFPHLEADE MEEDMAELKS
FDSLMIVHAE DSHAIDRAPH PGGDHYSTFL ASRPRGAENK AIAEVIERAR WTGARAHILH
LSSSDALPMI ASAKRDGVHL TVETCPHYLT LMAEEIPDGA TAYKCCPPIR EASNRELLWK
GLQDGTIDCI VSDHSPSTLD LKDLENGDFA VAWGGVSSLQ LGLSLIWTEA RHRNIPLEQV
VSWMAEKPAA LARLSNKGQL ALGFDADFSV FAPDEAFVVD VSKLKHKNPI TPYDGKALSG
VVRKTFLRGH EIDGQTPGGK LIRRGGV