Gene Arth_3641 details


Gene Information

Locus tag: Arth_3641
Symbol:
ID: 4443642
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 4091224
End bp: 4093254
Gene Length: 2031 bp
Protein Length: 676 aa
Translation table: 11
GC content: 66%
IMG OID: 639691465
Product: 5-oxoprolinase (ATP-hydrolyzing)
Protein accession: YP_833116
Protein GI: 116672183
COG category: [E] Amino acid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit
TIGRFAM ID:
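
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4091224
end_bp = 4093254

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# which does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2031
print(protein_length)  # 676
```

Both results match the listed Gene Length (2031 bp) and Protein Length (676 aa).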


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATATCG GCGGAACGTT CACGGACATC GTGGCGTACG ACCAGGCTGC GGGGACCTAT 
GAGGCGACGA AAGCATCCAC CACGCCAGGC AACCTGAGCG CCGGCGTCAT CGCCGGCCTC
GAATCGATTG TTAGCGACCT GTCCGACATC GAATTCCTGG TGCACGGCAC TACCCAGGGC
CTTAACGCTT TCCTCGAACG CCGCGGCGTT CCCGTGCTGC TGCTGGCTAC TGCCGGTGTC
GAGGACACCT ACCACATTGC CCGCGGCCCC CGCCTCGAGC TCTACAACGC CCAGTACCGC
AAGCCGGCTC CCCTCGTCGA GCGGAAGGAC GTCATCGGGA TCGGCGGACG CCTGGACGGT
CAGGGCCACG TGATCCGGCC GCTCGACGAA GTGGCAGTAC GCCAGGCGGC CCGCCGGGCA
CTGGACGAAG GATACGGCGC CGTGGCCGTT GCCTTCCTCT TCAGCTACAA GAACCCGGCC
CACGAACTTC GTGCCCGCGA AATCCTGCTC GAAGAACTGG GTGAAGACTT CACCATCTCC
CTTTCGCATG AAGCAGCCAA GGAATGGCGC GAATACGAGC GCACCTCGTC CGCCGTCGTC
GAGGCGTACA CCGGCCCCGT GGTCCGCAAC TACCTCCTGG ACCTGGAGGA GAAGCTTGCC
GACCGCGGCG TGGAGGCTCC CCTGCACATC ATGCAGTCCT CCGGCGGCGT GCTCACCGCC
GAGTCCGCCC GCAAGCGTCC CCTTCAGACC CTGCTGTCCG GGCCGGTGGG CGGCGCCATG
GGCGATGTCG AACTGGCCGG CGTGTCCGGC AACCGCAACC TCATCGGTGT TGACATGGGC
GGCACCTCCT TCGACGTCTC CCTGGTAGTC GACGGGAAAC CCGATGTGTC CACCGAAGCC
CACCTCGAAG GCCTGCCAAT GCTCATGAGC GTCGTCAACA TCCACACCGT GGGCGCCGGC
GGCGGCTCTG TGGCATGGCT CGAAGCCGGC GGCCTCCGGG TGGGCCCGCG TTCGGCCGGC
GCCACCCCTG GGCCGGCCTG CTACGGCCGC GGCGGCACCG AACCCACCGT CACCGATGCC
AACCTGGTGC TCGGCCGGGT CGACCCCGAC TGGTTCGCAG GCGGACAGGT CACCCTGGAC
CGGGAAGCAG CCGTCACCGC CCTCAAAACC GTGGGAGACC AGCTCGGCCT GGATCCGATC
GCCATGGCCG AAGGCATCTG CGACGTCGCC AACTCCCAGA TGGCCCAGGC CATCAGGACC
ATCACCATCT CCCGCGGCAT CGAACCCCGC GACTTTGCCC TCGTCGCCTT CGGCGGCGCA
GGACCAATGC ATGCAGTCTT CCTCGCCAAG GAGCTTGGAA TACCGGAAAC CGTGGTTCCC
CGTTTCCCCG GCGCGTTCTC GGCGTGGGGC ATGCTGCAGA CGAACATCCG CAAGGACTTC
TCCGAACCGT ACTTCTTCCT CGACGAGGAC ATCGACGTGG CTGACATGGC CGGTGTCCTG
CGAAGAATGG AAGTTGAGGG GCTGAGGTCC CTTGTCTCCG AGGGCGTCCC CGAGGAAAGC
CGCCGCACCA CCGTCTCCGT GGACGTCCGC TACCAGTCGC AGGAATTTTC GCTGAACGTC
CCACTGACGT CCGCCGACGA ACCCGAGTCA GAGGGCTTCG TCGCCAACCT GGCTACCCGC
TTCTCCGCGA TGTACCACGA GCGCTACGGG CACTCCAACC TGGGCGCCCC CATCGAGATC
GTCGCACTGC GGACGCAGGC GGTGGGCGAC CTGGGCCGGC TGGAGGCACC GCTCTTCGCC
GCAGCACAGA GCCCGGAATT CAAGCACGAA ATGCGCCGGG TGGTCTTTGA CCACGAAGAA
CACGAGACCA CCGTGGTTCG CCGCGACGAC CTGGCCGCGG GGCACACCTT TGAAGGTCCC
GCCATCATCG TGGAGCAGAC CGCCACCACC GTGGTGCCGC CCGGGTTCAA CGTCACAGTC
GACGAGTTCG GCTCCCTGGT CATCCGCACT GAAGACGCAG AAGGAAATTG A
 
Protein sequence
MDIGGTFTDI VAYDQAAGTY EATKASTTPG NLSAGVIAGL ESIVSDLSDI EFLVHGTTQG 
LNAFLERRGV PVLLLATAGV EDTYHIARGP RLELYNAQYR KPAPLVERKD VIGIGGRLDG
QGHVIRPLDE VAVRQAARRA LDEGYGAVAV AFLFSYKNPA HELRAREILL EELGEDFTIS
LSHEAAKEWR EYERTSSAVV EAYTGPVVRN YLLDLEEKLA DRGVEAPLHI MQSSGGVLTA
ESARKRPLQT LLSGPVGGAM GDVELAGVSG NRNLIGVDMG GTSFDVSLVV DGKPDVSTEA
HLEGLPMLMS VVNIHTVGAG GGSVAWLEAG GLRVGPRSAG ATPGPACYGR GGTEPTVTDA
NLVLGRVDPD WFAGGQVTLD REAAVTALKT VGDQLGLDPI AMAEGICDVA NSQMAQAIRT
ITISRGIEPR DFALVAFGGA GPMHAVFLAK ELGIPETVVP RFPGAFSAWG MLQTNIRKDF
SEPYFFLDED IDVADMAGVL RRMEVEGLRS LVSEGVPEES RRTTVSVDVR YQSQEFSLNV
PLTSADEPES EGFVANLATR FSAMYHERYG HSNLGAPIEI VALRTQAVGD LGRLEAPLFA
AAQSPEFKHE MRRVVFDHEE HETTVVRRDD LAAGHTFEGP AIIVEQTATT VVPPGFNVTV
DEFGSLVIRT EDAEGN
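
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the tool that generated this record: the helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the codon table is written out by hand using the standard amino-acid assignments, which translation table 11 shares with the standard code. It also assumes the sequence is already in the coding orientation and does not handle the alternative start codons that table 11 allows, which is sufficient here because the gene begins with ATG.

```python
# Amino-acid assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq):
    """Remove the spaces and line breaks used to format the sequence blocks."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGGATATCG GCGGAACGTT CACGGACATC GTGGCGTACG ACCAGGCTGC GGGGACCTAT"
print(translate(first_line))  # MDIGGTFTDIVAYDQAAGTY
```

The printed residues correspond to the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.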