Gene Arth_3642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3642 
Symbol 
ID4443643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4093265 
End bp4094938 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content64% 
IMG OID639691466 
Product5-oxoprolinase (ATP-hydrolyzing) 
Protein accessionYP_833117 
Protein GI116672184 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAATCG CTGCAGTCAA AACCCTGGAT CCCGTAACCG TGGAGATCAT CCGCAACGCG 
CTCACCAGCG CCGCGGACGA TATGAATGCA ACCCTGATCC GCTCCGCCTA CTCGCCCATC
CTCTATGAGG GCGGTGACTG CGTGGTGGCG CTGCTGGACA AGGAACACCG TGTACTCGGA
CAATCCGCGG GGCTCCCGCT GTTCCTCGGC AACCTGGAAA CGTGCTCCAT CGCTGTGGAG
GAGCTGTACG GCCGCGAAGT CTGGCAGGAA GGGGACGTGT GGATCCTCAA CGACTCCTAC
CTTGGCGGAA CGCACCTGAA CGATGTCACC ATCTTCGCGC CGATTTTTGA TGACGGCTCG
GTGGTTGGCT TCGCCGCCAC CCGCGCGCAC TGGATGGACA TGGGGTCCAA GGATGTGGGC
GGCTCGATGG ATGCCACGGA CATCTTCCAG GAAGGCTTCC GTATGGGGCC GGTCAAGCTC
ATGGAAGCCG GCATTGAAAC CTCGGTGGTG GACCTGATCC GCACCAACGT GCGTTTCCCC
TACCAGACCA TCGGCGACAT GCACGCGATG ATCGCCGCAC TCCGGATGGG AACCACCCGG
ATGAAGGAGC TGGTGGGCCG GTACGGCATG GAGCAGCTCG ATGCTGCCCG CGATGAAATC
TTCCGCCAGA CAGAGGAGAT CGAGCGCGAA ACCGTCCGAA ACATCCCGGA CGGCGTCTAT
GAAGCCGAAG GCGTGCTGGA CAACGACGGC ATCAACCTGG ACACGCCCAT CCCCATCCGG
CTGAAGATCA CCGTTGCCGG CGACACTGTT GACTTCGACG TCACCGGCTC CGCCGACCAG
ACCATGGGCC CGGTCAACTG CGGCGCAGCC CAAGCCGTTT CGGCCCTGCG CGTGGGGTAC
AAGCTCCTCG TCAGCCCGGA CTCCAACTCC AACGGCGGAT CCTTCCGCCC ACTGACCACG
CAGGTGCGTT CCGGGTCGGT GCTCGGCGCC GTGGCACCTG CACCGTGCCA GTGGTACTTC
TCCCATCTGG GGCTGCTGAT CGACCTGGTC TCCAAGGCAA TGGCCCCCGC AATGCCTGAA
CGCGTAGCCA GCGCCAGCCA CGGCGACTCA ATGATCATCA CCGCCGCTGG CTTCGATCCC
CGCTTCGGCC GGAACTTCGT CAGCATGGAA GCCACTCTGG GCGGCTGGGG CGCCTGGCAG
GGCACGGATG GCGAATCCGC CATGATCAAC AACGTCAACG GCTCGCTCAA GGACCTGCCC
ATCGAAATGA TGGAAACCCG GTACCCGCTG CGGATCAACG AGTACTCCAT CCGGCCGAAC
TCCGGTGGCC CAGGGCAGTG GCGCGGCGGC AACGGAGTTA TCCGTGAATA CGAGTTCCTG
GCCGACTGCG TGGTAGGCCT CTGGTTCGAA AGGTCCAAGA CGCCGGCCTG GGGCCTCTTC
GGCGGTTCCG ACGCCCAGGG CCCGGAAGTG GTGATCAACC CCGGCCGGCA CGACGAGGTC
CGGACGCTGA AGGCCAACGC ACGGAAGGTC AAGGCCGGCG ACGTCGTCCG CCTGGCAGTC
GGGGGCGGTG GCGGTTTCGG AGATGTCTCC AAACGTACCC GTGAAGACAT CAAGTACGAC
ATCGTCAACG GTTTCATCAC CGAGGACTTC GCCAAGACCC ACTACGGCTA CTAA
 
Protein sequence
MTIAAVKTLD PVTVEIIRNA LTSAADDMNA TLIRSAYSPI LYEGGDCVVA LLDKEHRVLG 
QSAGLPLFLG NLETCSIAVE ELYGREVWQE GDVWILNDSY LGGTHLNDVT IFAPIFDDGS
VVGFAATRAH WMDMGSKDVG GSMDATDIFQ EGFRMGPVKL MEAGIETSVV DLIRTNVRFP
YQTIGDMHAM IAALRMGTTR MKELVGRYGM EQLDAARDEI FRQTEEIERE TVRNIPDGVY
EAEGVLDNDG INLDTPIPIR LKITVAGDTV DFDVTGSADQ TMGPVNCGAA QAVSALRVGY
KLLVSPDSNS NGGSFRPLTT QVRSGSVLGA VAPAPCQWYF SHLGLLIDLV SKAMAPAMPE
RVASASHGDS MIITAAGFDP RFGRNFVSME ATLGGWGAWQ GTDGESAMIN NVNGSLKDLP
IEMMETRYPL RINEYSIRPN SGGPGQWRGG NGVIREYEFL ADCVVGLWFE RSKTPAWGLF
GGSDAQGPEV VINPGRHDEV RTLKANARKV KAGDVVRLAV GGGGGFGDVS KRTREDIKYD
IVNGFITEDF AKTHYGY