Gene Arth_3597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3597 
Symbol 
ID4443908 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4036949 
End bp4038286 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content68% 
IMG OID639691421 
Productamidohydrolase 
Protein accessionYP_833072 
Protein GI116672139 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGCCTGC ATGATCACAC CCCTGCCGCT GTGCCCCCCG CTGCGGTTCC GCGCCGCGGC 
ATCCTCGCCG GGGCGGCCGC CCTCGCCGGA ATCTCAGTGG CAAGCCTTGC CGCTCAGCTG
GGAGGTGCCC CGGCCGCGAA AGCGGAGGGC AACGCATCCG GTCCCGGCTA CCCCGGCCGG
CCCGCTTCAC CGCAGCCCAT GATCATCGAG GGCGGCACCA TAGTGGACCC CATGACAGGC
GACGCCGTGC CAGACGGGGT CCTTGTCCTG GAAGGCGGTA AGGTCACCGC CGTCGGCAGC
CGGGAGGAAA CACGGCGTGC CGTGGCCGCA CTTGCCGGGC GCGCCCGCAT CGTGAATGCC
GCCGGCCGCT TTGTCCTCCC CGGCCTCATC GACGTGCACG TCCATGCCAA CGCACTGGCC
GATGCACGGG CCATCCTGCA GGGCGGCGCG ACCAGCGTCA GGAGCGGGTC GAGCAGCTTC
TACCAGGACG TCGCCCTTGC TGCCCTGCCC GCCTGGGCCG CCGGCGCCTC GCCCCGGATG
AGCCCGGCCG GGCTGTTCAT TTCACCCGAA CTCGGGGACT CGCTCCTTGC AGACCCGGAC
CTTGCACCAC TCGCCTCCTT GTCCGGGGGA GTCGCCGAAC CATCGGACCT TGCCTACCTC
ACCCGCACCA ACCTGAAACG CGGCGCCCAG GTGATCAAAA CCCGGGCCAA CCCCCGGGCG
GGGCTGCCGG AACAGGACCC CCGCGAACTC GTCTACACCT ACGATCAGCT CTCTGCCGTG
GTCACGGCCG CAGGAAAAGC AGGAGTGCTG TGCCACGCCT ACAGCGCGGA AGGGATCGAC
GGCGCAGTCC GGGCCGGTGT TCGGAGCATC GAGCACGGCG TCTTCGTCAC CGAGGAGACC
ATTTCCCGGA TGGCCCGCCG AGGGACCTAC TTCACGCCAA CCATGGATGC CATCACAGGG
ATGGCGGGCT CGCCCAATCC CATCCTCGCC GCCCGCGGCA AGGAATACAC GCCCATCATC
AAAGCCGCCG TCAAAGCTGC CCATGAAGCC GGTGTGACCA TCGTTGCCGG GACGGACTCT
TTCGGATCCG ACGTAACCCC GATTGGCACG GAAGTCCGTC TTCTTGCCGA GGCAGGGCTG
TCTCCGCTGG AAGCGCTCAG GGCAGCGACG GTGAACGCCG CCGCCTTGCT CGGCTGGGGT
GAGAGCGCGG GCCGACTGAT GCGCGGTTCC TTCGCCGACG CCGTCATTGT GGACTCCGAT
CCGTTGAGCA GCGCTTCGGC CCTGGAAGAA ATCCGGGCTG TGGTGGCACA GGGGGTCCTT
GTCCGGAACG ACCTCTGA
 
Protein sequence
MCLHDHTPAA VPPAAVPRRG ILAGAAALAG ISVASLAAQL GGAPAAKAEG NASGPGYPGR 
PASPQPMIIE GGTIVDPMTG DAVPDGVLVL EGGKVTAVGS REETRRAVAA LAGRARIVNA
AGRFVLPGLI DVHVHANALA DARAILQGGA TSVRSGSSSF YQDVALAALP AWAAGASPRM
SPAGLFISPE LGDSLLADPD LAPLASLSGG VAEPSDLAYL TRTNLKRGAQ VIKTRANPRA
GLPEQDPREL VYTYDQLSAV VTAAGKAGVL CHAYSAEGID GAVRAGVRSI EHGVFVTEET
ISRMARRGTY FTPTMDAITG MAGSPNPILA ARGKEYTPII KAAVKAAHEA GVTIVAGTDS
FGSDVTPIGT EVRLLAEAGL SPLEALRAAT VNAAALLGWG ESAGRLMRGS FADAVIVDSD
PLSSASALEE IRAVVAQGVL VRNDL