Gene Arth_0759 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0759 
Symbol 
ID4446733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp820479 
End bp821489 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content69% 
IMG OID639688565 
Productpeptidyl-arginine deiminase 
Protein accessionYP_830257 
Protein GI116669324 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2957] Peptidylarginine deiminase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.451964 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGCCG AGACGGCTCC GCAGGAGCGC ATCTGGATGG CGTTCCCTAC CGGCGGCTAC 
ACCCTCGGCG ACACCGCCGA AGAGGCACAC GCCGCGCGGA CAGTCTGGGC GGCCGTCGCC
AATGCCGCCG TCGAATTCGA GCCGGTCACC ATGGTGGTCA CCCCCGACGA CGTCCTGACC
GCCGCGCGCT ACCTGGATCC CGCCGTCGAG GTGCTCACCG CGGACTTGAA CGATGCGTGG
ATGCGGGACA TCGGCCCCAC CTTCGTCCTC GACGGCGACG GGCGTCTCGG CGCCGTCGAC
TGGGTGTTCA ACGGCTGGGG CGGGCAGGAA TGGGCCCGCT GGGACAAAGA CTCGCTGATC
GGGGCGGAAG TCGCCGGCCG GTCCGGCGCC CGGCACATAG CCTCCGCGCT CGTCAATGAA
GGCGGCGGCA TCCAGGTGGA CGGCGAGGGA ACCGTGCTGG TGACAGAGAC GGTGCAGCTG
GACCCGGGAC GCAACCCCGG ACTGTCCAAG GCCGAGGTGG AAGCAGAGCT CGCCCGGACC
ATCGGCGCCA CCCATGTCAT CTGGCTTCCG CGCGGCCTGA CCCGGGACTC AGAGCGGTTC
GGCACCCGGG GCCACGTGGA CATCGTGGCC GCCATCCCGT CCCCCGGCAC ACTGCTGGTG
CATTCCCAGC AGGACCCGGA ACATCCCGAT TTCGAGGTCA GCCGCGAAAT CATCAATTTC
CTCTCGGCCA CGCGGGACGC AGCCGGCCGA GAGTGGAACA TCATCGAAGT CCCCGCTCCC
GTGGCACTCA GTGACCCGGA GGGCTTCGTG GACTACAGCT ACATCAACCA CCTCGTGGTC
AACGGCGGTG TGATTGCCTG CACCTTCGGC GACCCCAACG ACGAAAAGGC CCTCCGGATC
CTCGCCGATG CCTACCCCGG CCGCCGCGTC GTGGGCATCG ACGCCCGCGA ACTGTTCGCC
AGGGGCGGCG GCATCCACTG CATCACCCAG CAGCAACCCG CTGCCTCCTA G
 
Protein sequence
MPAETAPQER IWMAFPTGGY TLGDTAEEAH AARTVWAAVA NAAVEFEPVT MVVTPDDVLT 
AARYLDPAVE VLTADLNDAW MRDIGPTFVL DGDGRLGAVD WVFNGWGGQE WARWDKDSLI
GAEVAGRSGA RHIASALVNE GGGIQVDGEG TVLVTETVQL DPGRNPGLSK AEVEAELART
IGATHVIWLP RGLTRDSERF GTRGHVDIVA AIPSPGTLLV HSQQDPEHPD FEVSREIINF
LSATRDAAGR EWNIIEVPAP VALSDPEGFV DYSYINHLVV NGGVIACTFG DPNDEKALRI
LADAYPGRRV VGIDARELFA RGGGIHCITQ QQPAAS