Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0759 |
Symbol | |
ID | 4446733 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 820479 |
End bp | 821489 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639688565 |
Product | peptidyl-arginine deiminase |
Protein accession | YP_830257 |
Protein GI | 116669324 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2957] Peptidylarginine deiminase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.451964 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGCCG AGACGGCTCC GCAGGAGCGC ATCTGGATGG CGTTCCCTAC CGGCGGCTAC ACCCTCGGCG ACACCGCCGA AGAGGCACAC GCCGCGCGGA CAGTCTGGGC GGCCGTCGCC AATGCCGCCG TCGAATTCGA GCCGGTCACC ATGGTGGTCA CCCCCGACGA CGTCCTGACC GCCGCGCGCT ACCTGGATCC CGCCGTCGAG GTGCTCACCG CGGACTTGAA CGATGCGTGG ATGCGGGACA TCGGCCCCAC CTTCGTCCTC GACGGCGACG GGCGTCTCGG CGCCGTCGAC TGGGTGTTCA ACGGCTGGGG CGGGCAGGAA TGGGCCCGCT GGGACAAAGA CTCGCTGATC GGGGCGGAAG TCGCCGGCCG GTCCGGCGCC CGGCACATAG CCTCCGCGCT CGTCAATGAA GGCGGCGGCA TCCAGGTGGA CGGCGAGGGA ACCGTGCTGG TGACAGAGAC GGTGCAGCTG GACCCGGGAC GCAACCCCGG ACTGTCCAAG GCCGAGGTGG AAGCAGAGCT CGCCCGGACC ATCGGCGCCA CCCATGTCAT CTGGCTTCCG CGCGGCCTGA CCCGGGACTC AGAGCGGTTC GGCACCCGGG GCCACGTGGA CATCGTGGCC GCCATCCCGT CCCCCGGCAC ACTGCTGGTG CATTCCCAGC AGGACCCGGA ACATCCCGAT TTCGAGGTCA GCCGCGAAAT CATCAATTTC CTCTCGGCCA CGCGGGACGC AGCCGGCCGA GAGTGGAACA TCATCGAAGT CCCCGCTCCC GTGGCACTCA GTGACCCGGA GGGCTTCGTG GACTACAGCT ACATCAACCA CCTCGTGGTC AACGGCGGTG TGATTGCCTG CACCTTCGGC GACCCCAACG ACGAAAAGGC CCTCCGGATC CTCGCCGATG CCTACCCCGG CCGCCGCGTC GTGGGCATCG ACGCCCGCGA ACTGTTCGCC AGGGGCGGCG GCATCCACTG CATCACCCAG CAGCAACCCG CTGCCTCCTA G
|
Protein sequence | MPAETAPQER IWMAFPTGGY TLGDTAEEAH AARTVWAAVA NAAVEFEPVT MVVTPDDVLT AARYLDPAVE VLTADLNDAW MRDIGPTFVL DGDGRLGAVD WVFNGWGGQE WARWDKDSLI GAEVAGRSGA RHIASALVNE GGGIQVDGEG TVLVTETVQL DPGRNPGLSK AEVEAELART IGATHVIWLP RGLTRDSERF GTRGHVDIVA AIPSPGTLLV HSQQDPEHPD FEVSREIINF LSATRDAAGR EWNIIEVPAP VALSDPEGFV DYSYINHLVV NGGVIACTFG DPNDEKALRI LADAYPGRRV VGIDARELFA RGGGIHCITQ QQPAAS
|
| |