Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1939 |
Symbol | |
ID | 4071415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2333070 |
End bp | 2334167 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637983951 |
Product | peptidyl-arginine deiminase |
Protein accession | YP_591014 |
Protein GI | 94968966 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2957] Peptidylarginine deiminase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.860811 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.826102 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATGGA AGAATGTCGC GCCCACCCCC GGTTCTCCTG CCGCTCTCGG TTACCGCATG CCTGCCGAGT GGGAGTCCCA TTACGCGACG TGGATCGCCT GGCCGCACAA CCGCAGCGAC TGGCCGGGGA AATTCGAACC CATCGAGTGG GTGATGGCCG AGATCGTCCG CCACCTCGCA CACGACGAGC GCGTGAACAT CCTCGTCAAC CACGAAGCCG GCGAGGAGCG TGCCCGCAAG TTCCTCACCC GCGCGCAGGT TATCGCTGAA GGCTCCGACG GCAGCATCCA ATTCCATCAC TACCCCACCA ATCGCATCTG GACCCGCGAC TATGGCCCAA TCTTCGTGAA AGCCGGCAAG AAACTCGCCG CCATGGATTG GCGTTTCAAC TCCTGGGCGA AATACAACGA CTGGCGGCGC GATGACGCTA TCCCGCAGCA GATCCTCACC GAATTGAATG TAGAGAACTG GCAGCCGACG ACCGAAGTCA ACGGCAAGGC CAAGCGCGTC GTCCTCGAAG GCGGCAGTAT CGACGTCAAC GGCGCCGGCG CCATTCTCAC CACTGAAGAA TGCCTGCTGA GTGATGTTCA GGCGCGCAAT CCGGATCTCT CGCGCAATGA ACTCGAACAG GTATTCGCCG ATTATCTCGG CTGCACCAAA ACTCTATGGC TCAACAAGGG CATTGCCGGC GACGACACCC ACGGCCACGT GGACGACCTC GCCCGCTTCG TGGATAAGCA CACCATCGTC GTTGCCTGCG AGAACCACCG CGACGACGAG AACCATGCGC CGCTGCTGGA AAACTACAAG CGCCTGCAAC ACATGACCGA CCAGGATGGT CGCCCGTTAA AGATTGCCAA ACTGCCAATG CCGGATCCGG TTTGGTTCGA AGGGCGACGT CTCCCGGCCA GCTACGCAAA CTTCTACATC GCCAACAAAC GCGTGCTGGT ACCGGTGTTT AATTCGGTCA ATGATCTTCC CGCGCTAACC ACGATTCAGC AGTTGTTCCC CTCACGCCGC GTTGTTCCGA TCTATTGCGG CGATTTCATC TGGGGTCTCG GCGCCATCCA CTGCATGACG CAGCAGCAGC CGGCATAG
|
Protein sequence | MKWKNVAPTP GSPAALGYRM PAEWESHYAT WIAWPHNRSD WPGKFEPIEW VMAEIVRHLA HDERVNILVN HEAGEERARK FLTRAQVIAE GSDGSIQFHH YPTNRIWTRD YGPIFVKAGK KLAAMDWRFN SWAKYNDWRR DDAIPQQILT ELNVENWQPT TEVNGKAKRV VLEGGSIDVN GAGAILTTEE CLLSDVQARN PDLSRNELEQ VFADYLGCTK TLWLNKGIAG DDTHGHVDDL ARFVDKHTIV VACENHRDDE NHAPLLENYK RLQHMTDQDG RPLKIAKLPM PDPVWFEGRR LPASYANFYI ANKRVLVPVF NSVNDLPALT TIQQLFPSRR VVPIYCGDFI WGLGAIHCMT QQQPA
|
| |