Gene Acid345_1939 details


Gene Information

Locus tag: Acid345_1939
Symbol:
ID: 4071415
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2333070
End bp: 2334167
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 60%
IMG OID: 637983951
Product: peptidyl-arginine deiminase
Protein accession: YP_591014
Protein GI: 94968966
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2957] Peptidylarginine deiminase and related enzymes
TIGRFAM ID:
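
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the stated gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2333070, 2334167)
print(length)                     # 1098
print(protein_length_aa(length))  # 365
```

Both values agree with the Gene Length and Protein Length fields in the record.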


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.860811
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.826102
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATGGA AGAATGTCGC GCCCACCCCC GGTTCTCCTG CCGCTCTCGG TTACCGCATG 
CCTGCCGAGT GGGAGTCCCA TTACGCGACG TGGATCGCCT GGCCGCACAA CCGCAGCGAC
TGGCCGGGGA AATTCGAACC CATCGAGTGG GTGATGGCCG AGATCGTCCG CCACCTCGCA
CACGACGAGC GCGTGAACAT CCTCGTCAAC CACGAAGCCG GCGAGGAGCG TGCCCGCAAG
TTCCTCACCC GCGCGCAGGT TATCGCTGAA GGCTCCGACG GCAGCATCCA ATTCCATCAC
TACCCCACCA ATCGCATCTG GACCCGCGAC TATGGCCCAA TCTTCGTGAA AGCCGGCAAG
AAACTCGCCG CCATGGATTG GCGTTTCAAC TCCTGGGCGA AATACAACGA CTGGCGGCGC
GATGACGCTA TCCCGCAGCA GATCCTCACC GAATTGAATG TAGAGAACTG GCAGCCGACG
ACCGAAGTCA ACGGCAAGGC CAAGCGCGTC GTCCTCGAAG GCGGCAGTAT CGACGTCAAC
GGCGCCGGCG CCATTCTCAC CACTGAAGAA TGCCTGCTGA GTGATGTTCA GGCGCGCAAT
CCGGATCTCT CGCGCAATGA ACTCGAACAG GTATTCGCCG ATTATCTCGG CTGCACCAAA
ACTCTATGGC TCAACAAGGG CATTGCCGGC GACGACACCC ACGGCCACGT GGACGACCTC
GCCCGCTTCG TGGATAAGCA CACCATCGTC GTTGCCTGCG AGAACCACCG CGACGACGAG
AACCATGCGC CGCTGCTGGA AAACTACAAG CGCCTGCAAC ACATGACCGA CCAGGATGGT
CGCCCGTTAA AGATTGCCAA ACTGCCAATG CCGGATCCGG TTTGGTTCGA AGGGCGACGT
CTCCCGGCCA GCTACGCAAA CTTCTACATC GCCAACAAAC GCGTGCTGGT ACCGGTGTTT
AATTCGGTCA ATGATCTTCC CGCGCTAACC ACGATTCAGC AGTTGTTCCC CTCACGCCGC
GTTGTTCCGA TCTATTGCGG CGATTTCATC TGGGGTCTCG GCGCCATCCA CTGCATGACG
CAGCAGCAGC CGGCATAG
 
Protein sequence
MKWKNVAPTP GSPAALGYRM PAEWESHYAT WIAWPHNRSD WPGKFEPIEW VMAEIVRHLA 
HDERVNILVN HEAGEERARK FLTRAQVIAE GSDGSIQFHH YPTNRIWTRD YGPIFVKAGK
KLAAMDWRFN SWAKYNDWRR DDAIPQQILT ELNVENWQPT TEVNGKAKRV VLEGGSIDVN
GAGAILTTEE CLLSDVQARN PDLSRNELEQ VFADYLGCTK TLWLNKGIAG DDTHGHVDDL
ARFVDKHTIV VACENHRDDE NHAPLLENYK RLQHMTDQDG RPLKIAKLPM PDPVWFEGRR
LPASYANFYI ANKRVLVPVF NSVNDLPALT TIQQLFPSRR VVPIYCGDFI WGLGAIHCMT
QQQPA
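
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It assumes the sequences are given in blocks of ten characters separated by whitespace, as above, and it relies on the fact that translation table 11 assigns the same amino acid to every codon as the standard code (the two differ only in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order; these are shared by the
# standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGAAATGGA AGAATGTCGC GCCCACCCCC"))  # MKWKNVAPTP
print(translate("CAGCAGCAGC CGGCATAG"))               # QQQPA
print(gc_fraction("ATGAAATGGA"))                      # 0.3
```

The two translated fragments match the first ten and last five residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.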