Gene CPF_0161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0161 
SymbolarcA 
ID4202802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp191622 
End bp192863 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content31% 
IMG OID638081042 
Productarginine deiminase 
Protein accessionYP_694625 
Protein GI110800897 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2235] Arginine deiminase 
TIGRFAM ID[TIGR01078] arginine deiminase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGATG ACAGAGCATT AAATGTTACT TCTGAAATAG GAAGATTAAA AACAGTTCTA 
TTACATAGAC CTGGAGAAGA AATTGAAAAC TTAACACCAG ATCTATTAGA TAGACTACTA
TTTGATGACA TTCCATACTT AAAAGTTGCA AGAGAAGAAC ATGACGCTTT TGCACAAACT
TTAAGAGAAG CAGGAGTAGA AGTGCTTTAT TTAGAAGTTC TTGCTGCTGA GGCAATAGAA
ACTAGTGATG AGGTTAAACA ACAATTTATA AGTGAATTTA TTGATGAGGC TGGAGTTGAA
AGCGAAAGAT TAAAAGAAGC ATTAATAGAA TACTTCAACT CATTTAGTGA TAATAAAGCA
ATGGTTGATA AGATGATGGC TGGGGTAAGA AAGGAAGAGC TTAAAGATTA CCACAGAGAA
TCATTATATG ACCAAGTAAA TAATGTATAT CCATTTGTAT GTGATCCAAT GCCAAATCTT
TATTTTACAA GAGATCCATT TGCAACAATT GGACATGGTA TTACATTAAA CCACATGAGA
ACAGATACAA GAAATAGAGA AACAATATTT GCTAAATACA TATTTAGACA TCATCCAAGA
TTTGAAGGAA AGGATATTCC ATTCTGGTTT AATAGAAATG ATAAAACTTC TCTTGAAGGT
GGAGATGAAT TAATACTTTC AAAAGAAATT TTAGCAGTTG GTATATCACA AAGAACTGAT
TCAGCATCAG TTGAAAAATT AGCGAAAAAG TTACTTTACT ATCCAGATAC AAGTTTTAAA
ACTGTATTAG CATTTAAAAT ACCAGTATCA AGAGCATTTA TGCATTTAGA TACAGTATTT
ACTCAAGTAG ATTATGATAA ATTTACAGTT CACCCTGGTA TAGTAGGACC TTTAGAAGTT
TATGCATTAA CTAAAGATCC AGAAAATGAT GGACAACTAC TTGTAACAGA AGAAGTTGAT
ACTTTAGAAA ATATATTAAA GAAATATCTA GATAGAGATA TTAAATTAAT AAAATGTGGT
GGCGGAGATG AAATAATAGC TGCTAGAGAA CAATGGAATG ATGGTTCAAA TACACTTGCT
ATTGCTCCTG GAGAAGTTGT AGTTTACTCA AGAAACTATG TAACTAATGA AATATTAGAA
AAAGAAGGAA TCAAATTACA CGTTATACCT TCATCTGAAT TATCAAGAGG TAGAGGGGGC
CCTAGATGTA TGTCAATGCC TCTAATAAGA GAAGATTTAT AA
 
Protein sequence
MRDDRALNVT SEIGRLKTVL LHRPGEEIEN LTPDLLDRLL FDDIPYLKVA REEHDAFAQT 
LREAGVEVLY LEVLAAEAIE TSDEVKQQFI SEFIDEAGVE SERLKEALIE YFNSFSDNKA
MVDKMMAGVR KEELKDYHRE SLYDQVNNVY PFVCDPMPNL YFTRDPFATI GHGITLNHMR
TDTRNRETIF AKYIFRHHPR FEGKDIPFWF NRNDKTSLEG GDELILSKEI LAVGISQRTD
SASVEKLAKK LLYYPDTSFK TVLAFKIPVS RAFMHLDTVF TQVDYDKFTV HPGIVGPLEV
YALTKDPEND GQLLVTEEVD TLENILKKYL DRDIKLIKCG GGDEIIAARE QWNDGSNTLA
IAPGEVVVYS RNYVTNEILE KEGIKLHVIP SSELSRGRGG PRCMSMPLIR EDL