Gene CPF_0001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0001 
SymboldnaA 
ID4202657 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp411 
End bp1784 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content30% 
IMG OID638080873 
Productchromosomal replication initiation protein 
Protein accessionYP_694476 
Protein GI110799791 
COG category[L] Replication, recombination and repair 
COG ID[COG0593] ATPase involved in DNA replication initiation 
TIGRFAM ID[TIGR00362] chromosomal replication initiator protein DnaA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000458864 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGCCC AACTAAATAA TCTCTGGGAA CAAGCATTGA ACATAATAAA AGGAGAAATT 
TCTGAAATAA GCTTTAACAC TTGGATTAAA AGCTGTACTC CTATATCAAT TTCAGATAAT
ATATTAAAAC TTTCTGTTCC TAATGAATTC ACTAAAGGGA TTTTAGATAC AAGATATAAA
GACTTATTAA TTCAAGCTTT AAAGATAGTT ACATCTAGAA AGTTTAAAAT AGAGTTCTAT
CTTGAGTCTG ATTTAGAAGA AGAAAAAGAA AATGAGGAAA AGCAGAAAGA AGAGAAGAAA
GAGAATACAA ATGATGTCGA TGGATCTATA GTTGTAAGTG ATGAAATGTC AGCTACATTA
AATCCTAAAT ATACTTTCCA ATCCTTTGTA ATAGGTAATA GTAACCGCTT TGCTCATGCA
GCATCTTTAG CTGTTGCAGA ATCACCAGCA AAAGCTTATA ACCCATTATT TATCTATGGA
GGAGTAGGAC TTGGAAAAAC TCACTTAATG CATGCAATAG GACATTATAT TCTTCAAGAA
AATCCTAAAG CAAAAGTAGT TTATGTATCA TCTGAAAAAT TCACTAATGA ACTTATTAAT
GCTATTAAAG ATGATAAGAA TGAAGAGTTT AGAAACAAAT ATAGAAAAGT CGATGTTTTA
TTAATAGATG ATATTCAATT CATAGCAGGT AAAGAACGTA CTCAAGAAGA GTTCTTCCAT
ACCTTTAATG CTCTACATGA AGAGAATAAA CAAATTATAC TATCATCAGA TAGACCGCCT
AAGGAAATTC CTACATTAGA AGACAGGTTA AGATCTAGAT TTGAATGGGG TTTAATAGCA
GATATTCAAC CACCTGATTT CGAAACTAGA ATGGCAATAC TAAAGAAAAA AGCTGATGTT
GAAGGCTTAA ATGTACCAAA TGAAGTTATG GTATATATAG CTACTAAAAT CAAATCAAAT
ATAAGAGAAT TAGAAGGCGC ACTTATAAGA ATAATAGCTT ATTCCTCTCT AACTAATAGG
GATGTAAGCG TTGATTTAGC TTCAGAAGCA CTTAAAGATA TAATATCTAA TAAAGAAAGT
GCTCCAGTTA CAGTAAAAAC TATTCAAGAA TCCGTAGCAA ATTATTATAA TTTAAGAATA
GAGGATTTAA AATCTCAAAG AAGAACTAGA AATATAGCTT ATCCTCGTCA AATTGCTATG
TATTTAAGTA GAAAACTTAC AGATATGTCT TTACCTAAAA TAGGTGAAGA ATTTGGTGGA
AGAGATCATA CTACAGTAAT TCATGCTTAT GAAAAAATTT CTGAAAATTT AAAGACTGAT
GAAGGTCTAC AAAGCATGAT TAATGACATT ACTAAAAAGC TTACTCAAAA GTAA
 
Protein sequence
MDAQLNNLWE QALNIIKGEI SEISFNTWIK SCTPISISDN ILKLSVPNEF TKGILDTRYK 
DLLIQALKIV TSRKFKIEFY LESDLEEEKE NEEKQKEEKK ENTNDVDGSI VVSDEMSATL
NPKYTFQSFV IGNSNRFAHA ASLAVAESPA KAYNPLFIYG GVGLGKTHLM HAIGHYILQE
NPKAKVVYVS SEKFTNELIN AIKDDKNEEF RNKYRKVDVL LIDDIQFIAG KERTQEEFFH
TFNALHEENK QIILSSDRPP KEIPTLEDRL RSRFEWGLIA DIQPPDFETR MAILKKKADV
EGLNVPNEVM VYIATKIKSN IRELEGALIR IIAYSSLTNR DVSVDLASEA LKDIISNKES
APVTVKTIQE SVANYYNLRI EDLKSQRRTR NIAYPRQIAM YLSRKLTDMS LPKIGEEFGG
RDHTTVIHAY EKISENLKTD EGLQSMINDI TKKLTQK