Gene CPF_1768 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1768 
Symbol 
ID4202444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1993044 
End bp1994261 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content28% 
IMG OID638082640 
ProductDEAD-box ATP dependent DNA helicase 
Protein accessionYP_696204 
Protein GI110800080 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00592847 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAAAT TTTTAAAGTT AGGATTAAGT GAAGAGGTTT TAAAATCTTT AGTAGGATTA 
GGAATAGAGG AGCCAACAGA TATTCAGGAA AAAGCTATAC CTGAAATTTT AAAAGGTAAA
AATGTAATAG GAAAAGCTGA AACAGGAACA GGAAAAACTT TAGCATATTT ACTTCCTATA
ATAGAAAAGA TTGATGATTC AAAAAATGAG ATGCAAGCTA TTATTCTTTC ACCAACTCAT
GAATTAGGAG TTCAGATAAA CAATGTTTTA AATGATCTTA AAAGAGGACT TGGAAAAAAG
ATAACTTCAA CAACTTTAGT TGGAAGCGGA AATATAAAGA GACAAATGGA GAAGCTTAAA
AATAAGCCTC ATATACTTGT TGGAACTACA GGGAGAATTT TAGAGCTTAT AAATAAGAAA
AAAATAACAA CTAATACTAT AAAAACAATA GTTATTGACG AAGGTGATAA ACTATTAGAT
TTTATAAACA TAAAAGATGT GAAAAGTGTT GTTAAATCTT GTCCAAGGGA TACTCAAAAG
CTTATATTCT CAGCTACAAT GAATGAAAAA GCCTTAGAAA CTGCAGATGA ATTAATAGGA
ACTAGTGAGC TTATTCAAGC AAAAGCTGCA AACAAGGTTA ATGAAAATAT AGAACATGGA
TATTTTCAAG TAGAATTAAG AGATAAAATA GACTTTTTAA GAAAGCTTAT ACATGCTATA
GGGGATGAGA AAAAAATAAT AGTTTTTATA AATAATAGCT ATAATGTACA TAATGTAATT
CAAAAGTTAA AATATAATAA AATAGAGGCA GTTTCTCTTC ATGGAAGTGA TAATAAAATG
GAGAGAAAGA AGGCACTTCA AGATTTTAGA AGTGGAAAAG CCAAGGTTTT AATAACTTCA
GATGTATCAG CTAGAGGGCT AGATATTAAG GGAGCTACAC ATATAGTTAA CTTAGATATT
CCTATGAATT CTCAAAACTA CTTACATAGG GTAGGACGTG TAGGAAGAGC TGGAGAAAAA
GGCTTTGCAT ACTCTTTAGC AGATTATAAA GAAGAAAAAA TAATAGTAAA ATGTGAAAGA
CAATTAAAAA TAAAAATTCC TAGAGTTTAT TTATATGAAG GAAAGATACA TGAAACAGAG
GTAAAAAAGG TTCCTTCAAA TAAAAACAAT AGTAAAAAGA AAAGCAATTT ACCTAAAAAG
GTATATAAAA AAAGATAA
 
Protein sequence
MDKFLKLGLS EEVLKSLVGL GIEEPTDIQE KAIPEILKGK NVIGKAETGT GKTLAYLLPI 
IEKIDDSKNE MQAIILSPTH ELGVQINNVL NDLKRGLGKK ITSTTLVGSG NIKRQMEKLK
NKPHILVGTT GRILELINKK KITTNTIKTI VIDEGDKLLD FINIKDVKSV VKSCPRDTQK
LIFSATMNEK ALETADELIG TSELIQAKAA NKVNENIEHG YFQVELRDKI DFLRKLIHAI
GDEKKIIVFI NNSYNVHNVI QKLKYNKIEA VSLHGSDNKM ERKKALQDFR SGKAKVLITS
DVSARGLDIK GATHIVNLDI PMNSQNYLHR VGRVGRAGEK GFAYSLADYK EEKIIVKCER
QLKIKIPRVY LYEGKIHETE VKKVPSNKNN SKKKSNLPKK VYKKR