Gene CPF_1991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1991 
Symbol 
ID4201856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2229638 
End bp2231716 
Gene Length2079 bp 
Protein Length692 aa 
Translation table11 
GC content31% 
IMG OID638082860 
Productprotein kinase 
Protein accessionYP_696424 
Protein GI110798996 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[S] Function unknown
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase
[COG2815] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAGGTA AAATCTTAGG GAATAGGTAC GAGTTACTTC AGTGCGTAGG CGAAGGTGGC 
ATGTCCTTTG TTTATAAAGC TAGATGTAGA AAATTAAATA GATTCGTAGC TGTTAAAATA
CTAAAAGATG AATTTAAAAA CAATGAAGAA ATTGTTAGAA GATTTAAAAA AGAAGCTACA
GCTATTGCTA ACTTATCCAA TCCAAATGTA GTAAATGTAT TAGATGTTGG AACTCAAGAT
GATATTAATT ATATTGTTAT GGAATACGTT GAAGGTAAAA CTTTAAAAGA TATAATTAAG
GAAAAAGGTG CTTTACCTTA TGAGGTTGCC ATAAGTATAG GAATAAAGGT TGCTAAAGCT
TTAGAATGTG CCCATAAAAG CGGAATTATA CATAGAGATG TTAAACCACA AAATATATTA
GTTACAGAAG AAGGGGTTGT TAAAGTAACT GACTTCGGAA TAGCTAAATC TATGGATTCC
TCTACTATAG CGCACACTAA TAGTGTTATG GGATCTGCAC ATTATTTTTC ACCAGAGCAA
GCTAAAGGAA CTTATACTGA TTATAGAACT GACTTATATT CTTTAGGTAT AGTATTGTAT
GAAATGGTTA CTGGAGTTGT TCCTTTTAAT GGTGATTCAC CTGTTACTGT AGCTGTAAAA
CATATACAGG AGAAAGCAAT ACCTCCTAAG AATATAAATC AAAATATTCC AAATAGTTTA
AATGATTTAA TAATGAAAGC TATGGAAAAG GATCCTGTTA ATAGATATCA AACAGCTAAG
GAAATAATAG GTGACTTAGA AAAGATAAAG AAGGATCCAA ATGTTACAAT ATCATCAAAA
TCTGCAGAAG ATGAAGATCA ATTTACAAGA GTTATGTCTC CAGTTGTTGT TCCAAATACT
GAAACTAATA ACTCAGAACC TGATGAAGAT GATGAGGATG ATGATGAATA TTATGAAGAT
GATGAGGATG AAGATGAAGA AGAAAATAAT ATTCAAACTA AGCCTCAAAA AGCTATTAAT
AAAAAGAAAA AGAAATCACC AATATTAATT ATAATTGCAA CTATTTTAGT TGTAGCATTA
GGAATAACTC TTGGATTCCT TGGTATGAAG AAATTTATGG AAGGTGGAAA AGATGTTAAA
ATACCTAATG TTGTAGGAGA AAAGGTAGAA GATGCTAAGA GTAAGTTAGA AGGCTTAGGA
TTAAAGGTAT TAGAAGTTAC AGAAGAAAGT GATCAAGAAA AAGGAATAGT TTTAAAAGTT
GATCCAAATG TAGATTCTAC TGTTAAGACC GGTAGCGAAG TAAAACTTAC TGTTAGTGGT
GGAGAAGGAC AAATAAAAGT TCCAAATTTA GCAGATATGA GTTCAGATGA GGTTAGGAGA
ACATTAAAAA GTCTAGGATT AGAGCTTGTT GAGGATGAGA AATATAGTGA TAAAGTTCCA
AGTGGAAAAG TTATTTCTCA AAGTCCTAAT GCTAATGAGT TAGTAGATAA AGGTTCTAAA
GTAAAGGTTG TTTTTAGTAA AGGTAAAGAG ATAAAAAAAG TAAGTGTTCC AAGCTTAGTA
AATATGAATA TAGATTCTGT TAAAAATAAT TTAAATGATA TAGGTTTAAA ACTTGGTGAG
GTAAAATATG AATACAGTGA TAGTGTTCAA CAAGGTCAAG TTATTTCTCA AAGTCCTAAT
GCTAATGAAC CTGTAGATGA GGGTTCTAAA GTAAGCATTA CTATAAGTAA AGGTAAAGAA
ATAAAAACTA CAACTTTAAA TATTCCGGAT GTTTCAGGAA AAAGTGTAGA TGAAGCTAAA
TCAATATTAG CTAATGCTGG TGTTGAAGCT AATGCTGTTA AGGGTGAAGC AGCTAAAAGT
GAAGAAGAAG CAGGGAAGGT TTATAGTCAA AGTCAATCAG GATCTATTAC TCTTAAAGAA
GGAGAAAAGA TAACTATAAC AATAAATTAC TATGGTGATT ATATTAAACC AGTACAACCT
TCACAATCAG ATAAGACTAA TCAATCAGAA CAACAAACAC AATCACCAGA TCAATCAACG
GAGAAACCAA AAGACCCAGC GCATCCTGGT AATAATTAA
 
Protein sequence
MIGKILGNRY ELLQCVGEGG MSFVYKARCR KLNRFVAVKI LKDEFKNNEE IVRRFKKEAT 
AIANLSNPNV VNVLDVGTQD DINYIVMEYV EGKTLKDIIK EKGALPYEVA ISIGIKVAKA
LECAHKSGII HRDVKPQNIL VTEEGVVKVT DFGIAKSMDS STIAHTNSVM GSAHYFSPEQ
AKGTYTDYRT DLYSLGIVLY EMVTGVVPFN GDSPVTVAVK HIQEKAIPPK NINQNIPNSL
NDLIMKAMEK DPVNRYQTAK EIIGDLEKIK KDPNVTISSK SAEDEDQFTR VMSPVVVPNT
ETNNSEPDED DEDDDEYYED DEDEDEEENN IQTKPQKAIN KKKKKSPILI IIATILVVAL
GITLGFLGMK KFMEGGKDVK IPNVVGEKVE DAKSKLEGLG LKVLEVTEES DQEKGIVLKV
DPNVDSTVKT GSEVKLTVSG GEGQIKVPNL ADMSSDEVRR TLKSLGLELV EDEKYSDKVP
SGKVISQSPN ANELVDKGSK VKVVFSKGKE IKKVSVPSLV NMNIDSVKNN LNDIGLKLGE
VKYEYSDSVQ QGQVISQSPN ANEPVDEGSK VSITISKGKE IKTTTLNIPD VSGKSVDEAK
SILANAGVEA NAVKGEAAKS EEEAGKVYSQ SQSGSITLKE GEKITITINY YGDYIKPVQP
SQSDKTNQSE QQTQSPDQST EKPKDPAHPG NN