Gene CPF_2344 details

Gene Information

Locus tag: CPF_2344
Symbol:
ID: 4201956
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 2607576
End bp: 2609099
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 25%
IMG OID: 638083209
Product: AraC family DNA-binding response regulator
Protein accession: YP_696767
Protein GI: 110800482
COG category: [T] Signal transduction mechanisms
COG ID: [COG4753] Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain
TIGRFAM ID:
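
The length fields in this record agree with its coordinates. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the gene length counts the stop codon while the protein length does not. The function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2607576, 2609099)
print(length)                     # 1524
print(protein_length_aa(length))  # 507
```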


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.627636
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTACAAGG TTATGCTAGC TGATGATGAA AATTTAATTT TACAAGGACT TGAGAATATA 
ATTGAATGGG AAGAACTAGG GTTAGAAATT GTAAATAAGG CAAGTAATGG TCAAGAAGCC
ATAGATAAAT TTAAGGAAAA TCCAGTTGAT ATAGTGGTAA CTGATATTAA TATGCCACAG
GTTACTGGAT TGGAGTTATT AAAGGAATTA AAGAAAATTA ATTCTGATGT TAAGTTTATA
ATATTAAGTG GATATGATGA TTTTTCTTAT GCTAAGAAAG CAATAGAATT AGGTGTTGAA
AACTATATAT TAAAGCCTAT AGATGAAGAA GAATTAGAAA AAACTTTAAA AAATACAATA
AATAAAATAA AACAAGAAAA AGAAGAAAAT AAATCAAGTT TAGGAAAACA TAATATTCTT
ATTAAACTTA TAAAGGGTAA ATTGGCTCAA GGTGAAATAG AGGAAAACAA AGAAGGATTT
TATATGAATT TAAATTCAGA AAGATATTCT CTATGTATTA TAAACACTAG AAGTAGATAT
GATAGTGAGG AAATGTTACA TAATATAGTT AATGTAATAA AGGAAAACAC TCAAAATAAC
TTTGAGATAA TATATACTTT AGATGAAGAA CTTATTTTAA TAAATTCTTG GGATGAAGCC
TTAAGTAAAA AAGAGATTAA AAAATATTAT GATAAGTTAA AAGAACAAAT AATAAATGAA
TATGGAATAG ATGTATTTTT AAGTGTTGGA GAACCTGTTT GTGATCTTTA TAAAATTAGT
TCAAGTTATA AGGAAGCAAA TAATTTAAAA AAATATGTTC TTACCTTAGG ATATAATAAG
TGTATAACAA CAGAGGATGT TAAGGATATA AATGAGAAGA ATATAAACTT TAGTGAGGTT
TTAGATAAAT TAAATAAAAG AATAATTGCT AAAGATATAG AAGGGGCAGA AAAAATCATT
GAGGAAACTG TTGAGGATAA AAAGTTAAAT CCAAGAAATA TATATGATTT ATCTGTAAAA
ATACTATTTT TATTAGATGG TATTGTGGAA GAATTTAAAG TTGAAAAACA ATATACAGGA
AATAGCTTAG GAGAGGAAAT AGTTGCTCTT TGTAGTGAAG ATACAAGGGA AGATATTAAA
ACCTTATTAT GTAGTGAAAT TAGGGAAGTT ATAGAACTTA TGCACCCAAC AACTATAAAA
TATAGCCCTG TAATTCAACA AATTATAAGT TATGTAAATG AAAACTATTA TGAAGAGGTA
AGTCTTAAGA CTTTAGCTCA AAAATATCAT ATAAATACTT CTTATTTAGG TCAAGTATTT
ACTAAAGAAG TTGGATGTTC ATTTTCTGAG TATTTAAATA AAACTAAAAA TATGAAAGCT
AAAGAGCTTA TATTAAATAC AAATATGAAG ATAAATGATA TAGCTAAAAA GGTAGGGTAT
TTAGATACTA GTTATTTCTA TAGAAAGTTT AAAAAGTACT ATGGAGTTTG TCCATCAACC
TTAAGAAATA TAAAAAATTA CTAA
 
Protein sequence
MYKVMLADDE NLILQGLENI IEWEELGLEI VNKASNGQEA IDKFKENPVD IVVTDINMPQ 
VTGLELLKEL KKINSDVKFI ILSGYDDFSY AKKAIELGVE NYILKPIDEE ELEKTLKNTI
NKIKQEKEEN KSSLGKHNIL IKLIKGKLAQ GEIEENKEGF YMNLNSERYS LCIINTRSRY
DSEEMLHNIV NVIKENTQNN FEIIYTLDEE LILINSWDEA LSKKEIKKYY DKLKEQIINE
YGIDVFLSVG EPVCDLYKIS SSYKEANNLK KYVLTLGYNK CITTEDVKDI NEKNINFSEV
LDKLNKRIIA KDIEGAEKII EETVEDKKLN PRNIYDLSVK ILFLLDGIVE EFKVEKQYTG
NSLGEEIVAL CSEDTREDIK TLLCSEIREV IELMHPTTIK YSPVIQQIIS YVNENYYEEV
SLKTLAQKYH INTSYLGQVF TKEVGCSFSE YLNKTKNMKA KELILNTNMK INDIAKKVGY
LDTSYFYRKF KKYYGVCPST LRNIKNY
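
Both sequences above are printed in space-separated blocks of 10 characters, so they need cleaning before any computation. The sketch below shows one way to strip the block formatting and estimate GC content. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGTACAAGG TTATGCTAGC TGATGATGAA AATTTAATTT TACAAGGACT TGAGAATATA"
seq = clean_sequence(first_row)
print(len(seq))
print(f"{gc_fraction(seq):.1%}")
```

Passing the full gene sequence through the same two functions would give a value to compare against the GC content field in the record.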