Gene CPR_2056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2056 
Symbol 
ID4205610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2276192 
End bp2277715 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content24% 
IMG OID642566606 
ProductAraC family DNA-binding response regulator 
Protein accessionYP_699365 
Protein GI110802788 
COG category[T] Signal transduction mechanisms 
COG ID[COG4753] Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACAAGG TTATGCTAGC TGATGATGAG AATTTAATTT TACAAGGACT TGAGAATATA 
ATTGAATGGG AAGAACTAGG GTTAGAAATT GTAAATAAGG CAAGTAATGG TCAAGAAGCC
ATAGATAAAT TTAAGGAAAA TCCAGTTGAT ATAGTGTTAA CTGATATTAA TATGCCACAG
GTTACTGGGT TAGAGTTATT AAAGGAATTA AAGAAAATTA ATTCTGATGT TAAGTTTATA
ATATTAAGTG GATATGATGA TTTTTCTTAT GCTAAGAAAG CAATAGAATT AGGTGTTGAA
AATTATATAT TAAAGCCTAT AGATGAAGAA GAATTAGAAA AAACTTTAAA AAATACAATA
AATAAAATAA AACAAGAAAA AGAAGAAAAT AAATCAAGTT TAGGAAAACA TAATATTCTT
ATTAAACTTA TAAAGGGTAA ATTGGATCAA GGTGAAATAG AGGAAAACAA AGAATGCTTT
TATATGAATT TAAATTCAGA AAGATATTCT CTATGTATCA TAAATACTAG AAGTAGATAT
GATAGTGAGG AAATGTTACA TAACATAGTT AATGTAATTA AGGAAAATAC TCAAAATAAC
TTTGAGATAA TATACACCTT AGATGAAGAA CTTATTTTAA TAAATTCTTG GGATGAAGCC
TTAAGTAAAA AAGAGATTAA AAAATATTAT GATAAGTTAA AAGAACAAAT AATAAATGAA
TATGGAATAG ATGTATTTTT AAGTGTTGGA GAACCTATTT GTGATCTTTA TAAAATTAGT
TCAAGTTATA AGGAAGCAAA TAATTTAAAA AAATATGTTC TTACTTTAGG ATATAATAAG
TGTATAACAA CAGAGGATGT TGAAGATATA AATGAGAAGA ATATAAACTT TAGTGAGGTT
TTAGATAAAT TAAATAAAAG AATAATTGCT AAAGATATAG AAGGGGCAGA AAAAATCATT
GAGGAAACTG TTGAGGATAA AAAGTTAAAT CCAAGAAATA TATATGATTT ATCTGTGAAA
ATACTATTTT TATTAGATGG TATTGTGGAA GAATTTAAAG TTGAAAAACA ATATACAGGA
AATAGCTTAG GAGAGGAAAT AGTTGCTCTT TGTAGTGAAG ATACAAGGGA AGATATTAAA
ACTTTATTAT GTAGTGAAAT TAGGGAAGTT ATAGAACTTA TGCACCCAAC AACCATAAAA
TATAGCCCTG TAATTCAACA AATAATAAGT TATGTAAATG AAAACTATTA TGAAGAGGTA
AGTCTTAAGA CTTTAGCTAG AAAATATAAT ATAAATACTT CTTATTTAGG ACAAGTATTT
ACTAAAGAAG TTGGATGTTC ATTTTCTGAG TATTTAAATA AAACTAAAAA TATGAAAGCT
AAAGATCTTA TATTAAATAC AAATATGAAG ATAAATGATA TAGCTAAAAA AGTAGGATAT
TTAGACACTA GTTATTTCTA TAGAAAGTTT AAAAAGTACT ATGGAGTTTG TCCATCAACT
TTAAGAAATA TAAAAAATTA CTAA
 
Protein sequence
MYKVMLADDE NLILQGLENI IEWEELGLEI VNKASNGQEA IDKFKENPVD IVLTDINMPQ 
VTGLELLKEL KKINSDVKFI ILSGYDDFSY AKKAIELGVE NYILKPIDEE ELEKTLKNTI
NKIKQEKEEN KSSLGKHNIL IKLIKGKLDQ GEIEENKECF YMNLNSERYS LCIINTRSRY
DSEEMLHNIV NVIKENTQNN FEIIYTLDEE LILINSWDEA LSKKEIKKYY DKLKEQIINE
YGIDVFLSVG EPICDLYKIS SSYKEANNLK KYVLTLGYNK CITTEDVEDI NEKNINFSEV
LDKLNKRIIA KDIEGAEKII EETVEDKKLN PRNIYDLSVK ILFLLDGIVE EFKVEKQYTG
NSLGEEIVAL CSEDTREDIK TLLCSEIREV IELMHPTTIK YSPVIQQIIS YVNENYYEEV
SLKTLARKYN INTSYLGQVF TKEVGCSFSE YLNKTKNMKA KDLILNTNMK INDIAKKVGY
LDTSYFYRKF KKYYGVCPST LRNIKNY