Gene CPF_2212 details

Gene Information

Locus tag: CPF_2212
Symbol:
ID: 4202735
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 2457892
End bp: 2459298
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 27%
IMG OID: 638083077
Product: GntR family transcriptional regulator
Protein accession: YP_696636
Protein GI: 110798608
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
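
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(2457892, 2459298)
    print(length, protein_length_aa(length))  # prints: 1407 468
```

Under those assumptions the record is internally consistent: 1407 bp is 469 codons, one of which is the stop codon.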


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAAAT TTAGTGTGGT TTTTAAAGAG GGTTTATGTA AATACTTAAT AATATACGAT 
AATATTAAAA GTTTAATTGA AAATGGGAAA ATATCAGAAG GAGAAAAGTT ACCAACTATA
AGAGAACTAG CAGACTTTTT AGAAGTTAAT AAGGTAACTG TTATTAATGC TTATAAAAAG
CTTGTACAAG AAGGATATGC ATATCAAGCT CAAGGAAGTG GAACTTATGC TAAAAATAAG
GATGTAGGAA AAAGTTTTAA GCATGATTAT AATGATTTAT TTAGAAAAAT AGCCTCTGGA
GATTTGAAGA ATTGGATAGA CTTTACTGGA GAAACTACTA GTGCAAACTT CTTCCAAGTA
GAAAAACTTA AGAAAGTTTT AGATAATGTA TTAGTTAGAG ATGGTGTAGA GGCATTAATG
TATAGTGACC CTTTAGGATA TTTAGAATTA AGAAGAAGTA TAAATGAAGA GTTCTGGAAT
GGAAAAAATA ATTTAGATAA TATACTTATA ATTTCTGGAG CACAACAAGG TATTGATATT
GTAGGTAAGT CCTTAGTAAA TATAAATGAT AATATTGTAA TAGAAAGGCC AACTTATGGA
GGAGCACTTT TAGTATTTAA GCTTAGAAGA GCAAATATAT TAGAAATTCC AATGGAGAAG
GATGGTCCTA ATATTGAAAA GTTTGAAAAT TTATTAAAGA GAAATAAAAT AAAGTGTTTT
TATACTATGA GCTATTTTCA GAACCCTACA GGTATAAGTT GTTCTTTAGA AAAAAAGAAA
AGAATAATAG AGCTAGCACA TAAATATGAT TTTTACATAC TAGAAGATGA TTATCTTTCA
GAATTAGTTT ATTCTAATGA TTTAGAGTAT ATACCATATA GGAGTTTAGA TTCTGAGAGA
GTTATATATA TAAAAAGTTT CTCTAAAATA TTTCTACCAG GTATAAGAAT GGGATATTTA
ATAGCACCAG ATAAATTTAA AGAGGAATTT CAAGCTTTAA AATTTAACAC TGATATAGCT
ACCTCAAGTT TAATGCAAAG AGCTCTTCAG GATTACATTG TAAAGGGATA TTGGAAAGAG
CATATAGAAA GATTAAATGA GGAATATTCA AAGAGATATA ATTTTATAAA AGAACTTATA
GATAATAAAT TAGGAGATAT GGTTTCTTAT AGAGAGCCTA AGGGCGGACT TAATTTATTC
TTAAATATAA ATAAGAACAT AGGCATAACT TCTAAAAAGT TATTTTATGA ACTGAAAGAT
AGACAAACTA TAATTACTCC AGGAAGCATA TTTTTTAAAA ATCCTAATGA TGGAGATAAA
AGCTTTAGAA TTGGATTTTC CCAAATAGAT TATAGTAAGA TAGAAAAAGG AATAGATAAT
ATATACGATG TATTAAAAGG TAGGTAG
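
The GC content reported for this gene can be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as shown here, in space-separated blocks of ten bases, and the helper name `gc_content` is illustrative.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First two blocks of the gene sequence only; pass the full
    # sequence to compare against the GC content field of the record.
    print(gc_content("ATGAATAAAT TTAGTGTGGT"))
```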
 
Protein sequence
MNKFSVVFKE GLCKYLIIYD NIKSLIENGK ISEGEKLPTI RELADFLEVN KVTVINAYKK 
LVQEGYAYQA QGSGTYAKNK DVGKSFKHDY NDLFRKIASG DLKNWIDFTG ETTSANFFQV
EKLKKVLDNV LVRDGVEALM YSDPLGYLEL RRSINEEFWN GKNNLDNILI ISGAQQGIDI
VGKSLVNIND NIVIERPTYG GALLVFKLRR ANILEIPMEK DGPNIEKFEN LLKRNKIKCF
YTMSYFQNPT GISCSLEKKK RIIELAHKYD FYILEDDYLS ELVYSNDLEY IPYRSLDSER
VIYIKSFSKI FLPGIRMGYL IAPDKFKEEF QALKFNTDIA TSSLMQRALQ DYIVKGYWKE
HIERLNEEYS KRYNFIKELI DNKLGDMVSY REPKGGLNLF LNINKNIGIT SKKLFYELKD
RQTIITPGSI FFKNPNDGDK SFRIGFSQID YSKIEKGIDN IYDVLKGR
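
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the usual TCAG-ordered amino acid string; translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it allows, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace.

    Unknown codons become 'X'; a trailing partial codon is dropped.
    """
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE.get(bases[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("ATGAATAAAT TTAGTGTGGT"))         # prints: MNKFSV
    print(translate("ATATACGATG TATTAAAAGG TAGGTAG"))  # prints: IYDVLKGR
```

The two fragments are the first and last lines of the gene sequence, and their translations match the start (MNKFSV...) and end (...IYDVLKGR) of the protein sequence.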