Gene CPR_1924 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1924 
Symbol 
ID4205370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2127406 
End bp2128812 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content27% 
IMG OID642566474 
ProductGntR family transcriptional regulator 
Protein accessionYP_699234 
Protein GI110802822 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAT TTAGTGTGGT TTTTAAAGAG GATTTATGTA AATACTTAAT AATATACGAT 
AATATTAAAA GTTTAATTGA AAATGGGGAA ATATCAGAAG GAGAAAAGTT ACCAACTATA
AGAGAGTTAG CAGACTTTTT AGAAGTTAAT AAGGTAACTG TCATTAATGC TTATAAGAAA
CTTGTACAAG AAGGATATGC ATATCAATCC CAAGGAAGTG GAACTTATGC TAAAAATAAG
GAGGTAGGAA AAAGTTTTAA GCATGATTAT AATGATTTAT TTAGAAAAAT AGCCTCTGGA
GATTTAAAAA ATTGGATAGA CTTTACTGGG GAAACTACTA GTGCAAACTT CTTCCAAGTA
GAAAAACTTA AGAAAGTTTT AGACAATGTA TTAGTTAGAG ATGGTGTAGA GGCACTAATG
TATAGTGATC CTTTAGGCTA TTTAGAACTA AGAAGAAGTA TAAATGAAGA GTTTTGGAAT
GGAAAAAATA ATTTAGATAA TATACTTATA ATTTCTGGAG CTCAGCAAGG TATTGATATT
GTAGGTAAGT CCTTAGTTAA TATAAATGAT AATATTGTAA TAGAAAGGCC AACTTATGGA
GGAGCACTTT TAGTATTTAA GCTTAGAAGA GCAAATATAT TAGAAATTCC AATGGAGAAG
GATGGTCCTA ATATTGAAAA GTTTGAAAAT TTATTAAAGA GAAATAAAAT AAAGTGTTTT
TATACTATGA GTTACTTTCA GAACCCTACA GGGATAAGTT GTTCTTTAGA AAAAAAGAAA
AAAATAATAG AGCTTGCACA TAAATATGAT TTTTACATAT TAGAAGATGA TTATCTTTCA
GAATTAGTTT ATTCTAATGA TTTAGAGTAT ATACCATATA GGAGTTTAGA TTCTGAAAGA
GTTATATATA TAAAAAGTTT TTCTAAAATA TTTCTACCAG GTATAAGAAT GGGATATTTA
ATAGCACCAG ATAAATTTAA AGAGGAATTT CAAGCTTTAA AGTTTAATAG TGATATAGCT
ACCTCAAGTT TAATGCAAAG AGCTCTTCAG GATTATATTG CAAAGGGATA TTGGAAAGAG
CACATAGAAA GATTAAATGA GGAATATTCA AAGAGATATA ATTTTATAAA AGAACTTATA
GATAATAAAT TAGGAGATAT GGTTTCTTAT AGAGAGCCTA AGGGAGGACT TAATTTATTC
TTAAACATAA ATAAGAACAT AGACATAACT TCTAAAAAGT TATTTTATGA ACTGAAAGAT
AGACAAACTA TAATTACTCC AGGAAGCATA TTCTTTAAAA ATCCTAATGA TGGAGATAAA
AGCTTTAGAA TTGGATTTTC CCAAATAGAT TATAGTAAGA TAGAAAAAGG AATAGATAAT
ATATACGATG TATTAAAAGG TAGGTAG
 
Protein sequence
MKKFSVVFKE DLCKYLIIYD NIKSLIENGE ISEGEKLPTI RELADFLEVN KVTVINAYKK 
LVQEGYAYQS QGSGTYAKNK EVGKSFKHDY NDLFRKIASG DLKNWIDFTG ETTSANFFQV
EKLKKVLDNV LVRDGVEALM YSDPLGYLEL RRSINEEFWN GKNNLDNILI ISGAQQGIDI
VGKSLVNIND NIVIERPTYG GALLVFKLRR ANILEIPMEK DGPNIEKFEN LLKRNKIKCF
YTMSYFQNPT GISCSLEKKK KIIELAHKYD FYILEDDYLS ELVYSNDLEY IPYRSLDSER
VIYIKSFSKI FLPGIRMGYL IAPDKFKEEF QALKFNSDIA TSSLMQRALQ DYIAKGYWKE
HIERLNEEYS KRYNFIKELI DNKLGDMVSY REPKGGLNLF LNINKNIDIT SKKLFYELKD
RQTIITPGSI FFKNPNDGDK SFRIGFSQID YSKIEKGIDN IYDVLKGR