Gene CPF_1971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1971 
Symbol 
ID4201767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2211671 
End bp2212744 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content29% 
IMG OID638082840 
Productradical SAM domain-containing protein 
Protein accessionYP_696404 
Protein GI110800781 
COG category[B] Chromatin structure and dynamics
[K] Transcription 
COG ID[COG1243] Histone acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.144867 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAAAA GACACTATAT AATCCCAATT TTTGTTCCTC ATGAAGGATG TCCACATGAT 
TGCGTGTTTT GTAATCAAGG GAAAATAACT GGAGAAAATA AAGAGATTAT ATTAGGGCCA
AAGTACAAGC AAGAAAATAA GGTTAATAGT GATTTTGTAA GAAAGACAAT TGAAGAATAT
ATAGAAACAA TTGGTGAAGG AGATAGAATA TTAGAGGTTT CTTTCTTTGG AGGAACTTTT
ACTGCTATTG ATATAAATAA GCAAAGAGAA CTATTAGCTG TTGCAAAAGA ATATAAGGAT
AAAAAAATTA TAGACTATAT AAGGTTGTCT ACTAGACCAG ATTATATTGA TGAGTTTATT
TTAGATCATT TAAAAAGTTA TAAAGTTGAT ATAATAGAAC TTGGAGTTCA GTCTTTAGAT
AAGGAAGTGT TGCATAAATC AGGTAGAGGT CATGGTTATG ATGAGGTTCT AAAAGCTTCT
AAGTTAATTA AAGAATATGG CTTTACTTTA GGTCATCAAA TAATGGTAGG ACTTCCTGAG
GACACTTTTG AGAAGGATAT AGAAACAACA AGAGAGTCTA TAAAGATGAA ACCTGATATA
TGTAGAATAT ATCCTGCTCT TACAGTGAAA AATACTCCTA TGGAGGATAT GTACTTAGAG
GGAACTTATA AACCATATAC TCTAGAAGAG GCTGTGTATA TAAGTGCTAA ACTTTATAAT
ATGTATAAAG AAAATAATAT ACAGGTTATA AGAATTGGTT TGCAGCCTAC AGATAATATA
GCTTTAGGTA AGGATATTGT AGATGGGCCT TTCCATCCTG CTTTTAGGGA ATTAGTAGAG
AGTAGTATTA TAAATGAAAA TATATATAAT ATCTTAAAGG ATAAAAGTGG AGAGGTAACT
ATAAGAATTA GCAATAAATC AGTTTCTAAG CTTTATGCTG ATAAAAAGAG ATACTTCAAT
GAACTTAAAG ACAAGGCACA AAATTGTAAT TTGAAGGTTA AAGTAGATAA TTCTATGGAA
GTAGATAAAA TAAATATAGA AGTAGAATCG AAAGTATATA AAATAGATTT ATAG
 
Protein sequence
MGKRHYIIPI FVPHEGCPHD CVFCNQGKIT GENKEIILGP KYKQENKVNS DFVRKTIEEY 
IETIGEGDRI LEVSFFGGTF TAIDINKQRE LLAVAKEYKD KKIIDYIRLS TRPDYIDEFI
LDHLKSYKVD IIELGVQSLD KEVLHKSGRG HGYDEVLKAS KLIKEYGFTL GHQIMVGLPE
DTFEKDIETT RESIKMKPDI CRIYPALTVK NTPMEDMYLE GTYKPYTLEE AVYISAKLYN
MYKENNIQVI RIGLQPTDNI ALGKDIVDGP FHPAFRELVE SSIINENIYN ILKDKSGEVT
IRISNKSVSK LYADKKRYFN ELKDKAQNCN LKVKVDNSME VDKINIEVES KVYKIDL