Gene CPR_1689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1689 
Symbol 
ID4205991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1885716 
End bp1886789 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content29% 
IMG OID642566239 
Productradical SAM domain-containing protein 
Protein accessionYP_699004 
Protein GI110803935 
COG category[B] Chromatin structure and dynamics
[K] Transcription 
COG ID[COG1243] Histone acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0776794 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAAAA GACACTATAT AATCCCAATT TTTGTTCCTC ATGAAGGATG TCCACATGAT 
TGCGTGTTTT GTAATCAAGG GAAAATAACT GGAGAAAATA AAGAGATTAT ATTAGGACCA
AAGTACAAGC AAGAAAATAA GGTTAATAGT AATTTTGTAA GAAAGACAAT TGAAGAATAT
ATAGAAACAA TTGGTGAAGG AGATAGAATA TTAGAGGTTT CTTTCTTTGG AGGAACTTTT
ACTGCTATTG ATATAAATAA GCAAAGAGAA CTATTAGCTG TTGCAAAAGA ATATAAGGAT
AAAAAAATTA TAGACTATAT AAGATTGTCT ACTAGACCAG ATTATATTGA TGAGTTTATT
TTAGATCATT TAAAAAGTTA TAAAGTTGAT ATAATAGAAC TTGGAGTTCA GTCTTTAGAT
AAGGAAGTGT TGCATAAATC AGGTAGAGGT CATGGTTATG ATGAGGTTCT AAAAGCTTCT
AAGTTAATTA AAGAATATGG CTTTACTTTA GGTCATCAAA TAATGGTAGG ACTTCCTGAG
GACACTTTTG AGAAGGATAT AGAAACAACA AGAGAGTCTA TAAAGATGAA ACCTGATATA
TGTAGAATAT ATCCTGCTCT TATAGTGAAA AACACTCCTA TGGAGGATAT GTACTTAGAG
GGAACTTATA AACCATATAC TCTAGAAGAG GCTGTATATA TAAGTGCTAA ACTTTATAAG
ATGTATAAAG AAAATAATAT ACAGGTTATA AGAATTGGTT TGCAGCCTAC AGATAATATA
GCTTTAGGTA AGGATATTGT AGATGGGCCT TTCCATCCTG CTTTTAGGGA ATTAGTAGAG
AGTAGTATTA TAAATGAAAA TATATATAAT ATCTTAAAGG ATAAAAGTGG AGAGGTAACT
ATAAGAATTA GCAATAAATC AGTTTCTAAG CTTTATGCTG ATAAAAAGAG ATACTTCAAT
GAACTTAAAG ACAAGGCACA AAATTGTAAT TTGAAGATTA AAGTAGATAA TTCTATGGAA
GTAGATAAAA TAAATATAGA AGTAGAATCG AAAGTATATA AAATAGATTT ATAG
 
Protein sequence
MGKRHYIIPI FVPHEGCPHD CVFCNQGKIT GENKEIILGP KYKQENKVNS NFVRKTIEEY 
IETIGEGDRI LEVSFFGGTF TAIDINKQRE LLAVAKEYKD KKIIDYIRLS TRPDYIDEFI
LDHLKSYKVD IIELGVQSLD KEVLHKSGRG HGYDEVLKAS KLIKEYGFTL GHQIMVGLPE
DTFEKDIETT RESIKMKPDI CRIYPALIVK NTPMEDMYLE GTYKPYTLEE AVYISAKLYK
MYKENNIQVI RIGLQPTDNI ALGKDIVDGP FHPAFRELVE SSIINENIYN ILKDKSGEVT
IRISNKSVSK LYADKKRYFN ELKDKAQNCN LKIKVDNSME VDKINIEVES KVYKIDL