Gene CPF_1979 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1979 
Symbol 
ID4201930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2218254 
End bp2219462 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content27% 
IMG OID638082848 
Producthypothetical protein 
Protein accessionYP_696412 
Protein GI110801329 
COG category[R] General function prediction only 
COG ID[COG1323] Predicted nucleotidyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATAA CTGGCATAAT AACTGAATAT AACCCTTTTC ATCTAGGTCA TGAACTTCAT 
CTAAAAAGTT CAAAAGAGAT TACAAATTGC GATGGAGTTA TTTGTGTTAT GAGTGGAAAC
TTTGTGCAAA GAGGTTTGCC TGCTTTAACG GACAAATGGA CTAGAACAAA AATGGCCTTA
GAAGCTGGAG TTGATTTAGT TGTAGAACTT CCAACCCTTT TTGCAACTTC TTCAGCAGAA
TTTTTTGCCT TTGGTGCAGT ATCTTTGCTT AATTCTTTAA ATGTAGTTAA TAATATTTGT
TTTGGGTCAG AATGTGGAGA TATAGATTTA ATCAAAAAAC TTAGTGAAAT TATTATCAAT
GAACCTCCTC TATTTAAAGA ATATTTAAAG GATTATTTAA AGGAAGGACT TCCCTTTCCT
AAAGCTAGAA GTAAAGCTTT AATGAAGTAC TTAGATGATA ATAATTATAA AATTGATTTT
TCATACTTAG AAAAAGTTCT AAACTCTTCT AATAATATAT TAGCCATTGA ATATTGTAAA
AGCCTTTATA AGCTTCAAAG TTCTATAAAA CCTTTTACTA TACAAAGATT AGGAGCAGAT
TACAATGATG AAAAACTTTC AAAAAATGAA ATAGCCTCTG CTTCTGCCAT AAGAAAAAGT
ATTTATACTT CAAATATAGA AGAAAGTCTT GATTTTATGC CTGAGTATAG CTATAACTTA
TTAAAAAATA CTTCATTTAG TGATTTAGAC AAAATGTTTG ACTTAGTAAA ATACGCTATA
GTAAGCAATC CTAATGTATT AAAAGAAATA CCAGAGGCTT CTGAAGGAAT AGATAATAAG
ATAATTCAAA ACATAGGAAA AGCTAATTCT TTAGATGAAT TAATAAACCT TTGTAAAAGT
AAGCGTTATT CATACACTAG ATTAAACAGA ATTTTATGTC ACGTACTATT AAATGTAAAT
AAAGATCTTC TTTCTCTTAG AAAATCTTCT CCTAATTATG TAAGAATCTT AGGATTTAAT
AATAAAGGAA GGGAAATTTT AAAAGAGATT AAGAAAAATT CTGAAATAAA CATCGTTAAT
AAATTATCAA AAGCTAAATC AGATTCTTTG TTAGAATTTG ACATAAAAGC CACTAATATT
TATAGCTTTC TAAATCCATC AGTTAAAATT AACAGTGATT ATTTAATTAG TCCTATTATT
TTTAGATAA
 
Protein sequence
MNITGIITEY NPFHLGHELH LKSSKEITNC DGVICVMSGN FVQRGLPALT DKWTRTKMAL 
EAGVDLVVEL PTLFATSSAE FFAFGAVSLL NSLNVVNNIC FGSECGDIDL IKKLSEIIIN
EPPLFKEYLK DYLKEGLPFP KARSKALMKY LDDNNYKIDF SYLEKVLNSS NNILAIEYCK
SLYKLQSSIK PFTIQRLGAD YNDEKLSKNE IASASAIRKS IYTSNIEESL DFMPEYSYNL
LKNTSFSDLD KMFDLVKYAI VSNPNVLKEI PEASEGIDNK IIQNIGKANS LDELINLCKS
KRYSYTRLNR ILCHVLLNVN KDLLSLRKSS PNYVRILGFN NKGREILKEI KKNSEINIVN
KLSKAKSDSL LEFDIKATNI YSFLNPSVKI NSDYLISPII FR