Gene CPF_1810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1810 
Symbol 
ID4203281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2043346 
End bp2044845 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content26% 
IMG OID638082680 
Productphytoene dehydrogenase family protein 
Protein accessionYP_696244 
Protein GI110801106 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1233] Phytoene dehydrogenase and related proteins 
TIGRFAM ID[TIGR02734] phytoene desaturase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.149228 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCCAA ATAAAAAAGC TATAATAGTT GGAGCTGGAA TAGGTGGATT AGCTACTGCT 
GTTCGCCTTC TTATTAATAA TTTTGAAGTA GATATTTTTG AAAAAAACTC TAAGATAGGT
GGAAAAGTAA ATTTAATTGA ATACAAAGAT TTTAAGATAG ATTCTTCTGC TTCAATATTT
ATGCTCCCTA AACCCTATTT AGAAGTATTT AAATATGCAA AAAAAGACCC TAAAGACTAT
ATAGAGCTTG TAGAATTAAA TACTTTATAT AAAGTGTTTA ATGATGAAGG AGATAGTTTT
AATATTTATT CAGACTTTCT AAAAACTACA GAGTCCTTAG AAAAGGTATT TAATGATGAA
AGTTCAAATT ATTATAAATA TATATCTGAC TCATATAGAA GATATCTTTT AGTAGAAAAA
TACTTTTTAA ACAGAAGTTT TTTCACCTTA AATTATTGGA GATATTTTAA ATCTTTACCT
GAACTAATTA AAATACATCC TTTTAAAAAT TGTTATAAAA CCATTGAAGA ATATATAAGT
AATGAATATT TAAAAAACTT ATTAGCTTTT CAATGCATGT ATATTGGGGA GTCTCCCCTT
AAAAGTTCTA ATGTTTTTAA TTTAATTCCA TCAACCACTC AAATATATGG ATTATATTAT
ATTAAAGGTG GAATGTACTC TTATGTGAAG GCTTTAGAAA AATTAATCCT AGAACTTGGT
GGTAAAATAC ATCTTAACTC AAATGTAACT AATATACTTA TGGAAAAGAA TGTAGCAATT
GGAGCAAAAA TAAACCACGA AAATATATTT TCTGATTTAA TTGTTTGTAA TTCAGATTTT
ACTTATACCA TACAAAATTT ACTTCCTAGA AGTACATTTA AAAATAAAAT TTCTAGGCGA
AAACAAAATA ATTTATCCTT TTCTTGCTCT ACATTTATAC TACATTTATT TCTTAAGAAA
AAATATAAAA ATTTAGATGT ACATAATATA GTACTTAATT TAAATAAGAA AGAAGTTTTA
TTAGCTCCCT TTATAGATGG GCCCTTGCCA AAGGAATATA TATATTATAT CTATTGCCCA
AGCTCAATAG ATACTTCATT AACTCCTGAG GATTGTGAAT GCATTAATAT AACAGTACGT
GTTCCAAACT TAAAAAAATA TAAATCAAAA TGGACTGAAT CTACAATTGT TTCTTTGAGA
AACAAAATCT TGTATGACCT AAGTAAAATT AAAGGTTTAG AAGATATAAA AGAAAATATA
ATTTACGAAA GCTATACAAC GCCTATGACC TTAAAAAATG ATTTTAATTG CTTTTTTGGT
GCAGCTTTTG GTCTTAATCA TAATTTGCTA CAAACAACTA TTTTTAGACC TCAGGCAAAA
ATAAAAAAAC TAAAGAATAT ATATTTTGTA GGTGATTCAG TACATCCTGG CTCTGGAATA
TCAATGTCCT TAATCTCAGC TAAGCTATGC TGTGAAAAAA TAATATCAGA TTTTAGCTAA
 
Protein sequence
MSPNKKAIIV GAGIGGLATA VRLLINNFEV DIFEKNSKIG GKVNLIEYKD FKIDSSASIF 
MLPKPYLEVF KYAKKDPKDY IELVELNTLY KVFNDEGDSF NIYSDFLKTT ESLEKVFNDE
SSNYYKYISD SYRRYLLVEK YFLNRSFFTL NYWRYFKSLP ELIKIHPFKN CYKTIEEYIS
NEYLKNLLAF QCMYIGESPL KSSNVFNLIP STTQIYGLYY IKGGMYSYVK ALEKLILELG
GKIHLNSNVT NILMEKNVAI GAKINHENIF SDLIVCNSDF TYTIQNLLPR STFKNKISRR
KQNNLSFSCS TFILHLFLKK KYKNLDVHNI VLNLNKKEVL LAPFIDGPLP KEYIYYIYCP
SSIDTSLTPE DCECINITVR VPNLKKYKSK WTESTIVSLR NKILYDLSKI KGLEDIKENI
IYESYTTPMT LKNDFNCFFG AAFGLNHNLL QTTIFRPQAK IKKLKNIYFV GDSVHPGSGI
SMSLISAKLC CEKIISDFS