Gene CPF_1793 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1793 
Symbol 
ID4202132 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2020405 
End bp2021535 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content31% 
IMG OID638082665 
Producthypothetical protein 
Protein accessionYP_696229 
Protein GI110798901 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0845] Membrane-fusion protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.672383 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA AATACATAAT AGCAATAGTA ACAGTAGTAA TACTTGCAGG GGTAGGAGTT 
GGAAGTTATT TTTTAAAACA AAGTATGAAT AAGGAATCAG TGGCAACTAT GGAGATATAC
ACTGTTCCCT CTACAGATAA GGTTTTTGTA AATGGGAAAA TAGAACCAGA AAAAGTTGAA
AATATATTTT TAGATGCTAC AAAGGGTACT GTAGATAAAG TAGAAGTTGA AAATGGAGAT
GTTGTTGAAA AAGGAGATAC TTTATTTACA TATAAAAATG ACCAAGTTCA AAGTCAGGTA
GAACAGTTAG AATTACAATT AAATTCTGCT AAAAATCAAA AAGAAGAAAT TAATAAACAA
AATGCAGAAG CAAAAAAACA ATTAGAGGAT TTAAAGAAAG CAGGATTAGA AAATCAAATG
CCTCAAGGAG GTCAAATGCC TAATTTAGGA CAAAATGCAG GTGGAGAGAT ATCAACTGGA
AGTGTAGATG AGCAAATAAA ACTTCTAGAA AAGCAAATAA AAGCATTAAA AGATAAGGAA
TATTATAAAG TAACTGCGCC TATAGGTGGA AAGGTAATTT TGGCAGAATC AAGTACAAAT
CCTACACAAC CATATATTAC TGTGGAATCA GGTGATTATT ATATTAGTGG AAGTGTAAAT
GAAAAAGATC AACCTAAGAT AAAGGAAGGA CAAGAAGTTC AGATAACTAT TCTTTCAACA
AATAAAAATA TAAATGGTAA AATATCCTCT GTTGGAAACA CTCCTATAGA TAATGGAGCT
TCTTTAGCGG CACAAACAGG TGCACAGGGA GGCGCAAGTG GAAATATGTC TTATTATGAA
GTTAAGATAA CACCAGATTC TCAGGAAGAT TTAACTAATG GATTCCATGT TCAAGCATCT
GTTAATTTAG ATAAAAAGCC AATAGAAATT CCTAAGGAAG CGATTTTAAG TGTGGATAAT
GAGGAATTTG TATTTAAAAA TGTAGATGGA AAGCTTGTTA AGCAAGTTAT AACATATTCA
CCTAAAGAAG GAAGTGAAGA TGAAGTTATA GTAAGCAGTG GATTAAATGA AGAAGATAAA
ATAGTTTCTA AGCCTACTCC TAACATGAAA GAGGGGATGA ATGTTGAGTA A
 
Protein sequence
MKKKYIIAIV TVVILAGVGV GSYFLKQSMN KESVATMEIY TVPSTDKVFV NGKIEPEKVE 
NIFLDATKGT VDKVEVENGD VVEKGDTLFT YKNDQVQSQV EQLELQLNSA KNQKEEINKQ
NAEAKKQLED LKKAGLENQM PQGGQMPNLG QNAGGEISTG SVDEQIKLLE KQIKALKDKE
YYKVTAPIGG KVILAESSTN PTQPYITVES GDYYISGSVN EKDQPKIKEG QEVQITILST
NKNINGKISS VGNTPIDNGA SLAAQTGAQG GASGNMSYYE VKITPDSQED LTNGFHVQAS
VNLDKKPIEI PKEAILSVDN EEFVFKNVDG KLVKQVITYS PKEGSEDEVI VSSGLNEEDK
IVSKPTPNMK EGMNVE