Gene CPF_1797 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1797 
Symbol 
ID4202230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2024185 
End bp2025291 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content30% 
IMG OID638082668 
Producthypothetical protein 
Protein accessionYP_696232 
Protein GI110800794 
COG category[S] Function unknown 
COG ID[COG4086] Predicted secreted protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAAGA AAAAGAGTAT TATGAAAGTT CTTTGTGGAG CAATAGTTAG TACATTCATG 
TTTGGAATGG CTACTCCAGT ATTTGCACAA AGTGAAAAAA CTGAGGCTGT AGTAACTTTA
GGAGCTAATT TAACAAAGTC AGAAAGATTA CAAATGTTAG ATGCTTTTGG AGTAAAAGCT
AATGATGTTA AAATAATAGA TGTAACTAAT CAAGATATAA GAGAACAATT AGGTTTAGAT
ACAAGTAAAC CAATACCTGC TAGCAGTCAA TCAATATCAA GTTCTTACGT TGTGGTTAAG
GACAAAGGTG GAATAAATGT AACTACTAAC AATTTAACAG AGGTAACAGG AAGTATGCTT
GCTAATGCCC TTCTTACTTC AGGGGTAAAT AATGCTGATG TAAAGGCTGA TGCTCCATTT
AAGGTTACAG GAACAGCTGC CTTAGCTGGT ATTTTAAAAG GATTTGAAGA TGCATCAGGA
GAGGAGTTAT CTCTTCCAAA GAAAGAGGCT GCAAGAGAGG AAATTTCTTT AACTAATAAT
CTTAGTAATG CTAAAACTAA AGATGGACAA ACACTAGGAA AAGATGAAGC CGCTGTAGTA
GTAAATGATA TTAAAACTGA TGTAATTAAA GATAAACCTA AAAATGATGA GGAAATAGGT
AAGATAGTAA ACAATGTTAC AAATAACTAT AATATACTTT TAACACAAGG GCAACAAGAA
CAAACAATAA AATTTATGTC TAAAATAAAT GATTTAGACT ATAACTATGG TGCTATGAAA
GAATCTTTAA ATCAAATGAA TGATAAGCTT CAACAGATAT TAAAAGACAC AGGAAAACAA
TTAGAAGAAA GTGGTCTTTT AGAAAAAGCA TTAAATGGTA TAAAAAATGT TTTAGTTGAT
ATTAAGGATT TTCTAGTAAA TATGTTTAGC TCAGCTAGTG AAAAAGTTAA AGATGGAATA
ACTTATGATG AAAATGGCAA TATAGTTATA AAAACAGGAA ATAATTCTGA TGAATCAAAG
AATGAAGAAA GTATCCAAGA TAAGCCACAA ACTCAGTCAA ATGATAATAA TCAAAATCAA
GAAAATGAAC AAGGACAAAA TAATTAG
 
Protein sequence
MIKKKSIMKV LCGAIVSTFM FGMATPVFAQ SEKTEAVVTL GANLTKSERL QMLDAFGVKA 
NDVKIIDVTN QDIREQLGLD TSKPIPASSQ SISSSYVVVK DKGGINVTTN NLTEVTGSML
ANALLTSGVN NADVKADAPF KVTGTAALAG ILKGFEDASG EELSLPKKEA AREEISLTNN
LSNAKTKDGQ TLGKDEAAVV VNDIKTDVIK DKPKNDEEIG KIVNNVTNNY NILLTQGQQE
QTIKFMSKIN DLDYNYGAMK ESLNQMNDKL QQILKDTGKQ LEESGLLEKA LNGIKNVLVD
IKDFLVNMFS SASEKVKDGI TYDENGNIVI KTGNNSDESK NEESIQDKPQ TQSNDNNQNQ
ENEQGQNN