Gene CPF_1584 details

Gene Information

Locus tag: CPF_1584
Symbol:
ID: 4203065
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1801706
End bp: 1802890
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 30%
IMG OID: 638082462
Product: HK97 family phage major capsid protein
Protein accession: YP_696027
Protein GI: 110799270
COG category:
COG ID:
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
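
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1801706, 1802890)
print(length)                     # 1185
print(protein_length_aa(length))  # 394
```

Under those assumptions both values agree with the Gene Length and Protein Length fields of this record.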


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0506282
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTAT TTGAAAGATT AAAAGAATTA AGAGCAAAGA AGAAAGACTT AGAGGAAAGA 
AGAAAAGTAA TAGTAGAAGA GATTAGATCA TTAGCTAAAG AAGAGAAGGA AGAGGAAATA
AGAAGTAAAG CTATTGAAAG AGAAAAGATA GAGGCTAGAA TGGAAATAAT TGAGGAAGAA
ATAGAATCAG TTATGGAAGC CATTGAAGAG GAAAGAAGCA ACAGTAACTT TTCAGGTGGA
AGAGTTTTAG GTGGAGAAGG TTCAAAAGAA GAAAAAAGAA GTTTACAATT AAGCGCAATG
AGTAAAGTTG TAAGAGGAAT ATCTTTAAGT GAAGAAGAAA GAGATGTTAT GTCAGCAACA
AATAATGGAG CTGTAATACC TCAAGAATTT GTTAATGAAT TTGAAAAATT AAAAGAGGGA
TATCCAGCTT TAAAATCATA CTGTCATGTA ATACCAGTTG CAAGAAATTC AGGAAAGTTA
CCTGTAAGAG CTGGGGGAAG TGTTACTAAA CTTGCAAATT TAGAAGAAGA TACAGAATTA
GTTAAGGCTA TGATGAAAAC TAAACCTATG TCATATGATA TAAATGATTA TGGATTACTT
GCACCGATAG ATAACTCATT ACTTGAAGAT AGTGAAATAA ACTTTTTAGA ATTTGTAAAT
GAAGAATTCG TAGAATATGC AGTTAATACT GAAAATAGCG AAGTAGTAGA TCAAGCTAAT
AAATTACTAG CTACTGAAGA AGTGAAAGAT TATATAGAAA TGGTTGAGAA AATAAATTCA
TTAGTTCCTA ATGCAAGAAG TAGAGCTGTT ATTGTTACTA ATTCTTTAGG TAGAGGTTAT
TTAGATGCAT TAATGGATAA GCAAGGTAGA CCACTTTTAA AAGAATTATC AGATGGAGGA
AGTTTAATAT TTAAGGGAAG AGATGTGGTT GAATTAGACT CAACTACATT TAATACAGGA
GAGGAAATCA AGTTTATAAT TGCAGATTTA AAGACATTAA TTAAATTTAT GGATAGAAAA
CAATATTTAA TAGATCAATC AAAAGAAGCT GGATATACTA AGAATCAAAC TATAGCTAGA
ATTATAGAAA GATTTGATGT TGAATCACCA TTAAAAAAAT CAGAAGATGC AGCAGTAATA
AGAAAATTTG GATTAATAGT AAAAGTAAAT GAAGCTAAAG CGTAG
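
The sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any computation such as GC content. A minimal sketch, with hypothetical helper names; for brevity it uses only the first row of the gene sequence, so its GC fraction is not expected to match the whole-gene value reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First row of the gene sequence, copied as displayed (illustrative input only).
first_row = "ATGAAATTAT TTGAAAGATT AAAAGAATTA AGAGCAAAGA AGAAAGACTT AGAGGAAAGA"
seq = clean_sequence(first_row)
print(len(seq))
print(gc_fraction(seq))
```

Passing the full 1185 bp sequence through the same two functions gives the GC fraction for the whole gene.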
 
Protein sequence
MKLFERLKEL RAKKKDLEER RKVIVEEIRS LAKEEKEEEI RSKAIEREKI EARMEIIEEE 
IESVMEAIEE ERSNSNFSGG RVLGGEGSKE EKRSLQLSAM SKVVRGISLS EEERDVMSAT
NNGAVIPQEF VNEFEKLKEG YPALKSYCHV IPVARNSGKL PVRAGGSVTK LANLEEDTEL
VKAMMKTKPM SYDINDYGLL APIDNSLLED SEINFLEFVN EEFVEYAVNT ENSEVVDQAN
KLLATEEVKD YIEMVEKINS LVPNARSRAV IVTNSLGRGY LDALMDKQGR PLLKELSDGG
SLIFKGRDVV ELDSTTFNTG EEIKFIIADL KTLIKFMDRK QYLIDQSKEA GYTKNQTIAR
IIERFDVESP LKKSEDAAVI RKFGLIVKVN EAKA
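
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is an illustration under stated assumptions, not the database's own pipeline: it builds the codon table from the standard amino acid assignment string, which translation table 11 shares with table 1 (the two differ only in permitted start codons), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, shared by tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing used in the display
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGAAATTAT TTGAAAGA"))  # MKLFER
print(translate("GAAGCTAAAG CGTAG"))     # EAKA
```

The two calls use the first 18 and last 15 bases of the gene sequence and yield the first six and last four residues of the protein sequence shown above.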