Gene Cphy_2969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_2969 
Symbol 
ID5744030 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp3632037 
End bp3633593 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content39% 
IMG OID641294069 
ProductSPP1 family phage head morphogenesis protein 
Protein accessionYP_001560065 
Protein GI160881097 
COG category 
COG ID 
TIGRFAM ID[TIGR01641] phage putative head morphogenesis protein, SPP1 gp7 family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAAAA GTAGAGAGTA CTGGAAAAAG CGATTTGAAA AGTTAGAGGA CGATACGTAT 
CGAAAGAGTA AGGCATATTA CGATGATCTA CAAGAGCAGT TTAGACGAGC TACAAGTGAA
TTAGAAAAAG ATGTTTCTAC ATGGTATTAT CGAGTTGCAG AGAATAACGA AATTAGCTAT
GCTCAAGCTA AGCGGTATCT GAATCGGAAA GAAGTCAAAG AGTTCCGTTG GACCGTAGAT
GAATATATTA AGTACGGTAA GGAAAATGCT CTTGATCAGA AATGGATGAA GGAACTTGAA
AACGCTTCGG CTAGAGTACA CATTACCAGG CTAGAAACAG TTAAACTCCA ATTACAGCAG
TCGGTGGAAA AGCTGTTTGT TGAGTACGAG GGCGGTGTCT CGGATTTGTT GAATAAAGCT
TTTAAATCAT CCTACTATAA ATCTGCTTTT GAGATTGCGA AAGGCACTGG AGTAGGTAAG
GATCTTCATG CACTAGACGA TAATTTGGTA AGTAGTTTCC TTCGTAAGCC TTGGGCTGCT
GATGGCAAGG ATTTTAGTAG TCGTATATGG CAAGACAAAG AAAAGCTTAT ACGTGAGCTA
CATACGGAGC TTACGCAACA GCTGATACGT GGAAGTGATC CTGGTAAGAC TATTGCAGCA
CTTGCCAAGA AGATGGATGT CAGCAAGAGC AAAGCCGGTA ACTTGGTAAT GACTGAAACA
GCAGCTATAC ATTCAATGGC TCAGAAGGAA TGTTATAAAA ATCTTGAAAT TGAAGAGTTT
CAAAATGTAG CAACACTTGA CCTTAGAACA AGCGATATTT GCATAGGGAT GGATGGCACC
CATTTTCCTA TAAGTGAGTA CAAAGTCAAC GTTACTGCGC CACCTTTCCA TTGTAATTGC
CGCACTTGCA CATGCCCTTA CTTTAACGAC GAATTCACCG AAGGAGAGGA AAGAGCAGCA
AGAGATCCGG TTACCGGAAA GACACATAAT GTGCCTGCGG ATATGACATA TAAGGAATGG
CATGAGAAGT ACGTTAAGGG CAATCCTCAA GCAGACCTTG CAGAGAAAAA GTCAAATAAT
ACGCTTGCTG ATCAGAAACA GTATGACAAG TATAAAAGTA TCCTTGGAAA AGACACTCCT
AAAAACCTTG ATGATTTCCA AAATTTGAAG TATACTAATG ATGAGAAATG GAATATGTTG
CAACTGGCAT ATAAGGACCA GAAAGTACGT AATAATATCC GTTCTGATGA TACCGTAAAA
ATAATTGAAA CTGGTAAACA GGGCAAGCAT ATTAAAGGAC ATAATAATTA TATAGAAGGT
CGTAGTTACT TAACTATAAC TGAGAAGGAA ACCCAAGATT TAGTCAACAA GTATGCTGGT
ACAGGAGAAA TCAAGAGAGA TAACAGAGGG AACTGGAACA ATAAAGAGGT TGTAGACTTT
AAGAAGGAAA TAGGTATCAG TATTGATAAT CTGACTGGCA TTGAATACGT CACAACTAAG
GCGAAGATAC ACTACAGTAA AAAAGGTGTT CACGTAGTTC CGCATGGAAA GGAATGA
 
Protein sequence
MAKSREYWKK RFEKLEDDTY RKSKAYYDDL QEQFRRATSE LEKDVSTWYY RVAENNEISY 
AQAKRYLNRK EVKEFRWTVD EYIKYGKENA LDQKWMKELE NASARVHITR LETVKLQLQQ
SVEKLFVEYE GGVSDLLNKA FKSSYYKSAF EIAKGTGVGK DLHALDDNLV SSFLRKPWAA
DGKDFSSRIW QDKEKLIREL HTELTQQLIR GSDPGKTIAA LAKKMDVSKS KAGNLVMTET
AAIHSMAQKE CYKNLEIEEF QNVATLDLRT SDICIGMDGT HFPISEYKVN VTAPPFHCNC
RTCTCPYFND EFTEGEERAA RDPVTGKTHN VPADMTYKEW HEKYVKGNPQ ADLAEKKSNN
TLADQKQYDK YKSILGKDTP KNLDDFQNLK YTNDEKWNML QLAYKDQKVR NNIRSDDTVK
IIETGKQGKH IKGHNNYIEG RSYLTITEKE TQDLVNKYAG TGEIKRDNRG NWNNKEVVDF
KKEIGISIDN LTGIEYVTTK AKIHYSKKGV HVVPHGKE