Gene Information |
Locus tag | CPR_2594 |
Symbol | |
ID | 4204188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 2826682 |
End bp | 2828847 |
Gene Length | 2166 bp |
Protein Length | 721 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 642567144 |
Product | phage infection protein, putative |
Protein accession | YP_699841 |
Protein GI | 110802444 |
COG category | [S] Function unknown |
COG ID | [COG1511] Predicted membrane protein |
TIGRFAM ID | [TIGR03061] YhgE/Pip N-terminal domain [TIGR03062] YhgE/Pip C-terminal domain |
|
|
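The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a stop codon that is not counted in the protein length; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2826682
END_BP = 2828847


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "2166 721"
```

Both results agree with the Gene Length (2166 bp) and Protein Length (721 aa) fields.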
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTGGA GTAACATTTT AAAAGTGTTT AAGAGAGATA TGAAAGCAAT AATAAAAAAT CCGATAGCAC TTTTAATAAT AGGTGGAATA TGCTTTATAC CATCGCTTTA CGCATGGGTA AATATAGCAG CATGTTGGGA TCCTTATGAA AATACTAGCA GTGTTCCAAT TGCCGTAGTA AATAATGATA AAGGTGCTTC ATTCAATGGA AAAGAAATGA ACATAGGTAA TGAAGTTATT GATGAATTAA AAAATAATCA TGCAATTGGA TGGAAATTTG TTGATACAAA ACAAGCAGAT AGGGGATTAA TAGATGGCAC CTATTATGCA ATGATAGAAA TTCCGGAAGA TTTTTCTAAA GATTTAACTA GTGTTTTATC TGAAAATCCT AAAAAGCCTG AAATAATATA CAAGGTTAAC ACAAAAGCTA ACCCTGTTGC AGGTAAAATA ACAGGAGTTG CACAAAGTAC TCTTGTAAAT CAAATAACAT CAAATTTTAT AACTACCGTT AATGAGACCG CATTTTCTTC ATTAAATGAC TTGGGATATG ATGTAAATAA AAATAAAGAT AATATTATTA AGTTAAAAAA TTCTATAATA GCTATAGATA AAAATATGAG TTCTATTAAA AATATACTTA ACAATATAAA TAGCAATTCA GCAAACTTAA GTACTTATTT AAAAGAAGTT CAAAATACTA TGCCATCAAT AACAAATGGT TTAAATAGTA TATCTAAAAA TACTGATAAT ACTGGTAATT TAATCAATAA TTCTAAGGAA TCTTTAAATA GTACCTTTGA TAATATAAGA TTAAATTTAA GTGAATCACA AACATCATTA AATAAGATAC AAAGTAATTT AGATGAATTA TCTTCAATAG CTAGTGATGC AAGCTCTGCT AAGATTAATT CATTAATAAG TAAATCTATC AATGAAATAA ATAAAGTTGA TAATAGTATT ACAGTTGTAA CTAACTTTTT AGATGTTATT AACAAAAAAA ACAACAATGA AAAAATAGCT AATATGATAA CTTCATTAAA AAATATTCAA AATTCATTAG CAGAAGAGAA AACTAAGCTT AATGATTTAC AAAGCAAAGT TAATAATGGT AAAGATTTAG ATAAAGGACT TCTTAGTTCA ATAAGTGATT TAACTAAGAA AATAGAAGGG CAATTAGATA ATTCTTTAAA TAGATTTGAT AATGATACAA GACCTGCTTT AAATACAATT GCTGATGGAT TTGTTACCGC TACTAAAGGG GCATCAGATT TACTAGGAAA AGCAAATGGA TTGGTAGATC AAATAAATAA TCTATTAAGC ACAGCTAATC AAGGGGCTCA ACTTGCAAAT TCAACATCAG AAAAATTAAA AAATAGTTTA GAAGAATTTT CAGGAGTTAT AAGTGAATTA AGTTCAAAGC TTAAAGATGT TAGTGATGAT GATCTTGGTA AAATAGTTGG AATACTACAA AGTAATCCAG AATTTATGGG AGATTTCATT GCAAATCCAT TTAATATAGA AGTAGATGCA ATATACAAAG TAGCTAACTA TGGATCAGGA ATGGCACCTA TATATTCGGT ATTAGCTTTA TGGGTTGGCT CATTAATATT AATATCATTA TTAAAAACAG AGTCAGCTGA ATTTGAAGGT AGTGAGAATA TAAATTTAAG AGAAAGACAT TTTGGAAAGA TGTTAACTTT CGTAAGTTTA GGAATATTAC AAGGATTTAT AGTAGCCTTT GGGGATAAGT TCCTTTTAGG AGTTCAAACA GTAAATACAG CATTATTAAT TTTTGTATCA 
ATGTTTGCTT CAGTTGTATT CACTATAATT GTATTTACGC TTATGTCTGT CTTTGGAAAC CTAGGTAAAG CCTTGGCTAT TATACTTATG GTTTTACAAT TAGCAGGTAG TGGTGGATCA TATCCTATAC AAGTAGATCC ATTATTTTTT AGAATAATTC AACCAACTTT CCCATTTACT TATGCTATAT CAGGGTATAG GGAAGCTATT GCAGGACCTT TAGTAAGTAC TGTAATTTTA GATTTTGTTG TTTTAACAAT AATGGGATTA GTATTTATAT TATTAGGATA TTTATTGAAA GGTCCTTTAA ATCCTAGAGT TAGAAAATTT GAAGATATGT TTGAAGAATC TGGAATAGCA GAATAA
|
Protein sequence | MKWSNILKVF KRDMKAIIKN PIALLIIGGI CFIPSLYAWV NIAACWDPYE NTSSVPIAVV NNDKGASFNG KEMNIGNEVI DELKNNHAIG WKFVDTKQAD RGLIDGTYYA MIEIPEDFSK DLTSVLSENP KKPEIIYKVN TKANPVAGKI TGVAQSTLVN QITSNFITTV NETAFSSLND LGYDVNKNKD NIIKLKNSII AIDKNMSSIK NILNNINSNS ANLSTYLKEV QNTMPSITNG LNSISKNTDN TGNLINNSKE SLNSTFDNIR LNLSESQTSL NKIQSNLDEL SSIASDASSA KINSLISKSI NEINKVDNSI TVVTNFLDVI NKKNNNEKIA NMITSLKNIQ NSLAEEKTKL NDLQSKVNNG KDLDKGLLSS ISDLTKKIEG QLDNSLNRFD NDTRPALNTI ADGFVTATKG ASDLLGKANG LVDQINNLLS TANQGAQLAN STSEKLKNSL EEFSGVISEL SSKLKDVSDD DLGKIVGILQ SNPEFMGDFI ANPFNIEVDA IYKVANYGSG MAPIYSVLAL WVGSLILISL LKTESAEFEG SENINLRERH FGKMLTFVSL GILQGFIVAF GDKFLLGVQT VNTALLIFVS MFASVVFTII VFTLMSVFGN LGKALAIILM VLQLAGSGGS YPIQVDPLFF RIIQPTFPFT YAISGYREAI AGPLVSTVIL DFVVLTIMGL VFILLGYLLK GPLNPRVRKF EDMFEESGIA E
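The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field could be cleaned and its GC fraction computed is shown below; the helper names are hypothetical, and only the first three blocks of the gene sequence are used as sample input, so the result is not expected to match the whole-gene GC content field.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three blocks of the gene sequence field, used only as sample input.
sample = clean_sequence("ATGAAGTGGA GTAACATTTT AAAAGTGTTT")
print(len(sample), round(gc_fraction(sample), 3))
```

Applying the same two functions to the full Gene sequence field would give a value that can be compared with the GC content reported in the Gene Information table.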
|
| |