Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_1576 |
Symbol | |
ID | 4201424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | - |
Start bp | 1795557 |
End bp | 1798811 |
Gene Length | 3255 bp |
Protein Length | 1084 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 638082454 |
Product | TP901 family phage tail tape measure protein |
Protein accession | YP_696019 |
Protein GI | 110800747 |
COG category | [S] Function unknown |
COG ID | [COG5412] Phage-related protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.929856 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGCTA ATGTAAAAAT AGGAGCAAAT ACATCTGACT TTCAAAAGCA AATGAAGGAT ATGGTAAGAG AGCTTAAATC TCTTGGCAGT ACATTTAATT TAGCAAGTAC TCAGGCTAAA TTATTTGGTA ATGCTAATGA ACAATTAAAA GTAAAACAGG CTGAGCTTAC TGAAAAAATG AAACTTCAAA ATAGAATGAT TAATTTACAA AAAGAAGCAA TTAATAAGTT GACTAATGAT GTTCAGAAGC AAAAAGAAAA AAGAGAAGAA TTATCTAAAA AAATAGAAGA AGTAACTAAA AAATATAAAG AAAGTTCTGA AGCTACTGGA AAGAATAGCG AGGAAACAAA AAAGTTAGGA AAATCTTTAG CTGATTTAAA AGAAGAATAT GCTAGGAATG AAAGAGCAAT AGATTCATCT AATAGAAAAA TAGATACTGC AAATATGAAA ATGAATAAAT CTAAGACAGA GCTATTAGAA AATAAAAAAG CTTTAGAAGA GGTTGATAAG AAATTAAAAG ATATCAATTT AGATAAATTC TCTCAAAAAA TGGACAAGGT AAGCAATGCT ACAGGGAAGG CAGCTAGTGC TTTAAAACCA GCAGCAATAG CTGTTACTGG ATTTGGTGTT GCTGCTGCAG TGACAGAAAT GAAGTTTCAG GATGGTATAG CAAATATAAA TACTCTTTTA GATGACCAAA GTCATTTAGA AGGATATAAA AATAAAATAA TGGAAGTATC AAACCAAACA GGAATTGATT TGAAGATTGT AACAGATGGT ATGTATCAGG CTATTTCATC AATTGGAGAT GGAGGAGCAG AAACAGAGAA GATATTTGAT ACTATGGCTA AAAGTGCTAA GGCTGGTGGA GCAGAGGTTA AAGATGCAGT TGCTTTAATT AGTGCAGGAA TGAAAGGATA TAATCAAGTT AATGATGAAA CAGCAAAGAA GATAAGTGAT TTAGCCTTCC AAACAGCTAA GTTGGGAGTA ACAACTTTTC CAGAAATGGC TTCAAGCATG CAACCATTAT TCCCACTAGC AAGCAATTTA AATTTATCAA TGACAGATTT ATTTACTAAT ATGGCTACAT TAACAGGGGT TACAGGAAAC ACTTCAGAAG TATGTACCCA GTTAAAAGCA GTATTTAGTA ACTTAATTAA ACCAACAGCA GATATGCAAA AATTAATGGA GAAATATGGC TTCCAAAATG GTCAAGCAAT GTTAAAAAGT GAGGGCTTAA TTGGTACTTT AAAAATATTA CAAAAAGAAA CTGGTGGACA ATCAGATAAA ATGGGAAAAC TTTTTAGTAG TACTGAAGGA CTAACAGCAA TTACAGCATT AACAAGTTCA CAGTTTGATA CGTTGGCAGA TAAGTCTAAG AAAATGAATG AGGCTATAGG AACAACTGAT TCAGCTCTTG CAAAAATAAA TAATACTACT GGAAATGATT TAAGAACATC TTTAAACTTA GCTAAAAATA GTCTAGTTGG ATTTGGTGAA GTATTAGCAC CATTTATTTC TTTAGCTGCT AAGGGAATAG GTGGTATAGC TAAAGCTTTT GGTGGATTAA GTGAAGGGCA AAAAAAGGCA GTAGTGGGAA TAGGAGCTTT TGTAGTTGGT TCTTATGGAG CTTTAGCAGT AACAACTAAA GTTACAGGAA CAATAAGAAA TGCCATTAAA GATTATAAAG CATTTAGAGA TGTTATGGAA AAATTAAAAA TTGCTACAAA GTTACAAACA GCAGCACAGA AAGCTTTAAA TTTTGTTACT GAAATATCTC CAGTAGGAAA AATTATATTA GTCGTTGGAT TAGTAGTAGG AGCTTTAACT TATTTATATA ATCATTGCGA GTGGTTTAGA AATGGAGTTA ATAAAATATT TAGTGGACTT ATAAAGTTTT TTACAGAAAC AGTACCTAAT AAATTAAAGC AGTTAATAAA TTTCTTTAAA AATGATTGGA AGGAAATTTT GTTGCTTATA GTTAATCCTT TTGCTGGAGC TTTTGCATTA GCTTACAAGC ATTGTGAAGG ATTTAGGAAT GGAGTGGACA AATTATTTTC AAATATTAAA GAGTTTTTTA CTAAAAGTAT TCCTGATTTC TTTAAGTGGT TAATATCTAA ACTTACACAA TTTAAAACAG ACTTCATAAA TAAAATAAAA GAGTTTGGTA CTATAATAAA AACTAGAATA AAAAATTATA TAGAAGAAGT AAAATTTATA TTTAGTAATT TACCTAAACT TATGGGGATA TTGATAGGAA AAATTGCAGG AGAAATTTAT AAGGGATTTT TAAACATAAA AATATTTATC ACTAAAACTA TTCCAGATGC AATAAATTCT ATAAAGCAAT GGTTTGCACA GCTTCCTGAA GCAATAGGGA AACAATTACT TGATTCATAT ATGAAAGTAA AGACATGGGG AAATAATTTA TATATTTCAG CAAAAAAGAC AGGTAAGGAT TTTATATATG GAGTGATAGA TTATGTTAAG GAACTACCAC ATAAAATTTG GAATAAGATT ACTGAAGCTT ATGATGGAGT TACTACATGG GGAAATAATA TGTACCAAGA AGCTAAGAAA GTGGGAAAGG AATTTGTTGA TTCTATAGTA GATCATGTTA AAGAACTACC AACAAGATTT AAAAATTGGT TAAAAGAATC TTGGAATAAG GTAAGTGCAT GGGGAGAAGA TTTAAAGACT GCTGGTAAAG AGAGTGGTAA AAAGTTAGTT GATTCTATAG TAGATACAGT AAAAGCTATA CCAGGACAAA TGAAAGAGAT AGGTAAAAAT ATAGTTCATG GGATTTGGGA TGGTATAACA GGTGCTATTG GATGGATAAA AAGTAAGATA AAACAATTTT GTGATGGAAT TGTAGAAGGA TTTAAAAGTT CTTTAGATAT ACATTCTCCA TCTAGGGTGT TAAGAGATCA GGTTGGTAAA TTTATGGCTC AAGGTGTTGG AGTAGGTTTT GTCAATGAAA TGGAAGATAT AAATTTAGAT ATAAAGCAAA GTTTGGATAG AACTATAAAT ACTAATATAG TACCTTCTAT ATCAAATGTT GACTTAGAAA AAATAAATAC AAAACTTAAT AATACAAACA ATAATAATAT AGTAGTTGTA ATTGAAAATG TAACTAATTT AGATGGTGAA GTAATCAGTA CTAAAGTATA TAAGAAAGTA GCTAAACAAA TGAAAAGTGA TGAAAATAGT TATAGAGTTA CTAAAGGAAA GAAGGGAGGT CGAATATGTG CATAA
|
Protein sequence | MGANVKIGAN TSDFQKQMKD MVRELKSLGS TFNLASTQAK LFGNANEQLK VKQAELTEKM KLQNRMINLQ KEAINKLTND VQKQKEKREE LSKKIEEVTK KYKESSEATG KNSEETKKLG KSLADLKEEY ARNERAIDSS NRKIDTANMK MNKSKTELLE NKKALEEVDK KLKDINLDKF SQKMDKVSNA TGKAASALKP AAIAVTGFGV AAAVTEMKFQ DGIANINTLL DDQSHLEGYK NKIMEVSNQT GIDLKIVTDG MYQAISSIGD GGAETEKIFD TMAKSAKAGG AEVKDAVALI SAGMKGYNQV NDETAKKISD LAFQTAKLGV TTFPEMASSM QPLFPLASNL NLSMTDLFTN MATLTGVTGN TSEVCTQLKA VFSNLIKPTA DMQKLMEKYG FQNGQAMLKS EGLIGTLKIL QKETGGQSDK MGKLFSSTEG LTAITALTSS QFDTLADKSK KMNEAIGTTD SALAKINNTT GNDLRTSLNL AKNSLVGFGE VLAPFISLAA KGIGGIAKAF GGLSEGQKKA VVGIGAFVVG SYGALAVTTK VTGTIRNAIK DYKAFRDVME KLKIATKLQT AAQKALNFVT EISPVGKIIL VVGLVVGALT YLYNHCEWFR NGVNKIFSGL IKFFTETVPN KLKQLINFFK NDWKEILLLI VNPFAGAFAL AYKHCEGFRN GVDKLFSNIK EFFTKSIPDF FKWLISKLTQ FKTDFINKIK EFGTIIKTRI KNYIEEVKFI FSNLPKLMGI LIGKIAGEIY KGFLNIKIFI TKTIPDAINS IKQWFAQLPE AIGKQLLDSY MKVKTWGNNL YISAKKTGKD FIYGVIDYVK ELPHKIWNKI TEAYDGVTTW GNNMYQEAKK VGKEFVDSIV DHVKELPTRF KNWLKESWNK VSAWGEDLKT AGKESGKKLV DSIVDTVKAI PGQMKEIGKN IVHGIWDGIT GAIGWIKSKI KQFCDGIVEG FKSSLDIHSP SRVLRDQVGK FMAQGVGVGF VNEMEDINLD IKQSLDRTIN TNIVPSISNV DLEKINTKLN NTNNNNIVVV IENVTNLDGE VISTKVYKKV AKQMKSDENS YRVTKGKKGG RICA
|
| |