Gene Information |
Locus tag | CPR_C0004 |
Symbol | |
ID | 4206678 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008265 |
Strand | - |
Start bp | 6108 |
End bp | 9644 |
Gene Length | 3537 bp |
Protein Length | 1178 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | |
Product | phage tail tape measure protein, TP901 family, core region domain protein |
Protein accession | YP_699933 |
Protein GI | 110804068 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
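The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; both assumptions fit the values in this record but are not stated in it. The helper names (gene_length, protein_length, gc_percent) are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the Gene Information table.
length = gene_length(6108, 9644)
print(length)                    # 3537
print(protein_length(length))    # 1178

# First 10-base block of the gene sequence, used only as a small demonstration.
print(gc_percent("ATGAGTGATT"))  # 30.0
```

Passing the full Gene sequence field to gc_percent should give a value close to the reported 32%, although how the database rounds that figure is not stated in this record.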
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGATT TAGAAAAACG AATAACGGCT AAAATGGTTT TAGATGATAG TGGTTATTCT AATACATTAA AAGGTATTAA TGCTACTTTA AGAGAAAATA AAAGTGAACT TAAAGCAGCT ACAAGTGGAT TAAATGCATT TGGTAAAAGT ACCAGTAATG TAGAAAGGGT TCAAAGAGCT TTTAAGGATC AGCTAGAAAC ACAAGCTAAA AAAGTTAAAG TTTATAAAGA TAGCTTAAAA AAAGCGAATG ATACTCTTGA AAAGAATGTT GCTACTAGAT CTAAATTAGT TAAAAGTATT TCACAAGAAG AAGCTAAACT TAGTTCTTTA AAAAAGAAAT ATGGGGAAAA TAACGATGCT GTAAGAAATG TAGAAAAAAG ACTAGAAGAA TATAAAGAAA AACTTGAAAA AGTAGATAGT TCTATTGAAA GTAATTCTAA GAGAATACAA TCTTATTCTA CACAATTAAA TAATGCAGAA GCTGAATTAA ATAAAGCAGC TGCTGCTGCT AAAAAATTTA ATGATGAAGT AGCAAAAAAT AATGGTTTAA AAACTACTTC TAAAAAATTA GAGGAAGCAA GTTCTAACTT TAAAAAATTT GGAGAAGGAG CTAAGAAAGT AGGTTCTTCT TTAACAACAC ATGTAACTTT ACCACTTGCT GGTGTAGCAG CAGCATCTAC TGCTGTAGGT ATGGAGTTTG AGGCTCAGAT GGATAAAGTT GCAGCAATAT CAGGAGCAAC TGGAGAAGAT TTTAAAAAGT TAAAAGCTAA AGCAGAAGAA ATGGGAGCTA AAACTAAATT TAGTGCTAGT GAAGCTGGTC AAGGCTTAGA ATATATGGCT ATGGCGGGTT GGAAAACTGG CGATATGTTA AATGGTATAG AGCCAATTTT AAATTTAGCT ATAGCTTCTG GGGAAGAACT TGGAACAACA TCTGATATAG TTACAGATGC TTTAACTGCG TTTGGATTAA CTGCTAAAGA TGCTGGAATG TTTTCTGATG TTTTAGCAGC TGCATCATCT AATGCTAATA CTAATGTTGG TATGATGGGT GAAACATTTA AATATGCAGC ACCAGTCTGC GGAGCATTAG GTTATAATGC TAAAGATACA GCACTAGCAA TAGGATTAAT GGCTAACTCT GGTATAAAAG CAAGTCAAGC TGGTACAGCA CTTAGAGCAG GATTAACTAA CTTAGTAAAA CCTACAGATT CTATGGCTGC TGTAATGGAT AAATACGGTA TTTCATTAAA AGATAGTAAT GGTAATATGA AGTCTTTCAA AACAGTAATG GAAGACTTAA GAACTAAATT TGGTAAGTTA GATAAATCAA CTCAAGCAGC TGCTGTAAGT ACATTGTTTG GTAAAGAAGC AATGTCAGGT TGGTTAGCAA TAATAAATGC ATCAAGTGCT GATTTTGATA AACTTTCTGG AGCTATTGAT AAAAGTGAAG GTGCTACCGC ACAAATGGCA AAAACTATGA GTGAAAATGC TAAAGGTTCT ATAGCAGAAA TGAAGAGTGC TTTAGAAGGA GCAGCAATAA AAGTTTTTGA AGCATTAGCA CCTTCCATAA CTAAAGTTGC AAATGCAGTT TCTGATTTAG CTACTAAATT TAGTAATTTA AGTCCAGAAA CACAAGAGTT TATTGTTAAA GCAGGATTGG CTGCTATGGC AGCTGGACCG CTTATAAGTG GAGTAGGTAA GTTATCTACT GGAATTGGTG GTGTATTAAA AGTTGGTAGC TTATTAACTA AGGGAATTGG AGCAGTTACA ACTGGAGCAG AAGTATTAAC TGGAGTAACT 
GGAGCTGTCG GAGTAGCAGC TGAAGGAACT GCTGTTGCTA CTGGTGCAGC TGGAACTGCT GTTGCTGGAT TTGGAGCAGT AGCATCTAGT GTTCTTTTAC CACTTGCTGG AGTAGTCGCA GTAGTAGGGG CTGTTGGCTA TGCTAGTTAT AAAACAGCTA AATATTTAAA AGAAGATGCA ACACCAGCTG TAGATTTATT TGCAGATAAA GCTGTTTATA GTACAGAAAC AATTGCTACA AGTCATGGTA AAATGACCAT GCAAGTACAA ACTGATACTA TAAAAATATC TGAGTCTACT AAAAAAAATG TACAATCTTA TTTAGATATG GATAAAAAAG CTAGTGATAG TTTAATGAAT TTAAGAATGA ATAGTGATAA ATTTACTAAT GAAACAAAAG AAACTGTTCT AAAAAACTTT GAAGATATGT CTAAAAAATC AAGCAGTTTA TCTGAAGAAC AAAGAAATGC AATGACAGTA AACTTTAAGA AGTTAGTAAG CGATACTGGA ACTTTAACAC AAAAAAATAA AGATGAAATA ATAAAACAAT ATACAGCTAT GGTTAATGGA ACTAAGAAGT TAACTGAAGA ACAAAAACAA AAAACAATAA AAGATTTTAC TGATACTTTA AACCAGAGCG TTGGATTGTC TAAAAAGCAA TCTATAGAAA TGCAGAAAAT ATATAAAGAT ATGAGTGAAA AAATTAAAGT TGGCATGGAT AAAAAAAGAG AAGAAGAGCT TAAGAAGCAA AAAGAATTTT TTGATAAGTC TAATGCTTTA AGTGAAAAAG AAGAAGCTGA AGCTATTAAA AAAACAGGTG AGTTTTGGGA AAAGCAAAAA TCTAAAATTG ATGAAGGCCA AAAAAGAATA GAAATGATTT ATCAAAAAGC AGCTGAAGAA CATAGAAAAA TTACAGAAGA TGAATTTAAA GCTATAAGTA GTATCCAACA TGGCATGAAA GAGGATGCTG TTAAAACATT AAGTGATAAT GAAGTGGAAG CTAAAGTTAT ATTAGAGAGA ATGAAGGGAA ATGATGAACG TATTACAGCA GAACAAGCTA GTCAACATAT CAAGGAATTA AATAAATCTC GTGATGAAGC TATTAAAGCA GCAAATGAAG AATGTGATAA GCGTATAGCT GAAATAATTA GAATGAGAGA TGAAACTGGC ACATTAACAG CAGAACAAGC TGATAAATGT ATAGAAGATG CTAAAAGGCA AAGAGATGAA ACTGTAAGTG CAGCTGAAGA AACTAGAAAT CAAGCTGTAG ATAAAATAGC TTCTATGAAC TCTAATATTA GAGATAGTGT AAATACTACT ACTGGAGAAG TAAAATCTAA CTGGGATAAA TTAAAAGATT GGTGGGATAA TTGGCATCCT GTCAAAAAAA TTTTTGAAAT TTTCACTAAA CATACAAGTG ATGGAAAATC AGCAGATCAA AACTGGACTG GTAACTCATA TTTCAAAGGA GGATTTACAA CTCTACATGA AAGAGGATAT GAATTATATG ATTTACCTTC AGGATCAAGA ATCTATAATC ATGAATCAAG TGAAGAAATG GTTTTAGAAA CAGCTAGACA AACAGCTCAA GGAGTTATTG AAACAATGCT AGGAAATCAA GAAGGAAATA CTGGTGATAT TATAATTCCA GTTTCTATAG CTGGTGAAGA AATTGATAGA ATAGTAGTTC CTAGAGTAAG TAATAAACTA GCACAAAATA TAAGGGGAAG GAGATAA
Protein sequence | MSDLEKRITA KMVLDDSGYS NTLKGINATL RENKSELKAA TSGLNAFGKS TSNVERVQRA FKDQLETQAK KVKVYKDSLK KANDTLEKNV ATRSKLVKSI SQEEAKLSSL KKKYGENNDA VRNVEKRLEE YKEKLEKVDS SIESNSKRIQ SYSTQLNNAE AELNKAAAAA KKFNDEVAKN NGLKTTSKKL EEASSNFKKF GEGAKKVGSS LTTHVTLPLA GVAAASTAVG MEFEAQMDKV AAISGATGED FKKLKAKAEE MGAKTKFSAS EAGQGLEYMA MAGWKTGDML NGIEPILNLA IASGEELGTT SDIVTDALTA FGLTAKDAGM FSDVLAAASS NANTNVGMMG ETFKYAAPVC GALGYNAKDT ALAIGLMANS GIKASQAGTA LRAGLTNLVK PTDSMAAVMD KYGISLKDSN GNMKSFKTVM EDLRTKFGKL DKSTQAAAVS TLFGKEAMSG WLAIINASSA DFDKLSGAID KSEGATAQMA KTMSENAKGS IAEMKSALEG AAIKVFEALA PSITKVANAV SDLATKFSNL SPETQEFIVK AGLAAMAAGP LISGVGKLST GIGGVLKVGS LLTKGIGAVT TGAEVLTGVT GAVGVAAEGT AVATGAAGTA VAGFGAVASS VLLPLAGVVA VVGAVGYASY KTAKYLKEDA TPAVDLFADK AVYSTETIAT SHGKMTMQVQ TDTIKISEST KKNVQSYLDM DKKASDSLMN LRMNSDKFTN ETKETVLKNF EDMSKKSSSL SEEQRNAMTV NFKKLVSDTG TLTQKNKDEI IKQYTAMVNG TKKLTEEQKQ KTIKDFTDTL NQSVGLSKKQ SIEMQKIYKD MSEKIKVGMD KKREEELKKQ KEFFDKSNAL SEKEEAEAIK KTGEFWEKQK SKIDEGQKRI EMIYQKAAEE HRKITEDEFK AISSIQHGMK EDAVKTLSDN EVEAKVILER MKGNDERITA EQASQHIKEL NKSRDEAIKA ANEECDKRIA EIIRMRDETG TLTAEQADKC IEDAKRQRDE TVSAAEETRN QAVDKIASMN SNIRDSVNTT TGEVKSNWDK LKDWWDNWHP VKKIFEIFTK HTSDGKSADQ NWTGNSYFKG GFTTLHERGY ELYDLPSGSR IYNHESSEEM VLETARQTAQ GVIETMLGNQ EGNTGDIIIP VSIAGEEIDR IVVPRVSNKL AQNIRGRR