Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_0422 |
Symbol | |
ID | 4203432 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | + |
Start bp | 505111 |
End bp | 506766 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 638081306 |
Product | PTS system, IIBC component |
Protein accession | YP_694879 |
Protein GI | 110799845 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific [COG1264] Phosphotransferase system IIB components |
TIGRFAM ID | [TIGR00826] PTS system, glucose-like IIB component [TIGR02003] PTS system, IIBC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGT TTAAATTAGG GTCTTTTGAT TTCTGGCAAA AGTTTGGTAA GGCATTATTA GTAGTTGTTG CTGTAATGCC TGCTGCCGGA CTTATGATTT CAATAGGTAA AGTAATGGGA ATTTACATTG ATGTAGATTT CATAAAAACT ATAGCGAGAG TTATGGAGGA TATAGGTTGG GCAATAATTG GTAACCTAAA CTTACTATTT GCCGTTGCAA TAGGAGGATC TTGGGCTAAA GAACGTGCTG GGGGAGCTTT TGCAGGTACC ATAGCATTTG TTCTTATTAA TAGAATCACT GGTGTAATAT TTAGCGTAAG CAATGCTATG TTAGCTGGTG ACTCAACTGC TACAGTTCAA TCCTTTACAG GGCAAACACT TTTAGTTAAA GATTACTTTG TTTCAGTGTT AGGTGCTCCA GCACTTAATA TGGGTGTATT TGTTGGTATT ATTTCAGGGT TCTTAGGAGC AGCATTATAT AATAAGTACT ATAATTATAA TAAATTACCT AATGCATTAG CATTCTTCAA TGGTAAACGT TTTGTACCTT TCATGGTTAT AATAGGCTCA ACAATTTCAG CAATAATACT TTCTTTTGTA TGGCCATTTG TACAATACGG ATTAAATACT TTTGGTCAAT GGATAGCTAC ATCTAAGGAT ACAGCTCCAA TAATAGCACC ATTCTTATAT GGTACATTAG AGCGTTTATT ATTACCATTT GGATTACACC ATATGCTTAC AGTTCCAGTA AACTATACAG AGCTTGGTGG AGTTTATCAT ATACTTACAG GTCCAACAGC AGGACAAGTA GTTGCAGGAC AAGATCCATT ATGGCTTGCT TGGATAACTG ATTTAAATAA CTTTAAGCAA GCAGGAGATA TAACTTCATA CAATCAACTT ATAGGGTCAG TGGTACCTGC TCGTTTCAAA GCAGGACAAG TAATACTTTC AAGTGCTTCA TTAATAGGTG TAGCATTAGC TATGTATAAA AATGTTGATC CAGATAAGAA GAAAAAATAT AAATCAGTAT TTTTCTCAGC AGCTATAGCA GTATTTTTAA CTGGTGTAAC TGAACCAATT GAATTTATGT TCATGTTTAT TTCACCAATA TTATATGTAG TTTATGCAGT AATTGCAGGT TTAGGATTTG CCATAGCTGA TATAATAAAC TTAAGAGTTC ATGCTTTTGG ATTCATAGAG CTTATAACTC GTTCACCACT TATGATAAAT GCAGGCTTAA CTAAGGATTT AATAAACTTT ATAATAGTTG CAGTAGTATT CTTCTTCTTA AATTACTTTG TATTTAGTTT CTTAATTAAG AAATTCAAAA TAGCTACACC AGGACGTATG GGTAATTATA TTGATAATGA AGATGAAAAT ACAAACAAAG CTTCATCAAA TAGCAAGGCT TCAATGGATG AATTAGCCGT TAAAGCTATT GAATTACTAG GTGGAAAAGA AAATATAGTA GATGTAGACG CTTGTATGAC TCGTCTTCGT GTTACTGTTA AAGAGATTGA AAAGGTTGGA GATGAAAAAG CTTGGAAAGA TAATAAAGCC TTAGGTCTTA TAGTTAAAGA TAAAGGGGTT CAAGCTATAT ATGGTCCTAA AGCAGATGTA TTAAAATCAG ATATTCAAGA TATCTTAGGT ATTTAG
|
Protein sequence | MKKFKLGSFD FWQKFGKALL VVVAVMPAAG LMISIGKVMG IYIDVDFIKT IARVMEDIGW AIIGNLNLLF AVAIGGSWAK ERAGGAFAGT IAFVLINRIT GVIFSVSNAM LAGDSTATVQ SFTGQTLLVK DYFVSVLGAP ALNMGVFVGI ISGFLGAALY NKYYNYNKLP NALAFFNGKR FVPFMVIIGS TISAIILSFV WPFVQYGLNT FGQWIATSKD TAPIIAPFLY GTLERLLLPF GLHHMLTVPV NYTELGGVYH ILTGPTAGQV VAGQDPLWLA WITDLNNFKQ AGDITSYNQL IGSVVPARFK AGQVILSSAS LIGVALAMYK NVDPDKKKKY KSVFFSAAIA VFLTGVTEPI EFMFMFISPI LYVVYAVIAG LGFAIADIIN LRVHAFGFIE LITRSPLMIN AGLTKDLINF IIVAVVFFFL NYFVFSFLIK KFKIATPGRM GNYIDNEDEN TNKASSNSKA SMDELAVKAI ELLGGKENIV DVDACMTRLR VTVKEIEKVG DEKAWKDNKA LGLIVKDKGV QAIYGPKADV LKSDIQDILG I
|
| |