Gene CPF_0422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0422 
Symbol 
ID4203432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp505111 
End bp506766 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content33% 
IMG OID638081306 
ProductPTS system, IIBC component 
Protein accessionYP_694879 
Protein GI110799845 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
[COG1264] Phosphotransferase system IIB components 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR02003] PTS system, IIBC component 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAGT TTAAATTAGG GTCTTTTGAT TTCTGGCAAA AGTTTGGTAA GGCATTATTA 
GTAGTTGTTG CTGTAATGCC TGCTGCCGGA CTTATGATTT CAATAGGTAA AGTAATGGGA
ATTTACATTG ATGTAGATTT CATAAAAACT ATAGCGAGAG TTATGGAGGA TATAGGTTGG
GCAATAATTG GTAACCTAAA CTTACTATTT GCCGTTGCAA TAGGAGGATC TTGGGCTAAA
GAACGTGCTG GGGGAGCTTT TGCAGGTACC ATAGCATTTG TTCTTATTAA TAGAATCACT
GGTGTAATAT TTAGCGTAAG CAATGCTATG TTAGCTGGTG ACTCAACTGC TACAGTTCAA
TCCTTTACAG GGCAAACACT TTTAGTTAAA GATTACTTTG TTTCAGTGTT AGGTGCTCCA
GCACTTAATA TGGGTGTATT TGTTGGTATT ATTTCAGGGT TCTTAGGAGC AGCATTATAT
AATAAGTACT ATAATTATAA TAAATTACCT AATGCATTAG CATTCTTCAA TGGTAAACGT
TTTGTACCTT TCATGGTTAT AATAGGCTCA ACAATTTCAG CAATAATACT TTCTTTTGTA
TGGCCATTTG TACAATACGG ATTAAATACT TTTGGTCAAT GGATAGCTAC ATCTAAGGAT
ACAGCTCCAA TAATAGCACC ATTCTTATAT GGTACATTAG AGCGTTTATT ATTACCATTT
GGATTACACC ATATGCTTAC AGTTCCAGTA AACTATACAG AGCTTGGTGG AGTTTATCAT
ATACTTACAG GTCCAACAGC AGGACAAGTA GTTGCAGGAC AAGATCCATT ATGGCTTGCT
TGGATAACTG ATTTAAATAA CTTTAAGCAA GCAGGAGATA TAACTTCATA CAATCAACTT
ATAGGGTCAG TGGTACCTGC TCGTTTCAAA GCAGGACAAG TAATACTTTC AAGTGCTTCA
TTAATAGGTG TAGCATTAGC TATGTATAAA AATGTTGATC CAGATAAGAA GAAAAAATAT
AAATCAGTAT TTTTCTCAGC AGCTATAGCA GTATTTTTAA CTGGTGTAAC TGAACCAATT
GAATTTATGT TCATGTTTAT TTCACCAATA TTATATGTAG TTTATGCAGT AATTGCAGGT
TTAGGATTTG CCATAGCTGA TATAATAAAC TTAAGAGTTC ATGCTTTTGG ATTCATAGAG
CTTATAACTC GTTCACCACT TATGATAAAT GCAGGCTTAA CTAAGGATTT AATAAACTTT
ATAATAGTTG CAGTAGTATT CTTCTTCTTA AATTACTTTG TATTTAGTTT CTTAATTAAG
AAATTCAAAA TAGCTACACC AGGACGTATG GGTAATTATA TTGATAATGA AGATGAAAAT
ACAAACAAAG CTTCATCAAA TAGCAAGGCT TCAATGGATG AATTAGCCGT TAAAGCTATT
GAATTACTAG GTGGAAAAGA AAATATAGTA GATGTAGACG CTTGTATGAC TCGTCTTCGT
GTTACTGTTA AAGAGATTGA AAAGGTTGGA GATGAAAAAG CTTGGAAAGA TAATAAAGCC
TTAGGTCTTA TAGTTAAAGA TAAAGGGGTT CAAGCTATAT ATGGTCCTAA AGCAGATGTA
TTAAAATCAG ATATTCAAGA TATCTTAGGT ATTTAG
 
Protein sequence
MKKFKLGSFD FWQKFGKALL VVVAVMPAAG LMISIGKVMG IYIDVDFIKT IARVMEDIGW 
AIIGNLNLLF AVAIGGSWAK ERAGGAFAGT IAFVLINRIT GVIFSVSNAM LAGDSTATVQ
SFTGQTLLVK DYFVSVLGAP ALNMGVFVGI ISGFLGAALY NKYYNYNKLP NALAFFNGKR
FVPFMVIIGS TISAIILSFV WPFVQYGLNT FGQWIATSKD TAPIIAPFLY GTLERLLLPF
GLHHMLTVPV NYTELGGVYH ILTGPTAGQV VAGQDPLWLA WITDLNNFKQ AGDITSYNQL
IGSVVPARFK AGQVILSSAS LIGVALAMYK NVDPDKKKKY KSVFFSAAIA VFLTGVTEPI
EFMFMFISPI LYVVYAVIAG LGFAIADIIN LRVHAFGFIE LITRSPLMIN AGLTKDLINF
IIVAVVFFFL NYFVFSFLIK KFKIATPGRM GNYIDNEDEN TNKASSNSKA SMDELAVKAI
ELLGGKENIV DVDACMTRLR VTVKEIEKVG DEKAWKDNKA LGLIVKDKGV QAIYGPKADV
LKSDIQDILG I