Gene CPF_2255 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2255 
Symbol 
ID4202718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2504207 
End bp2505217 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content32% 
IMG OID638083120 
Producthypothetical protein 
Protein accessionYP_696678 
Protein GI110798572 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.473897 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGAGAG ATAGTAGAAA GAAGAGGAGA AGTAAAAAGA AGTGGAAGAC TATAGCGATA 
TTTATTGCTT TTGAATTCAT ATTCACTGCT GTTACAGCGC CTTTTATACT TTTATATGGA
CCATTTGAAA ATGCTAAAAG GACTTATGTA GGAGCAGCCA TGACTAGTTT TAGTCATCAG
TGGATGGCAA CTACATTTTT ATCGGATGAA AAAATTAATG AGATATTAAA TTCTAATATT
GAGGATACAA ATACAAATCA TAAAAATACA AATATAAAGA CAAATGTTAA TTTACCAACT
AAACATGATA ATAGTATAGA ATTATACTCT TTTGAAAACT TTAAATATAG TGGATATTAT
ATAGTGGTAA AAGATCCTAC TAGAGTAAAA ATAGGAGTTT CTAAATATCT TGGAGAAGAA
GGACAAACTA CTTCTGAAAT AGCTAGGGAA TACAATGCTG TTGCGGCTGT AAATGGAGGA
GCTTTTACAG ATAAATCTAG TACTGCTCAA TGGACTGGTA ATGGTGGAAC TCCTGCTGGA
ATAGTTATAT CAGAGGGGAA ATTGGTTTAT AAAGATGTAC CAGACGATGA AAAAATTGAG
TTAGTTGGCA TAACAAAAGA AGGAAAAATG ATTGCAGGTA TGTATTCATT TAATAATCTT
AAAGAATTAA ATGTTAAAGA AGCTGTAAGT TTTGGACCTG TTTTAGTTAA AGAAGGAGAA
CCTACACCTA TGAAAGGTGA TGGTGGATGG GGAGTTGCTC CAAGAACTGC TATGGGACAA
AGAGCTGATG GATCAATAGT AATGTTAGTT ATTGATGGTA GAAGTTTAAC AAGTGGAGGA
GCTACTTTAA AGGAATTACA GGAAGTATTA TTAAATACTT GTAATGTAGT TACTGCTATA
AACCTTGATG GTGGTAAATC AACTACTATG TACTTAAATG GAAAAGTAAT AAATAATCCA
GCATCAAATG TAGGGGAGAG ATCTATTCCT TCAGCTATAA TAGTAAAATA A
 
Protein sequence
MGRDSRKKRR SKKKWKTIAI FIAFEFIFTA VTAPFILLYG PFENAKRTYV GAAMTSFSHQ 
WMATTFLSDE KINEILNSNI EDTNTNHKNT NIKTNVNLPT KHDNSIELYS FENFKYSGYY
IVVKDPTRVK IGVSKYLGEE GQTTSEIARE YNAVAAVNGG AFTDKSSTAQ WTGNGGTPAG
IVISEGKLVY KDVPDDEKIE LVGITKEGKM IAGMYSFNNL KELNVKEAVS FGPVLVKEGE
PTPMKGDGGW GVAPRTAMGQ RADGSIVMLV IDGRSLTSGG ATLKELQEVL LNTCNVVTAI
NLDGGKSTTM YLNGKVINNP ASNVGERSIP SAIIVK