Gene CPR_1967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1967 
Symbol 
ID4205992 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2173511 
End bp2174521 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content31% 
IMG OID642566517 
Producthypothetical protein 
Protein accessionYP_699276 
Protein GI110803936 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGAGAG ATAGTAGAAA GAAGAGGAGA AGTAAAAAGA AATGGAAGAC TATAGCAATA 
TTCATTGCTT TTGAATTCAT ATTTACTGCT GTTACAGCAC CTTTTATACT TTTATATGGA
CCATTTAAAA ATGCTAAAAG GACTTATGTA GGTGCAGCAA TGACTAGTTT TAATCATCAG
TGGATGGCAA CTACATTTTT ATCAGATGAA AAAATTAATG ATATATTAAA TTCTAATATT
GAGGATACAA ATACAAATCA TAAAAATACC AATATAAAGA CAAATGTTAA TTTACCAACT
AAACATGATA ATAGTATAGA ATTATACTCT TTTGAAAACT TTAAATATAG TGGATATTAT
ATAGTTGTAA AAGATCCTAC TAGAGTAAAA ATAGGAGTTT CTAAATACCT AGGAGAAGAA
GGACAAACTA CTTCTGAGAT AGCTAGAGAA TACAATGCTG TTGCTGCTGT AAATGGAGGA
GCTTTTACAG ATAAATCTAG TACGGCTCAA TGGACTGGTA ATGGAGGAAC TCCTGCTGGA
ATAGTTATAT CAGAGGGGAA ATTAGTTTAT AAAGATGTAC CAGATGACAA GAAAATTGAG
TTAGTTGGTA TAACAAAAGA AGGAAAAATG ATTGCAGGAA TGTATTCATT TAATAATCTT
AAAGAATTAA ATGTTAAGGA AGCTGTAAGT TTTGGTCCTG TTTTAGTTAA AGAAGGAGAA
CCTACACCTA TGAAAGGTGA TGGTGGATGG GGAGTTGCTC CAAGAACTGC TATGGGACAA
AGAGCTGATG GATCAATAGT AATGTTAGTT ATTGATGGTA GAAGTTTAAC AAGTGGAGGA
GCTACTTTAA AGGAATTACA GGAAGTATTA TTAAATACTT GTAATGTAGT TACTGCTATG
AACCTTGATG GTGGTAAATC AACTACTATG TACTTAAATG GAAAAGTAAT AAATAATCCA
GCATCAAATG TAGGGGAGAG ATCTATTCCT TCAGCTATAA TAGTAAAATA A
 
Protein sequence
MGRDSRKKRR SKKKWKTIAI FIAFEFIFTA VTAPFILLYG PFKNAKRTYV GAAMTSFNHQ 
WMATTFLSDE KINDILNSNI EDTNTNHKNT NIKTNVNLPT KHDNSIELYS FENFKYSGYY
IVVKDPTRVK IGVSKYLGEE GQTTSEIARE YNAVAAVNGG AFTDKSSTAQ WTGNGGTPAG
IVISEGKLVY KDVPDDKKIE LVGITKEGKM IAGMYSFNNL KELNVKEAVS FGPVLVKEGE
PTPMKGDGGW GVAPRTAMGQ RADGSIVMLV IDGRSLTSGG ATLKELQEVL LNTCNVVTAM
NLDGGKSTTM YLNGKVINNP ASNVGERSIP SAIIVK