Gene CPR_2193 details

Gene Information

Locus tag: CPR_2193
Symbol:
ID: 4204871
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 2421710
End bp: 2422837
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 27%
IMG OID: 642566743
Product: glycosyltransferase
Protein accession: YP_699493
Protein GI: 110801920
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
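
The coordinate and length fields above are related by simple arithmetic, which can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and a coding sequence that ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of three and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(2421710, 2422837)
print(length)                     # 1128
print(protein_length_aa(length))  # 375
```

Both results agree with the Gene Length and Protein Length fields of the record.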


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00106475
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATAT GTATTGATGG AAGAGCAGCT ACCTTATACC GAGGTACTGG AATTGGTAAC 
TACACTTATC AAATAATAAA TAATCTACAC CAGATAGATT TTTTAAATGA ATATAACATA
CTTACTCCAG AAGCATCCTC TCTAAAATTA CCTAAAAAGA ATAACTTTAA TTATTTATCC
TCAAGTACAA ATGATAAAAA AAACTTTTGG GAATTTATAA ATACAAAAAA TCCTAAAGAA
AATATTATAG GTGATGTATA TCATATTCCT CAAAATGGTA TAGGCTTTTC AAAACCTAAT
GATATAAAGA CCGTAATTAC CCTACATGAT ATTATCCCTA TGAAAATGCC TGATACAGTT
AGTGAAACAT TCTTAAAAAT TTTTAATGAA AATATACAAA ATATTTTAGA TAACACAGAT
GGAATTATTA CAGTTTCTAA TTTTTCTAAA GAAGATATAA GTAAAACTTT TTCTTACCCA
AAGGAAAAAA TATTTGTTAC TCATTTGGCT GCTGAAGAAA TATATACCCC ATTAAATAAA
TTTCACTCAT CTCAATATCT AAAAAAACAC TATGGTATAG ATAGGGATTT TTTATTATAT
GTAGGTGGTT TTAGCCCTAG AAAAAATATT TTAGGACTTA TAGATGCCTT TAACTTAGTA
AAAAATTCTT ATAAAAGAGA TTTAAAGCTT GTTATTATAG GAACTAAGGG ACCTTCATAT
GAAATTTACA GAAAAAAAGT AGATGAGTTA AATTTATCTT CCTCTGTTAT TTTTACTGGA
TTTATTCCTA TAGATGATAT GCCTATATTC TATAGTGCTA GTAAAGCTTT AGTTTATCCT
TCTTTTTATG AAGGATTTGG ACTACCTCCT ATAGAATGCA TGGCCTGCGG TACTCCTGTA
ATAGCATCTA ATTTAACCTC AATGCCTGAG GTATGCCAAG ATGCTGCTCT TTTAGTTGAT
CCTTATGATG TTGATGAAAT AAAGGAAAAT ATACTAACTT TATTAAATAA CCATAAATTT
TATTCCCTTA TGATTTATAA AGGGCTAAGT CATTCAAGTA AATTTAATTG GAAAAAAACT
GCTTATAATA CACTTGAGGT CTACAAACAT ATTTATTCAC AAATTTAA
 
Protein sequence
MKICIDGRAA TLYRGTGIGN YTYQIINNLH QIDFLNEYNI LTPEASSLKL PKKNNFNYLS 
SSTNDKKNFW EFINTKNPKE NIIGDVYHIP QNGIGFSKPN DIKTVITLHD IIPMKMPDTV
SETFLKIFNE NIQNILDNTD GIITVSNFSK EDISKTFSYP KEKIFVTHLA AEEIYTPLNK
FHSSQYLKKH YGIDRDFLLY VGGFSPRKNI LGLIDAFNLV KNSYKRDLKL VIIGTKGPSY
EIYRKKVDEL NLSSSVIFTG FIPIDDMPIF YSASKALVYP SFYEGFGLPP IECMACGTPV
IASNLTSMPE VCQDAALLVD PYDVDEIKEN ILTLLNNHKF YSLMIYKGLS HSSKFNWKKT
AYNTLEVYKH IYSQI
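
The sequences above are printed in blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one way to strip the spacing, compute the GC fraction, and translate the gene sequence. It assumes the standard codon assignments, which translation table 11 shares with the standard genetic code (the two differ in permitted start codons, not in amino acid assignments). The helper names are hypothetical, and unrecognized codons are reported as `X` as an illustrative choice.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAAAATAT GTATTGATGG AAGAGCAGCT"))  # MKICIDGRAA
```

Applied to the full gene sequence, `translate` can be compared against the listed protein sequence, and `gc_fraction` against the GC content field.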