Gene CPR_2197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2197 
Symbol 
ID4204982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2425890 
End bp2427035 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content28% 
IMG OID642566747 
Productglycosyltransferase 
Protein accessionYP_699497 
Protein GI110803134 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.122722 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATAACT TAATAGATGC AAGAGGTGCA ACTTGGTATA GAGGGACAGG TATAGGAACA 
TACACATATA ACCTATTAAC AAACATATTA AAATTAGATA AGGATTCATC TTATGATTTA
TTATGCTCTG TAGAAAAAAT ACCTGAGTTT GAAAGTGAAA ATACTAAAAT AATAATGTCT
TCTAGAAAAC ATCAACGTTT TTTTGAAAAT TACTATATAC CATCCTATGG AATAAAAGAG
GATATAGATT TGCTTCACCT TCCTCAAAAT GGACTTGGAC TTTCCTCTGA AGGAAACTTT
GCAAAATTAG TTACTATTCA TGACTTAATT CCTTACATTT TACCTGAAAC TGTTGGTAAA
GGTTACTTAA AAAAATTTAT TCAAAGTATG CCTGAAATAA TTGATAATTC AACAGGAATA
ATCACTGTAT CAGAATACTC TAAAAGTGAT ATTATAAGAT TTTTTCCACA CTTTCCTGCT
GAAAATATTT TTGTAACTCC ACTGGCAGCT AATGAAAATT ATAAACCTTT AGATAAAGAA
AAATGCCTAT TTGATGTTAA TAAGAGATTT AATTTTAATG GACCATTTAT TGTGTATATA
GGTGGCTTTA GTTTAAGAAA AAATGTTAAG GGACTAGTTG ATGCTTTTAA CAGTATTCAT
AAAAATATTG ATGAAAATTA TAAACTTTTA ATTGTAGGTG GATTAAGAGA CCAGGGATTA
AAATTAAAGG CTTATACTGA AAGTTTACCT ATAAAAGATA AGATTATTTT TACTGGATTT
ATAGAGGATG AATATTTACC AACCTTGTAT AACGCAACTA CTCTCTTTGT CTATCCTTCT
TTATATGAAG GTTTTGGGCT ACCTCCTTTA GAAGCTATGA GTTGTAAGAC TGCCGTTTTA
ACCTCAAATA TAACTTCTAT TCCTGAAGTG GTTCCCTTTA AAGAAAGCTT ATTTAACCCA
AATAACCCTA AGGAGTTATC TCTAAAACTA GAAAATCTAT TAAATGATTC AAAACTTAGA
AATAATTTAG AAAATATATG TTTTGAAAGA AGCAAAGAAT TTACTTGGGA GAAAACAGCA
AAGAAAACCT TAGATGTATA TAAGAAGGTA ATAGAAATCT CTAAAAACTC ATCAATAGGA
GAATAA
 
Protein sequence
MYNLIDARGA TWYRGTGIGT YTYNLLTNIL KLDKDSSYDL LCSVEKIPEF ESENTKIIMS 
SRKHQRFFEN YYIPSYGIKE DIDLLHLPQN GLGLSSEGNF AKLVTIHDLI PYILPETVGK
GYLKKFIQSM PEIIDNSTGI ITVSEYSKSD IIRFFPHFPA ENIFVTPLAA NENYKPLDKE
KCLFDVNKRF NFNGPFIVYI GGFSLRKNVK GLVDAFNSIH KNIDENYKLL IVGGLRDQGL
KLKAYTESLP IKDKIIFTGF IEDEYLPTLY NATTLFVYPS LYEGFGLPPL EAMSCKTAVL
TSNITSIPEV VPFKESLFNP NNPKELSLKL ENLLNDSKLR NNLENICFER SKEFTWEKTA
KKTLDVYKKV IEISKNSSIG E