Gene CPR_2333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2333 
SymbolmalQ 
ID4206289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2557517 
End bp2559010 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content31% 
IMG OID642566883 
Product4-alpha-glucanotransferase 
Protein accessionYP_699598 
Protein GI110803610 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1640] 4-alpha-glucanotransferase 
TIGRFAM ID[TIGR00217] 4-alpha-glucanotransferase 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAGAG CAAGTGGAAT AATAATGCAT ATTGCTTCCT TACCTGGAAA GTATGGAATA 
GGTACTTTTG GTAAGGAAGC ATTTGAGTTT GTAGATTTTT TAAAGAAAGC TGGACAAGGA
TATTGGCAAA TATTGCCCTT AGGTCCTACA AGTTATGGGG ATTCACCATA TCAATCATTT
TCAGCCTTTG CAGGAAATCC ATATTTTATA GATTTTGATA TTTTAAATAA AGAAGGATTA
CTTAATAAAA AAGATTACCA AGGAATTAAT TTTGGAAATG ACCCAGAAAA AATAGATTAT
GCTTTATTAT TTGACAAGAA GATGAGAGTG TTAAGAATAG CATATGAAAA ATCTTTAGAT
AAAAACAAAG AAGAAATTGA AAAGTTTAGA GAAGAAAATA AACTTTGGCT TGAAGATTAT
GCTTTATATA TGGCAATCAA AAATGAAAAT GAATTAGTAA GTTGGCAAGA ATGGGATGAA
AAATTAAGAT TAAGAGATAA AAAGATCTTA GAAGAATATA AAGTTAAATT AGAAAAAGAA
ATAAACTACT GGGTATTCTT ACAATATAAT TTCTTTAAGC AATGGAATGA ATTAAAAGAG
TATGCAAATA GTTTTGGAAT TAAGATAATT GGAGATATGC CTATATATGT TGCAGAGGAT
AGTGCAGATG TTTGGGCAAA TCCAAAAGCA TTTTTATTAG ATGAAAATAA TATTCCTAAA
AAGGTTGCTG GATGTCCACC AGATGCTTTT TCAGAAACAG GTCAATTATG GGGAAATCCT
ATATATGATT GGAATTACAT GGATGACACA GGGTATTCTT GGTGGATTGA TAGAGTAAGA
GAAAGCTTTA AGCTTTACGA CATATTAAGA ATAGATCACT TTAGAGGATT TGAAGCTTAC
TGGCAAATAC CATATGGAGA TGAAACTGCT GTAAATGGTG AGTGGGTTAA AGGCCCTGGA
ATAAAATTAT TTAATGCAAT TAAGGAAGAG TTAGGAGAGG TCAATGTAAT AGCAGAAGAC
CTTGGTTATT TAACTCAAGA GGTTATAGAT TTTAGAAATG AAACTGGATT CCCAGGAATG
AAGGTTTTAC AATTTGCCTT TGATTCTAGG GAAGAAAGTG ATTATCTTCC ACATAATTAT
CCAGTTAACT CAATAGCTTA TACAGGTACT CATGATAATG ATACATTTAG AGGTTGGTTT
GAAGTTACAG GAAATAGAGA AGATGTGGAA TATTCTAAAA AATATTTAAA ACTTACTGAA
GAGGAAGGGT ATAACTGGGG ATTTATCAGA GGAGTTTGGA GCAGTGTATC GCATACAGCG
ATAGCTCTAA TGCAAGATTT CTTAAACTTA GGAAATGAGG CAAGAATAAA CTATCCATCT
ACTCTTGGTG GCAATTGGCA ATGGAGAGTT AAAGATGATG CTCTAACTGA TGAATTAGCA
GAGAAGATAT ATGATATAAC AAAATTATAT GGAAGGGTGA ATATCAATGA ATAA
 
Protein sequence
MRRASGIIMH IASLPGKYGI GTFGKEAFEF VDFLKKAGQG YWQILPLGPT SYGDSPYQSF 
SAFAGNPYFI DFDILNKEGL LNKKDYQGIN FGNDPEKIDY ALLFDKKMRV LRIAYEKSLD
KNKEEIEKFR EENKLWLEDY ALYMAIKNEN ELVSWQEWDE KLRLRDKKIL EEYKVKLEKE
INYWVFLQYN FFKQWNELKE YANSFGIKII GDMPIYVAED SADVWANPKA FLLDENNIPK
KVAGCPPDAF SETGQLWGNP IYDWNYMDDT GYSWWIDRVR ESFKLYDILR IDHFRGFEAY
WQIPYGDETA VNGEWVKGPG IKLFNAIKEE LGEVNVIAED LGYLTQEVID FRNETGFPGM
KVLQFAFDSR EESDYLPHNY PVNSIAYTGT HDNDTFRGWF EVTGNREDVE YSKKYLKLTE
EEGYNWGFIR GVWSSVSHTA IALMQDFLNL GNEARINYPS TLGGNWQWRV KDDALTDELA
EKIYDITKLY GRVNINE