Gene CPF_2647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2647 
SymbolmalQ 
ID4201910 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2915110 
End bp2916603 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content31% 
IMG OID638083513 
Product4-alpha-glucanotransferase 
Protein accessionYP_697027 
Protein GI110801309 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1640] 4-alpha-glucanotransferase 
TIGRFAM ID[TIGR00217] 4-alpha-glucanotransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAGAG CAAGTGGAAT AATAATGCAT ATTGCTTCCT TACCTGGAAA GTATGGAATA 
GGTACTTTTG GTAAGGAAGC ATTTGAGTTT GTAGATTTTT TAAAGAAAGC TGGACAAGGA
TGTTGGCAAA TATTGCCCTT AGGTCCTACA AGTTATGGTG ATTCACCATA TCAATCATTT
TCAGCCTTTG CAGGAAATCC ATATTTTATA GATTTTGATA TTTTAAATAA AGAAGGATTA
CTTGATAAAA AAGATTACCA AGGAATTAAT TTTGGAAATG ACCCAGAAAA AATAGATTAT
GCTTTATTAT TTGACAAGAA GATGAGAGTG TTAAGAGTAG CATATGAAAA ATCTTTAGAT
GAGAATAAAG AAGAAATTGA AAAGTTTAGA GAAGAAAATA AACTTTGGCT TGAAGATTAT
GCTTTATATA TGGCAATCAA AAATGAAAAC GAATTAGTAA GTTGGCAAGA ATGGGATGAA
AAATTAAGAT TAAGAGATAA AAAGACCTTA GAAGAATATA AAGTTAAATT AGAAAAAGAA
ATAAACTACT GGGTATTCTT ACAATATCAT TTCTTTAAGC AATGGAATAA ATTAAAAGAG
TATGCAAATA GTTTTGGAAT TAAGATAATT GGAGATATGC CTATATATGT TGCAGAGGAT
AGTGCAGATG TTTGGGCAAA TCCAAAAGCA TTTTTATTAG ATGAAAATAA TATTCCTAAA
AAGGTTGCTG GATGTCCACC AGATGCTTTT TCAGAAACAG GTCAATTATG GGGAAATCCT
ATATATGATT GGAGCTACAT GGATGACACA GGATATTCTT GGTGGATTGA TAGAGTAAGA
GAAAGCTTTA AGCTTTATGA CATATTAAGA ATAGATCACT TTAGAGGGTT TGAAGCTTAC
TGGCAAATAC CATATGGAGA TGAAACTGCT GTAAATGGTG AGTGGGTTAA AGGCCCTGGA
ATAAAATTAT TTAATGCAAT TAAAGAAGAG TTAGGTGAGG TTAATGTAAT AGCAGAAGAC
CTTGGTTATT TAACTCAAGA GGTTATAGAT TTTAGAAATG AAACTGGATT CCCAGGAATG
AAGGTTTTAC AATTTGCCTT TGATTCTAGA GAAGAAAGTG ATTATCTTCC ACATAATTAT
CCAGTTAACT CAATAGCTTA TACAGGTACT CATGATAATG ATACATTTAG AGGTTGGTTT
GAAGTTACAG GAAATAGAGA AGATGTGGAA TATTCTAAAA AATATTTAAA ACTTACTGAA
GAGGAAGGGT ATAACTGGGG GTTTATCAGA GGAGTTTGGA GCAGTGTATC ACATACAGCT
ATAGCTCTAA TGCAAGATTT CTTAAACTTA GGAAATGAGG CAAGAATAAA CTATCCATCT
ACTCTTGGTG GCAATTGGCA ATGGAGAGTT AAATATGATG CTCTAACTGA TGAATTAGCA
GAGAAAATAT ATGATATAAC AAAATTATAT GGAAGGGTGA ATATTAATGA ATAA
 
Protein sequence
MRRASGIIMH IASLPGKYGI GTFGKEAFEF VDFLKKAGQG CWQILPLGPT SYGDSPYQSF 
SAFAGNPYFI DFDILNKEGL LDKKDYQGIN FGNDPEKIDY ALLFDKKMRV LRVAYEKSLD
ENKEEIEKFR EENKLWLEDY ALYMAIKNEN ELVSWQEWDE KLRLRDKKTL EEYKVKLEKE
INYWVFLQYH FFKQWNKLKE YANSFGIKII GDMPIYVAED SADVWANPKA FLLDENNIPK
KVAGCPPDAF SETGQLWGNP IYDWSYMDDT GYSWWIDRVR ESFKLYDILR IDHFRGFEAY
WQIPYGDETA VNGEWVKGPG IKLFNAIKEE LGEVNVIAED LGYLTQEVID FRNETGFPGM
KVLQFAFDSR EESDYLPHNY PVNSIAYTGT HDNDTFRGWF EVTGNREDVE YSKKYLKLTE
EEGYNWGFIR GVWSSVSHTA IALMQDFLNL GNEARINYPS TLGGNWQWRV KYDALTDELA
EKIYDITKLY GRVNINE