Gene CPF_2803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2803 
Symbol 
ID4201180 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp3059933 
End bp3061384 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content29% 
IMG OID638083671 
ProductMazG family protein 
Protein accessionYP_697168 
Protein GI110800524 
COG category[R] General function prediction only 
COG ID[COG3956] Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain 
TIGRFAM ID[TIGR00444] MazG family protein 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.311825 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTAAGA TAATGGGTCT AGGGCCAGGT GCCTATGAAG CATTAACAAT AGGTGCTTTA 
AAAGAATTAA AAAACAATAA AAATATATAT TTTAGAACAG AAAAACATCC TACAGTGGAT
TTCTTAAAAG ATGAAGGAAT TAAGTTTGAA TCATATGATC ATGCATATGA AAAATATGAT
AGCTTTGATG ATGTATATAA ATATATTGCA GAGGATTTAA TAACTAAAAT TAAGGATGAT
GAGGATTTAA TATATGCAGT ACCAGGTCAT CCTTTAGTGG CAGAAAAATC AGTAATTAAT
TTAATTGAAT TATGTAAGGA AAATAATATT CAGTATGAAG TTTTGCCAGC AGTTAGCTTT
GTAGATGCTA TGATGGAAGC TTTGCAAGTA GATCCAATAG AGGGTGTAAA AATAATTGAT
GCCTTTGACA TGAAAAATCA AATATTAGAT AAGCGTGTTG GAACTATAAT AACTCAAGTT
TATAATAATT TCATAGCTTC AGAGGTTAAG TTAAGACTTC TAGAAGGATA TGAAGATGAT
ACTGAAATAA TATTTGTAAG AGCGGCTGGA GTTGAAGGTT TAGAGAGTAT AAGAAAAATA
CCTTTATATG AATTAGATTG GCAAGAAGAT ATAGATTATT TAACTTCAAT ATATATACCT
AAGGATTTAG GAAATAAAAA AGATTTTCAA GATTTATTAG ATATTATAGA AACTTTAAGA
AACCCAGGTG GATGTCCTTG GGATAGAGAG CAAACTCATG AGAGTTTAAA GAGTGCTCTT
TTAGAAGAAT GCTATGAAGT AATTGATGCT ATTGAAAATG AGGATGAGGA TGCTTTAATA
GAAGAGCTTG GTGATGTTTT ACTTCAAGTT GTATTCCACG CTTCTATAGG TAAGGAAGAT
GGATATTTTG ATATTATGGA TGTAATAGGT GGAATATCAA ATAAAATGAT AAATAGACAT
CCTCATGTCT TTGGAAATGA AGAAGTAAAT ACTTCAGAGC AAGTTTTAGT AAATTGGGAT
GAAATTAAGA AAGAAGAAAA AGGTATTAAA ACTTTAACAG AAGAAATGCA AAACATAGCT
AAATCTCTTC CAGCTACAAC AAGAGCTTAT AAGGTTCAAA AGAAAGCAAA AAAAGTAGGC
TTTGACTGGG ATGATGTAAA TTGTGCTATG GATAAAGTAA AAGAAGAATT AAATGAAATA
AAAGAGGTTT ATAATTGTGA AGATAAGTCA ATAATAGAAG GTGAAGTAGG TGACTTACTT
TTTGCTTGTA TAAATGTAGC AAGATTCCTA GAAGTTGATG GAGAATTAGC TTTAGATAAA
ACAATTAAAA AGTTTATAAA GAGATTCTCT TATATTGAAA ATGAAGCAAT AAAAAACAAT
AAGAATTTAA AAGATATGAC CTTAGAAGAG ATGGATAAAC TATGGGAAGA GGCTAAAACA
AGTGAAAAAT AA
 
Protein sequence
MLKIMGLGPG AYEALTIGAL KELKNNKNIY FRTEKHPTVD FLKDEGIKFE SYDHAYEKYD 
SFDDVYKYIA EDLITKIKDD EDLIYAVPGH PLVAEKSVIN LIELCKENNI QYEVLPAVSF
VDAMMEALQV DPIEGVKIID AFDMKNQILD KRVGTIITQV YNNFIASEVK LRLLEGYEDD
TEIIFVRAAG VEGLESIRKI PLYELDWQED IDYLTSIYIP KDLGNKKDFQ DLLDIIETLR
NPGGCPWDRE QTHESLKSAL LEECYEVIDA IENEDEDALI EELGDVLLQV VFHASIGKED
GYFDIMDVIG GISNKMINRH PHVFGNEEVN TSEQVLVNWD EIKKEEKGIK TLTEEMQNIA
KSLPATTRAY KVQKKAKKVG FDWDDVNCAM DKVKEELNEI KEVYNCEDKS IIEGEVGDLL
FACINVARFL EVDGELALDK TIKKFIKRFS YIENEAIKNN KNLKDMTLEE MDKLWEEAKT
SEK