Gene CPF_0084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0084 
SymboliolD 
ID4201952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp99329 
End bp101248 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content34% 
IMG OID638080965 
Productmyo-inositol catabolism protein IolD 
Protein accessionYP_694548 
Protein GI110801040 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3962] Acetolactate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATGA CTACAGGCCA AGCGCTAGTA AAATTTTTAG ATAATCAATA CGTTTCTTTT 
GATGGTAAAG AAGAAAAATT TGTTGATGGG ATATTCACTA TATTTGGACA TGGTATTGTA
GTTGGATTGG GAGAAGCTTT ATATGAAAAT CCAGGTGAAC TTAAAGTTTA TCAAGGAAGA
AATGAGCAAG GTATGGCTCA TGTTTCAACA GCTTTCGCAA AACAAAACAA TCGTAGAAAA
ATAATAGCTT GTTCAAGTTC AGTTGGCCCT GGAGCTGCTA ATATGGTTAC TGCAGCAGCT
ACAGCAACAG TGAATAATAT ACCATTATTA TTATTACCAG GAGATTCATT TGCTACAAGA
CAACCAGATC CTGTACTTCA ACAGATAGAA CAGTCCTATA ATTTAGGAAT AACAACAAAT
GACGCATTTA AGCCAGTTTG TAAGTATTGG GATAGAATAA ATAGACCAGA GCAACTTATG
TCAGCAATGA TAAATGCTAT GAGAGTATTA ACAGATCCTG CTGAAACAGG TGCTGTTTGT
ATAGCATTAC CACAAGATGT TCAAGGAGAA GCTTATGACT TCCCAGAATA TTTCTTTAAG
AAAAGGGTTC ATAGGATAAC TAGACCTCTA GCTGTACAAG AAGAGTTTGA AGAGGCATTA
GATATAATAA TGAATAAAAA GAAACCTATA ATAATTTGTG GTGGTGGAGT TAGATACTCA
GAAGCAGGGG AAGCTTTAGT TGATTTTGCA GAGGAGTTTA ATATTCCAAT ATGCGAAACA
CAAGCAGGTA AGAGTGCTAT TAAATCAAGT CACCCATTAA ATTTAGGTGG TATAGGGGTT
ACTGGTAACC TTGCTGCAAA TATGATTGCT AAAGATGCTG ATTTAGTAAT TGGTGTTGGA
ACAAGATTCT CAGACTTCAC AACTTCATCA AAATCATTAT TTAAGAATCC AGAAGTTGAT
TTTATAACTG TAAACGTATC AAAATTCCAT GGTGAAAAAA TGGATGCTCA CAAGATAATA
GGTGATGCAA AGGTATGTAT AGAAGAATTA CAAGCTATGT TAGAAGCAAA TAACTATGAA
TCATCTTATG AAGATGAAAT AGTAAATGCT AAAAAAGCTT GGAAAGAAGA GATGAAGAGA
TTAACAAATA TTAAATATGA TGAAAACTTT GAAGCTTTAA TAAAACCTAA GAGAGAAGGA
TGCATAGAAG AGTTTAGTGT ATTAACAGGA GGTTTAATCA CTCAAACAGC TGCTTTAGGA
GTTATTAGAG AAACAATAGA TGATGATGCC ATAGTAGTTG GAGCTGCTGG AAGTTTACCA
GGAGATCTTC AAAGAATGTG GGAAACAGAT GTTAGAGATT CATACCACAT GGAATATGGA
TATTCATGTA TGGGATATGA AATAGCAGCT ACATTAGGAG CTAAGTTAGC AGAGCCAGAA
AGAGAAGTTT ACTCAATGGT TGGTGATGGA AGTTACTTAA TGCTTCATTC AGAGATGGTA
ACAGCTATGC AAGAGCAAAA GAAGATAAAT ATACTTTTAT TTGATAACTG TGGATTTGGA
TGTATAAACA ATCTACAAAT GTCAAATGGT ATAGGAAGCC TTGCTACAGA GTTTAGATAT
AGAGATGAAA ATGGTAAGTT AGAAGGGGGA TTAATCCCTA TAGATTTTGC TAAGGTAGCT
AGTGGGTATG GATTGAAAAC ATATTCAGTA AAAACTTTAG CTCAATTAAA AGAAGCTTTA
GAAGATGCTA AAAAGCAAAA GGTTTCAACT TTAATAGACA TAAAGGTGTT ACCAAAAACA
ATGACAGATG GCTATGATGC ATGGTGGCAT GTTGGAATAG CTGGAGAATC AAAAATTGAT
GGTGTAAATA AGGCGTTTGA GAACAAAGAG AAAAATTTAA AAGCCGCTAG AAGATATTAA
 
Protein sequence
MRMTTGQALV KFLDNQYVSF DGKEEKFVDG IFTIFGHGIV VGLGEALYEN PGELKVYQGR 
NEQGMAHVST AFAKQNNRRK IIACSSSVGP GAANMVTAAA TATVNNIPLL LLPGDSFATR
QPDPVLQQIE QSYNLGITTN DAFKPVCKYW DRINRPEQLM SAMINAMRVL TDPAETGAVC
IALPQDVQGE AYDFPEYFFK KRVHRITRPL AVQEEFEEAL DIIMNKKKPI IICGGGVRYS
EAGEALVDFA EEFNIPICET QAGKSAIKSS HPLNLGGIGV TGNLAANMIA KDADLVIGVG
TRFSDFTTSS KSLFKNPEVD FITVNVSKFH GEKMDAHKII GDAKVCIEEL QAMLEANNYE
SSYEDEIVNA KKAWKEEMKR LTNIKYDENF EALIKPKREG CIEEFSVLTG GLITQTAALG
VIRETIDDDA IVVGAAGSLP GDLQRMWETD VRDSYHMEYG YSCMGYEIAA TLGAKLAEPE
REVYSMVGDG SYLMLHSEMV TAMQEQKKIN ILLFDNCGFG CINNLQMSNG IGSLATEFRY
RDENGKLEGG LIPIDFAKVA SGYGLKTYSV KTLAQLKEAL EDAKKQKVST LIDIKVLPKT
MTDGYDAWWH VGIAGESKID GVNKAFENKE KNLKAARRY