Gene Cphy_3356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3356 
Symbol 
ID5741638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp4090732 
End bp4091997 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content40% 
IMG OID641294459 
Product3-isopropylmalate dehydratase, large subunit 
Protein accessionYP_001560448 
Protein GI160881480 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR01343] homoaconitate hydratase family protein
[TIGR02083] 3-isopropylmalate dehydratase, large subunit
[TIGR02086] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCATGA CAATGACTCA AAAGATTTTG GCTGCACATG CAAATCTACC AGAAGTAAAA 
GCTGGACAGT TGATTGAGGC AAATCTTGAC CTAGTGTTAG CAAATGACAT TACAGGTCCG
GTTGCTATTC ATGAAATACA AAGATTAAAA AAGAAAACAG TATTTGATAA AGATAAAATC
GCATTAGTGC CAGATCATTT TACGCCAAAT AAAGATATCA AATCGGCCGA ACATTGTAAG
TGTGTCAGAG AGTTTGCGAA AGAGCATGAC ATTACGAACT ATTTTGAGAT CGGTGAGATG
GGAATTGAGC ATGCCCTCCT ACCAGAAAAA GGACTGATTG TAGCAGGAGA GACTTGTATT
GGAGCTGACT CACATACCTG TACTTATGGA GCACTTGGAG CATTTTCTAC CGGTGTTGGC
AGTACCGATA TGGGTGCTGG TATGATTACT GGAAAAGCTT GGTTTAAAGT TCCAGCTGCA
ATTAAGTTTA TATTAACTGG AGAACCAAAA GAGTGGGTGA GTGGAAAAGA TGTTATTTTA
CATATTATCG GTATGATTGG TGTAGATGGT GCATTGTATA AATCCATGGA ATTTGTAGGA
GCTGGAATTA AGAATTTGAC CATGGATGAT CGATTTACGA TTGCGAATAT GGCAATAGAA
GCCGGTGCTA AGAATGGTAT CTTTCCGGTG GATGATTTAA CCATCTCATA TATGAAGGAA
CATGGGGCAA AGCCATATAC CATCTATGAA GCAGATGAAG ATGCAGAATA CGAGCAGATA
ATTACAATAA ATTTATCAGA ATTAGAACCA ACCGTAGCAT TTCCTCATCT CCCTGAAAAT
ACGAAGACTG TGAAAGAAGC AGGAGAGGTT AGGATAGATC AGGTTGTCAT AGGATCCTGT
ACCAATGGAA GAATTGGTGA TTTAAGAATT GCTGCCAAGG TTTTAGAGGG AAGAAAGGTA
GCAAAAGGGA TGCGAGCTAT TGTATTTCCT GCGACTCAGG CAATTTATTT ACAAGCAATT
GAGGAAGGAT TAATTCAAAC CTTTATTAAA GCAGGTTGTG TAGTAAGTAC CCCAACCTGT
GGACCATGTC TTGGAGGGCA TATGGGGATT CTTGCAGCAG GAGAACGAGC TGCATCCACA
ACGAATCGTA ACTTTGTTGG ACGTATGGGA CACGTAGAAT CTGAAGTATA CCTTTGCAGT
CCTGCAGTCG CAGCAGCAAG TGCAGTAACT GGAAAGATAA GTGAGCCATC CGAACTCTTT
TCATAA
 
Protein sequence
MGMTMTQKIL AAHANLPEVK AGQLIEANLD LVLANDITGP VAIHEIQRLK KKTVFDKDKI 
ALVPDHFTPN KDIKSAEHCK CVREFAKEHD ITNYFEIGEM GIEHALLPEK GLIVAGETCI
GADSHTCTYG ALGAFSTGVG STDMGAGMIT GKAWFKVPAA IKFILTGEPK EWVSGKDVIL
HIIGMIGVDG ALYKSMEFVG AGIKNLTMDD RFTIANMAIE AGAKNGIFPV DDLTISYMKE
HGAKPYTIYE ADEDAEYEQI ITINLSELEP TVAFPHLPEN TKTVKEAGEV RIDQVVIGSC
TNGRIGDLRI AAKVLEGRKV AKGMRAIVFP ATQAIYLQAI EEGLIQTFIK AGCVVSTPTC
GPCLGGHMGI LAAGERAAST TNRNFVGRMG HVESEVYLCS PAVAAASAVT GKISEPSELF
S