Gene Cphy_3348 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3348 
Symbol 
ID5741630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp4077540 
End bp4079213 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content42% 
IMG OID641294451 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001560440 
Protein GI160881472 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0168284 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGTG ATGCAGTAAC CAAAGGGATA CAACAAGCAC CTCATCGATC TTTATTTAAT 
GCGTTGGGAT TAACCAAGGA AGAACTAGAT AAACCACTCA TAGGTATTGT AAGTTCTTAT
AATGAAATTG TACCAGGACA TATGAACTTA GATAAGATAG TGGAAGCCGT GAAATTAGGA
GTTGCGATGG CAGGAGGAAC ACCAATCGTA TTTCCAGCGA TTGCAGTATG TGATGGAATT
GCGATGGGAC ATATCGGTAT GAAATATTCC CTAGTAACGA GGGATTTGAT TGCTGATTCT
ACTGAAGCAA TGGCAATGGC ACATAGTTTT GATGCTTTAG TAATGGTTCC AAACTGTGAT
AAGAATGTTC CAGGTTTACT TATGGCAGCA GCTAGAGTAA ATATTCCTAC CATATTTGTA
AGTGGAGGAC CAATGCTTGC AGGTCGTGTT CACGGAGAAA AGAGAAGCCT TAGCAGTATG
TTTGAAGCAG TTGGCGCACA TGCAGCTGGT AAGATGACGG AAGAGGAAGT TGAGGAATTT
GAAAATAAAG TTTGCCCAAC CTGCGGATCT TGCTCCGGAA TGTATACGGC AAACAGTATG
AACTGTTTAA CAGAGGCGCT AGGAATGGGA CTGAAAGGAA ATGGAACAAT TCCAGCAGTA
TACTCTGAAC GTATTCGACT TGCAAAACAT GCCGGTATGA AGATTATGGA GCTTCTACAG
AATAATATAC GTCCAAGAGA TATTATGTCA GAGAAGGCTT TTCTTAATGC GTTGGCAGTC
GATATGGCAC TTGGCTGTTC TACAAACAGT ATGTTACACC TACCAGCCAT TGCTCATGAG
GCAGGAGTTG ATTTAAATGT AGATATCGCA AATGAAATCA GTGCAAAGAC TCCAAACCTA
TGTCACCTTG CTCCGGCTGG TCATACTTAC ATGGAAGATT TGAATGAAGC CGGCGGTGTC
TATGCTGTTA TGAATGAACT TGATAAGAAG GGATTATTGT ATACAGACCT AATTACTTGT
ACAGGTAAGA CTATTAAAGA GAATATTGAA GGCTGTGTAA ATAGAGATCC AGATACAATT
CGTCCAATTG AAAATCCATA TAGTCAAACT GGTGGAATCG CAGTGTTAAA GGGTAATCTA
GCGCCAGACT CCGGTGTAGT AAAACGCTCT GCTGTAGCAC CTGAGATGAT GGTGCATGTT
GGACCTGCAA GAGTATTTGA TTGTGAAGAG GATGCAATTG ACGCAATTAA GAGTGGGAAA
ATTGTTGCGG GAGATGTCGT AGTAATTCGA TATGAAGGAC CAAAGGGTGG ACCTGGTATG
CGAGAAATGC TAAACCCTAC CTCTGCTATT GCAGGTATGG GACTTGGTTC TTCTGTTGCA
TTAATTACAG ATGGCCGTTT CTCTGGTGCA TCCAGAGGTG CATCGATAGG TCACGTATCA
CCGGAAGCAG CGGTTGGTGG TAATATCGCT CTCATAGAGG AGGGGGATAT CATCAAAATT
GATATACCGA ATAATTCTCT TAACTTCGTA GTATCCGACG AGGAGTTAGA GAGAAGAAGA
GTCAATTGGA GCCCAAGAGA GCCCAAAATT ACGACGGGTT ACCTTGCACG TTATACTGCT
ATGGTTACCT CTGGAAATCG TGGTGCAATT TTAGAAGTTC CACGTGTTAA GTAA
 
Protein sequence
MKSDAVTKGI QQAPHRSLFN ALGLTKEELD KPLIGIVSSY NEIVPGHMNL DKIVEAVKLG 
VAMAGGTPIV FPAIAVCDGI AMGHIGMKYS LVTRDLIADS TEAMAMAHSF DALVMVPNCD
KNVPGLLMAA ARVNIPTIFV SGGPMLAGRV HGEKRSLSSM FEAVGAHAAG KMTEEEVEEF
ENKVCPTCGS CSGMYTANSM NCLTEALGMG LKGNGTIPAV YSERIRLAKH AGMKIMELLQ
NNIRPRDIMS EKAFLNALAV DMALGCSTNS MLHLPAIAHE AGVDLNVDIA NEISAKTPNL
CHLAPAGHTY MEDLNEAGGV YAVMNELDKK GLLYTDLITC TGKTIKENIE GCVNRDPDTI
RPIENPYSQT GGIAVLKGNL APDSGVVKRS AVAPEMMVHV GPARVFDCEE DAIDAIKSGK
IVAGDVVVIR YEGPKGGPGM REMLNPTSAI AGMGLGSSVA LITDGRFSGA SRGASIGHVS
PEAAVGGNIA LIEEGDIIKI DIPNNSLNFV VSDEELERRR VNWSPREPKI TTGYLARYTA
MVTSGNRGAI LEVPRVK