Gene Acel_0433 details

Gene Information

Locus tag: Acel_0433
Symbol:
ID: 4485668
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 462937
End bp: 464697
Gene length: 1761 bp
Protein length: 586 aa
Translation table: 11
GC content: 67%
IMG OID: 639729200
Product: dihydroxy-acid dehydratase
Protein accession: YP_872193
Protein GI: 117927642
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
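
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that contributes no residue to the protein. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information record.
start_bp = 462937
end_bp = 464697
gene_length_bp = 1761
protein_length_aa = 586

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A complete CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and encodes no amino acid.
coded_residues = codons - 1

print(span_bp, remainder, coded_residues)
```

Under these assumptions the span equals the listed gene length, the length is divisible by three, and the residue count equals the listed protein length.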


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.436013
Fosmid hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACCGCTG CTTCGAATTC GTCATCGCGT AACGCGCCGT TGCGCAGCGC CCGTTGGTTC 
GACGGCGACG ACGACGTTGC CGTCGAACAC CGCGCGGCGC TTCGCTCAGC GAACCCTGAG
TTCCGTCCCG GCGGATCGCA GCCAGTGATC GGGATCGCGG ACACGTCAAG TGAACTCAAC
CCGTGCAATT TGCCGCTGCG CGGACTCATC CCCGACGTGG CGCAAGGAAT TCACGACGCC
GGCGGGATTC CGGTCACCCT CCCCGCCATG TCGCTGGGTG AAGACCTGAT GAAGCCGACC
GCGATGCTCT ATCGCAATCT GCTGAGCATC GAAATCGAAG AATATCTCCG CGCCTACCCA
CTCGACGGGA TCGTGCTCCT CGCCAACTGC GACAAGACCG TGCCCGGCTC TGTCATGGGT
GCCGTGAGCG CCAATTTCCC AACAATGATG CTCATCGGCG GTCCTCGGCC CATCCAAACA
TTCCGCGGAC GCCGGATCGG CAGTGGTACC GCGTTATGGC GCGCCTTCGA CCAGCACCGT
TCGGGCGAGC TGGACGATGC CGCATGGGCG GAATTTGAGC AGTGCCTGAG CTGCGGCCAA
GGCGCGTGCA ACACCATGGG CACAGCCGCC TCCATGGCCG TCGTCGTGGA AACCCTCGGC
TTCACCCTGC CTGGTACGGC GACGATGCCC GCCGACGACC CGGCCCGCCG GGCCGTCGCG
CACGAGACCG GTCGCCGCGC GGTCGCCGCC GTCCGGGAGA ACGTCCGACC CCGCGACCTC
ATCACCGGAA TCAGTCTTCG CAACGCAATC CGCGCCCTCA ACGCCTGCGG CGGCTCAACC
AACGCGATCC TTCACCTCAT CGCGATCGCC CGGCGCGCCG GCAACGCCCT GTCACCAGCG
GACGTCGCCG CCGCCGGCCG TGGCGTTCCG GTGCTCCTCG ATATCGAACC GCACGGTCAG
GGTCTCGTGC CGGATTTTCA CGCGGCCGGC GGCGTCCCGG CGATCCTTGC CACCCTCGGC
GATTTCATCG ACCGCTCTGC GCTCGCCGGA AATGGCGAAC CGTGGTCGCG CGTCCTGCGG
AATGCGCCGG TCGTCAACGA GACGACCGTC ATCCGCCCGC TGGACGCGCC ATTGCGATCC
GACGGCGCTT TCGCCTTCCT GCACGGCAAC CTTGCACCGC GCGGCGCCGT ACTGAAAACC
GTCGCGGCGA GTGAGCGTCT TTTCCAGCAC CGGGGTCCAG CCGTCGTTTT CCACGGCTAC
GACGATCTGT GGTCGCGCAT TGACGATCCA GATCTCGAGG TCACGCCGGA GAGTGTGCTC
GTCCTCGCCG GATGTGGACC GATCGGCGGC CCCGGAATGC CTGAATGGGG CATGATTCCC
ATCCCGAAGA AGCTGGCGAA AGCCGGCGTC CGTGACATGG TGCGCGTGAG CGACGCGCGC
ATGAGCGGTA CGTCGTTCGG CACCTGTGTG CTCCACGTCG CACCTGAGGC CGCAATCGGC
GGGCCGTTGG CGCTGGTCCG GGACGGTGAC ATCATCCACC TCGACGTCAC CCGCGGCCGC
CTCGACGTGG AAATCAGCGA GGCGGAACTG CGGCGTCGCG CCGCCGAGTG GGTGACGCCG
CCCAATCCCT ACCGGCGGGG TTGGATCGCG CTGTACCGCG CCCACGTCAC CCAAGCCGAC
GAAGGCTGCG ATCTCGATTT TCTCCAGCCC CGCACGCCGC AAGACATCGA ATTCGTCGAA
CCCACCATCG GTCGTTCGTA A
 
Protein sequence
MTAASNSSSR NAPLRSARWF DGDDDVAVEH RAALRSANPE FRPGGSQPVI GIADTSSELN 
PCNLPLRGLI PDVAQGIHDA GGIPVTLPAM SLGEDLMKPT AMLYRNLLSI EIEEYLRAYP
LDGIVLLANC DKTVPGSVMG AVSANFPTMM LIGGPRPIQT FRGRRIGSGT ALWRAFDQHR
SGELDDAAWA EFEQCLSCGQ GACNTMGTAA SMAVVVETLG FTLPGTATMP ADDPARRAVA
HETGRRAVAA VRENVRPRDL ITGISLRNAI RALNACGGST NAILHLIAIA RRAGNALSPA
DVAAAGRGVP VLLDIEPHGQ GLVPDFHAAG GVPAILATLG DFIDRSALAG NGEPWSRVLR
NAPVVNETTV IRPLDAPLRS DGAFAFLHGN LAPRGAVLKT VAASERLFQH RGPAVVFHGY
DDLWSRIDDP DLEVTPESVL VLAGCGPIGG PGMPEWGMIP IPKKLAKAGV RDMVRVSDAR
MSGTSFGTCV LHVAPEAAIG GPLALVRDGD IIHLDVTRGR LDVEISEAEL RRRAAEWVTP
PNPYRRGWIA LYRAHVTQAD EGCDLDFLQP RTPQDIEFVE PTIGRS
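
The gene and protein sequences can be spot-checked against each other by translating codons. The following is a minimal sketch, not the database's own method: it builds a codon table from the standard genetic code, on the assumption that translation table 11 assigns the same amino acids to internal codons (the two tables differ in their permitted start codons, which does not matter here because the gene begins with ATG). The `translate` helper and the variable names are hypothetical. Only the first 30 and last 21 nucleotides of the gene sequence are used.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nucleotides of the gene sequence.
first_30 = "ATGACCGCTG CTTCGAATTC GTCATCGCGT"
# Last 21 nucleotides; they start at position 1741, which is in frame.
last_21 = "CCCACCATCG GTCGTTCGTA A"

print(translate(first_30))
print(translate(last_21))
```

The two printed strings can be compared with the first ten and last six residues of the protein sequence; the same helper could be applied to the full gene sequence.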