Gene Acel_1591 details


Gene Information

Locus tag: Acel_1591
Symbol:
ID: 4484644
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 1789464
End bp: 1790885
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 67%
IMG OID: 639730375
Product: isopropylmalate isomerase large subunit
Protein accession: YP_873349
Protein GI: 117928798
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0065] 3-isopropylmalate dehydratase large subunit
TIGRFAM ID: [TIGR00170] 3-isopropylmalate dehydratase, large subunit
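
The coordinates and lengths listed above are consistent with each other, which a short check can confirm. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption; the record does not state its coordinate convention) and that the CDS ends with a single stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1789464
end_bp = 1790885

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1422
print(protein_length)   # 473
```

Both results agree with the "Gene Length" and "Protein Length" fields of the record.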


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.777227
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGCGACG GCACGGGCGT ACGCGGCCGA ACGATGGCCG AGAAAATCTG GGAGGAGCAC 
GTCGTCCGGC GTGCGGACGG TGAACCCGAC CTTCTCTACA TCGACCTGCA TTTGATCCAT
GAGGTGACGA GTCCGCAGGC TTTCGATGGA CTGCGACTGG CGGGCCGCCG CGTCCGCCGT
CCGGACCTCA CTATCGCCAC CGAGGACCAC AACGTGCCGA CAACGGACAT CGACAAGCCG
ATCGCTGACC CTGTGTCACG GACCCAAGTC GAGACGTTGC GCCGCAATTG CGCGGAGTTC
GGCATCCGGC TGCATCCGAT GGGGGATCGT GAGCAGGGCA TCGTGCACGT CATCGGCCCC
CAGCTCGGGC TCACCCAGCC GGGCATGACC ATCGTCTGTG GTGATTCCCA CACCTCCACG
CACGGCGCCT TCGGGGCGTT GGCGTTCGGG ATCGGCACCA GTGAGGTCGA ACACGTGCTG
GCGACCCAGA CCCTGCCGCA GTACAAGCCC AAGACGATGG CCGTGACCGT CGACGGCACG
CTCCGCCCGG GGGTGACGAG CAAGGACATC ATCCTCGCCC TCATCGCGAA AATCGGGACC
GGCGGCGGTC AGGGGCACGT CATCGAATAC CGGGGCGAGG CGATTCGCGC ATTGTCCATG
GAAGCGCGGA TGACGATCTG CAACATGTCC ATCGAAGCAG GCGCCCGCGC CGGGATGATC
GCCCCGGACG ATACCACATT TGCCTATCTG GAAGGGCGTC CGCACGCGCC GAAGGGGCGT
GACTGGGAGG CGGCGCTGGA GTATTGGCGC TCACTGCCGA CCGACCCTGA TGCGGTCTTC
GACGAGGAAG TGCTGCTCGA TGCGGGCAGT CTCTCGCCGT ACGTCACCTG GGGAACCAAT
CCCGGACAGG GTGCGCCTCT CGACAGTGTC GTCCCAGATC CGGCGACTTT CTCCGACCCG
ATCGAACGTG CGGCGGCGGA ACGGGCGCTT GCCTACATGG ATCTGCAGCC CGGCACGCCG
TTGCGTGAGA TCCGGGTTGA CACCGTGTTC ATCGGGTCGT GCACCAACGG TCGCATCGAG
GATCTTCGCG CCGCGGCGTC CGTGCTCCAG GGACGCAAGG TCGCCCCTGG TGTCCGGGTG
CTCGTCGTCC CCGGTTCGAT GGCGGTCAAG GCGCAGGCCG AGGCGGAAGG ACTGGATCGC
ATCTTCCGGG ATGCCGGAGC GGAATGGCGG AACGCCGGCT GCTCGATGTG CTTGGGGATG
AATCCCGACC AGCTCGCCCC GGGCCAGCGT TCGGCTTCCA CCTCCAACCG CAATTTCGAG
GGGCGGCAAG GACGCGGCGG GCGCACCCAC CTCGTGTCCC CGCTCGTCGC CGCGGCCACC
GCGGTCGCCG GTCACCTCGC CGCGCCCTCG GATCTCGATT GA
 
Protein sequence
MSDGTGVRGR TMAEKIWEEH VVRRADGEPD LLYIDLHLIH EVTSPQAFDG LRLAGRRVRR 
PDLTIATEDH NVPTTDIDKP IADPVSRTQV ETLRRNCAEF GIRLHPMGDR EQGIVHVIGP
QLGLTQPGMT IVCGDSHTST HGAFGALAFG IGTSEVEHVL ATQTLPQYKP KTMAVTVDGT
LRPGVTSKDI ILALIAKIGT GGGQGHVIEY RGEAIRALSM EARMTICNMS IEAGARAGMI
APDDTTFAYL EGRPHAPKGR DWEAALEYWR SLPTDPDAVF DEEVLLDAGS LSPYVTWGTN
PGQGAPLDSV VPDPATFSDP IERAAAERAL AYMDLQPGTP LREIRVDTVF IGSCTNGRIE
DLRAAASVLQ GRKVAPGVRV LVVPGSMAVK AQAEAEGLDR IFRDAGAEWR NAGCSMCLGM
NPDQLAPGQR SASTSNRNFE GRQGRGGRTH LVSPLVAAAT AVAGHLAAPS DLD
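
As a sanity check, the gene sequence can be translated and compared with the protein sequence shown above. The following is a minimal sketch, not the database's own method. It builds the codon table from the standard NCBI amino-acid string; translation table 11 uses the same codon-to-amino-acid assignments as the standard table and differs only in the permitted start codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first 60 bases and the final 12 bases of the gene are used, to keep the example short.

```python
# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the display layout."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line and last 12 bases of the gene sequence, copied from the record.
first_60 = "ATGAGCGACG GCACGGGCGT ACGCGGCCGA ACGATGGCCG AGAAAATCTG GGAGGAGCAC"
last_12 = "GATCTCGATT GA"

print(translate(first_60))  # MSDGTGVRGRTMAEKIWEEH
print(translate(last_12))   # DLD
```

The two outputs match the first 20 and last 3 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the listed GC content.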