Gene Acry_1046 details


Gene Information

Locus tag: Acry_1046
Symbol:
ID: 5160237
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidiphilium cryptum JF-5
Kingdom: Bacteria
Replicon accession: NC_009484
Strand:
Start bp: 1165863
End bp: 1167716
Gene Length: 1854 bp
Protein Length: 617 aa
Translation table: 11
GC content: 69%
IMG OID: 640552964
Product: dihydroxy-acid dehydratase
Protein accession: YP_001234181
Protein GI: 148260054
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
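
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record, but the record does not state them. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
length_bp = gene_length(1165863, 1167716)
print(length_bp, protein_length(length_bp))  # 1854 617
```

Both results agree with the Gene Length and Protein Length fields listed for Acry_1046.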


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.641708
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCCAAT ACCGTTCCCG CACCTCCACC CACGGCCGCA ACATGGCGGG CGCCCGCGCG 
CTGTGGCGCG CGACCGGCAT GGGCGATGCG GATTTCGGCA AGCCGATCAT CGCCATCGCC
AATTCCTTCA CCCAGTTCGT GCCGGGCCAT GTGCACCTGA AGGATCTCGG CCAGCTCGTC
GCCCGCGAGA TCGAGGCGGC CGGCGGGGTG GCGAAGGAAT TCAACACCAT CGCCGTCGAT
GACGGCATCG CCATGGGTCA TGGCGGCATG CTGTATTCGC TGCCCTCGCG CGAGCTGATC
GCCGATGCGG TGGAATACAT GGTCAACGCC CATTGCGCCG ACGCGCTGGT CTGCATTTCC
AACTGCGACA AGATCACGCC GGGCATGCTG ATGGCGGCGA TGCGGCTGAA CATCCCGACC
ATCTTCGTCT CGGGCGGGCC GATGGAGGCG GGCAAATACA TCGCCGATGG CGAGACCAGG
GCCGCCGACC TGATCACCGC CATGGTCGTC GCCGCCGACC CGACCAAGAC CGACGAGCAG
GCCGCGGTGA TGGAACGCTC CGCCTGCCCC ACCTGCGGCT CGTGCTCGGG CATGTTCACC
GCCAATTCGA TGAACTGCCT GACCGAGGCG CTCGGCCTCG CCCTGCCGGG CAACGGCTCG
CTGCTCGCCA CCCATGCCGA CCGCAAGCGG CTGTTCGTCG AGGCGGGGTG GCAGATCGTC
GATCTCGCCC GGCGCTATTA CGAGCAGGAC GACGAGGGCG TGCTGCCGCG CCGGATCGGC
GGGTTCAAGG CGTTCGAGAA CGCGATGTCG CTCGATATCG CGATGGGCGG GTCGACCAAC
ACGGTGCTGC ACCTGCTGGC CGCGGCACGC GAGGCGGAAC TCGACTTCAC CATGGCGGAC
ATCGACCGGC TGTCGCGCCG GGTGCCCAAT CTCTGCAAGG TCTCGCCCTC GGTCAGCAAT
GTCCACATGG AGGACGTGCA CCGCGCCGGC GGCATCATGG GCATTCTCGG CGCGCTCGAC
CGCGCCGGGC TGATCCATCG CGACTGCGCC ACGGTGCACG AGAAGACGAT CGGCGAGGCG
ATCGACCGCT GGGACGTGAT GCGCGGCGGC GAGACGGCGA AGACGCTCTA CAGCGCCGCC
CCCGGCGGGG TGCGGACGAC GGAGGCGTTC AGCCAGAGCC GGCGCTACGA AAGCCTCGAT
CTCGACCGCG AGAAGGGCGT CATCCGCGAC GCCGAGCACG CGTTCAGCAA GGATGGCGGG
CTCGCGGTGC TGTATGGCAA CATCGCGCTC GACGGCGCGA TCGTGAAAAC GGCCGGCGTC
GATGCGTCGA TCCTCGTCTT CGAGGGGCCG GCGCGGATCT TCGAGAGCCA GGAGGACGCG
GTCGCCGGCA TTCTCGGCGA CAGGGTGAAG GCGGGCGACG TGGTGCTGAT CCGCTACGAG
GGGCCGAAAG GCGGGCCGGG GATGCAGGAG ATGCTGTATC CGACCTCGTA CCTGAAATCG
AAGGGCCTCG GCAAATCCTG CGCGCTGATC ACCGACGGGC GGTTCTCCGG CGGCACGGCG
GGGCTGTCGA TCGGGCATAT CTCGCCGGAA GCGGCGCAGG GCGGGGCGAT CGGGCTGGTC
GAGGAGGGCG ACATCATCGC CATCGACATC CCGAACCGCA AGCTCGACGT GAAGCTCGAC
GAGGCGACGC TGGAAGCGCG GCGCGCGGCG ATGGAGGCGA AGGGCAAGGC GGCGTGGAAA
CCGGCCGCGC GCGAGCGCGT GGTCTCCGCC GCGCTGCAGG CCTATGCGGC GCTGACCACG
AGTGCGGCCA ACGGCGCGGT GCGCGACGTG ACGCAGGTGC AGCGCGGGCG CTAG
 
Protein sequence
MPQYRSRTST HGRNMAGARA LWRATGMGDA DFGKPIIAIA NSFTQFVPGH VHLKDLGQLV 
AREIEAAGGV AKEFNTIAVD DGIAMGHGGM LYSLPSRELI ADAVEYMVNA HCADALVCIS
NCDKITPGML MAAMRLNIPT IFVSGGPMEA GKYIADGETR AADLITAMVV AADPTKTDEQ
AAVMERSACP TCGSCSGMFT ANSMNCLTEA LGLALPGNGS LLATHADRKR LFVEAGWQIV
DLARRYYEQD DEGVLPRRIG GFKAFENAMS LDIAMGGSTN TVLHLLAAAR EAELDFTMAD
IDRLSRRVPN LCKVSPSVSN VHMEDVHRAG GIMGILGALD RAGLIHRDCA TVHEKTIGEA
IDRWDVMRGG ETAKTLYSAA PGGVRTTEAF SQSRRYESLD LDREKGVIRD AEHAFSKDGG
LAVLYGNIAL DGAIVKTAGV DASILVFEGP ARIFESQEDA VAGILGDRVK AGDVVLIRYE
GPKGGPGMQE MLYPTSYLKS KGLGKSCALI TDGRFSGGTA GLSIGHISPE AAQGGAIGLV
EEGDIIAIDI PNRKLDVKLD EATLEARRAA MEAKGKAAWK PAARERVVSA ALQAYAALTT
SAANGAVRDV TQVQRGR
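
Both sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names are hypothetical, and the rounding the database applies to its "GC content" field is not stated in this record, so the sketch returns the raw fraction.

```python
def clean_sequence(block_text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(block_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First display line of the gene sequence in this record (six blocks of 10 bases).
FIRST_LINE = "ATGCCCCAAT ACCGTTCCCG CACCTCCACC CACGGCCGCA ACATGGCGGG CGCCCGCGCG"

cleaned = clean_sequence(FIRST_LINE)
print(len(cleaned))          # 60
print(cleaned[:3])           # ATG, the start codon
print(gc_fraction(cleaned))  # GC fraction of this one line only
```

Pasting the full gene sequence into `clean_sequence` should give a string whose length matches the Gene Length field, and the same function applies to the protein sequence for a check against Protein Length.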