Gene Dole_2041 details

Gene Information

Locus tag: Dole_2041
Symbol:
ID: 5694884
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 2474166
End bp: 2475701
Gene Length: 1536 bp
Protein Length: 511 aa
Translation table: 11
GC content: 61%
IMG OID: 641264642
Product: 2-isopropylmalate synthase
Protein accession: YP_001529922
Protein GI: 158522052
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR00973] 2-isopropylmalate synthase, bacterial type
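
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The variable and function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2474166
end_bp = 2475701
gene_length_bp = 1536
protein_length_aa = 511

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the protein length.
codons = gene_length_bp // 3
expected_protein_length = codons - 1


def record_is_consistent():
    """Return True if the length fields agree with each other."""
    return (
        span == gene_length_bp
        and gene_length_bp % 3 == 0
        and expected_protein_length == protein_length_aa
    )


print(span, expected_protein_length)  # 1536 511
print(record_is_consistent())         # True
```

Under these assumptions the listed gene length (1536 bp) and protein length (511 aa) agree with the start and end coordinates.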


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGAAA AACTGATCAT ATTCGACACG ACCTTGAGAG ACGGCGAACA GTCGCCCGGC 
GCCAGCATGA ACGTGGCGGA AAAGCTGCGC ATTGCCTCCC GGCTGGAAGA GCTGGGGGTG
GACGTGATTG AGGCGGGGTT TCCGGCCGCC TCCAAGGGGG ACATGGAAGC GGTCTCCCAG
GTGGCCCAGA AGCTGTCACG AAGTGCGGTG GCCGGTCTTG CCCGCACCAA CAAGGATGAT
ATCGACAAGG CATGGGCCGC GGTCTCCGGT GCCAGGCACC CCCGGATTCA CGTGTTTATC
GCCAGCTCGG ACATTCACAT GGAGCACAAG CTGCGCATGC CCAGGGACAC CGTTGTGGAG
CACGCCATTG CCGGGGTCCG TTATGCCAGG ACCTTTACCG ATGACGTGGA GTTTTCCGCC
GAAGACGCCT CCCGCAGCGA CAGGGTCTTT CTGTGCAAGC TGTTTGAAGC CGCCATCTCC
GCCGGCGCCA CCACCATCAA TATTCCCGAT ACCGTTGGAT ACGCCATTCC CAGCGAGTTC
GCCGACCTGG TCAAATACGT GAGGCAGCAC ACCCCCAATA TTCACAAGGC CGTTATCAGT
GTCCATTGCC ACAACGATCT GGGGCTGGCC ACGGCCAACA CCCTGGCGGC CCTGGCCGCG
GGCGCCCGGC AGGCCGAGGT GACCATCAAC GGCATCGGCG AAAGGGCCGG CAACACCTCC
ATGGAGGAGG TAGTGATGGC CATCCGCACC CGGGCCAGCT CTTTTCCGCT GATCTCCACC
ATCGACACCG CCAAGATCTA CCCCACCAGC AAGCTGGTGA GCATGCTCAC CGGCATGATC
GTCCAGCCCA ACAAGGCCAT CGTGGGGGCC AACGCCTTTG CCCACGAGGC GGGCATTCAC
CAGGACGGCA TGCTGAAAAA CCCCATGACC TACGAGATCA TGCGGCCCGA AGACGTAGGC
GTGAGCAGCA GTACTCTGGT GCTGGGCAAG CACTCGGGCA GAAAGGCCCT GTACGACCGC
CTGAAAGAGA TCGGTTACAA CCTTTCCCCC CCGGAGATCG ACACGGTTTT TGTCAAATTC
AAGGAACTGG CCGACCGAAA GAAAAACATC GTGGAAGAGG ACCTGGAAAT CCTGGTCTCT
GAAAACATCA TGGACACCGC CGACCTGTTC CAGCTGGAAT ACCTGCACGT GACCAGCGGC
ACCACGGTGT CGCCCGTGGC CAGCGTGAAA ATGGTTATCA ATGGAAAATC GGTGAAGGGC
GAAAGCTCGG GCAACGGTCC CATTGACGCG GCATACCGGG CCATTGCCAA GCTCACCAAA
ACCGAATCCG AGATGCTGCG GTTTACCATC AGCGCCCTTA CCGGCGGCAC CGACGCCCAG
GGCGAGGTGA CCGTGCGCCT GAAGGAGCAG GGCCTGGTGG CCCTGGGCCG GGGCGCGGAC
CCGGATATCA TCATTGCCAG CGTCAAGGCC TACATCAACG GATTAAACCG CCTGGCCTAT
CTCAAGCGCC ATCCTGTCGC CGACGGCATG ATCTGA
 
Protein sequence
MTEKLIIFDT TLRDGEQSPG ASMNVAEKLR IASRLEELGV DVIEAGFPAA SKGDMEAVSQ 
VAQKLSRSAV AGLARTNKDD IDKAWAAVSG ARHPRIHVFI ASSDIHMEHK LRMPRDTVVE
HAIAGVRYAR TFTDDVEFSA EDASRSDRVF LCKLFEAAIS AGATTINIPD TVGYAIPSEF
ADLVKYVRQH TPNIHKAVIS VHCHNDLGLA TANTLAALAA GARQAEVTIN GIGERAGNTS
MEEVVMAIRT RASSFPLIST IDTAKIYPTS KLVSMLTGMI VQPNKAIVGA NAFAHEAGIH
QDGMLKNPMT YEIMRPEDVG VSSSTLVLGK HSGRKALYDR LKEIGYNLSP PEIDTVFVKF
KELADRKKNI VEEDLEILVS ENIMDTADLF QLEYLHVTSG TTVSPVASVK MVINGKSVKG
ESSGNGPIDA AYRAIAKLTK TESEMLRFTI SALTGGTDAQ GEVTVRLKEQ GLVALGRGAD
PDIIIASVKA YINGLNRLAY LKRHPVADGM I
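
The gene and protein sequences above are related through translation table 11, as listed in the record. The sketch below is an illustration only, not the database's own tooling. It builds the codon table from the standard NCBI base ordering (T, C, A, G), strips the 10-base spacing used in this listing, translates up to the first stop codon, and computes GC percentage. The function names `clean`, `translate`, and `gc_percent` are hypothetical. It also assumes the sequence starts with ATG, so alternative start codons permitted by table 11 are not handled.

```python
from itertools import product

# Codon table for NCBI translation table 11. Its amino acid
# assignments are the same as the standard code. Codons are
# enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGACGGAAA AACTGATCAT ATTCGACACG"
print(translate(first_blocks))  # MTEKLIIFDT
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the whole protein and the listed GC content to be compared in the same way.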