Gene Dole_2040 details


Gene Information

Locus tag: Dole_2040
Symbol:
ID: 5694883
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 2472529
End bp: 2474157
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 59%
IMG OID: 641264641
Product: putative alpha-isopropylmalate/homocitrate synthase family transferase
Protein accession: YP_001529921
Protein GI: 158522051
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein
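
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that the listed gene length supports) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information listing.
start_bp = 2472529
end_bp = 2474157

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result agrees with the listed 1629 bp gene length and 542 aa protein length. The check does not depend on the strand, which the record leaves blank.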


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACAGG TACGCATATA CGACACAACC CTGCGGGACG GTATGCAGGG CGACACCATC 
AACTTTACGG TGGATGAAAA GATCCGCGTC GCCCGTCAAC TGGACGACCT TGGTGTTCAT
TACATCGAGG GCGGATGGCC GGGATCCAAT CCCAGGGATG TCCAGTTTTT TGAAAAGGCC
GCGAAGATCG ACTTCAAAAA CGCCCGGCTC ACCGCCTTTG GATCCACCCG GCGGGCCGGC
CTGGCCGTGG AAAAAGACGA TAACATTCGG CTGCTGCTGG AATGCGGGGC GCCTGCCGTG
GCCCTGGTGG GAAAGACATG GGATCTGCAT ATCACCGAAG TGATGAGCAA CACACTTGAA
AGCAACCTGG AGATGATTCA CGACTCGGTG GCCTACGTCA AGTCCCACGG CCGTGAGGTG
TTCTTTGACG CCGAGCACTT TTTCGACGGC TGTACCCACA ACAGGGAGTA CACCTTCAAA
GCGATTCTGA CCGCGGCCCA GGCAGGGGCC GACGCCGTGA TCCTGTGCGA CACCAACGGC
GGCGCCCTGC CCCACGACGT GGAGGCCATC ACCGCCGAGG TATGCACCAC ACTGGCGGAC
CGGTTTCCCG GTCCGGACGG CGGCTCCACC GTGCAAGTCG GCATTCACAC TCATAATGAC
AGTAACCTGG CCGTGGCCAA CAGCATTGCC GCAATACGGG CCGGGGCACG AATCGTCCAG
GGCACCATCA ACGGCTATGG GGAGCGGTGC GGCAACGCCG ACCTCACCTC CATTATCCCG
ATTCTTGCCG CAAAAATGGA ATATGACTGC ATCACACCGG AAAATCTTAA AAAACTGCGG
AAGGTGTCCC GGTTTGTCAG CGAAACCGCC AACATGACAC CGGTCAACAG CCGGCCCTTT
GTGGGCAAAA GCGCCTTCTC CCACAAGGGC GGCCTTCATG TCAGCGCCAT CATGAAAAAC
CCCCGGGCCT ACGAGCACAT GGACCCGGAA CTGGTGGGCA ACAAGCGGCG GGTCCTGATA
TCCGACCTGT CGGGCCGAAG CAACGTCACC TACAAGGCCA GGGAACTGGG CATCAATACC
GACACCGAAC ATTTTGACGT GGACCGCATC CTGTCCGAAG TCAAGATGCT GGAGTTAGAA
GGATTCCAGT TCGACGCGGC GGACGGCTCC TTCAAGATCG TGATGGAAAA GATTTCCGGC
CTGTATACCC CCCTTTTTGA TCTGCTCTCC TTCCGGGTGA CAGTGGAAAA AGAGAAAGAC
CGGCCCTGCA CGGCCCACGC GACCATACGG CTGGGGGTGG GAGAGTTGGA GACCACCACC
GCCGCAGAGG GAGACGGCCC GGTGAGCGCC CTGGACACGG CCCTGCGAAT GGCCATCGCC
GAGTTCTATC CCGATTCTTT GGGCCTGGAC GCCATGCAGC TGGTGGACTT CAAGGTGCGG
GTCCTGGACG GACGGGACGG CACATCGGCC AAGGTCCGGG TGCTGATCGA CTCCAGGGAC
GAAGACGAGG TGTGGGGCAC CATCGGCGTG TCGGAAGACA TCATCGAGGC CAGCTGGGAG
GCCCTTGCCG ACAGCTGCCA GTACAAACTT TCCAAGGAAC TGAACAAGAA AAAGAAAAAA
CAGGACTAG
 
Protein sequence
MEQVRIYDTT LRDGMQGDTI NFTVDEKIRV ARQLDDLGVH YIEGGWPGSN PRDVQFFEKA 
AKIDFKNARL TAFGSTRRAG LAVEKDDNIR LLLECGAPAV ALVGKTWDLH ITEVMSNTLE
SNLEMIHDSV AYVKSHGREV FFDAEHFFDG CTHNREYTFK AILTAAQAGA DAVILCDTNG
GALPHDVEAI TAEVCTTLAD RFPGPDGGST VQVGIHTHND SNLAVANSIA AIRAGARIVQ
GTINGYGERC GNADLTSIIP ILAAKMEYDC ITPENLKKLR KVSRFVSETA NMTPVNSRPF
VGKSAFSHKG GLHVSAIMKN PRAYEHMDPE LVGNKRRVLI SDLSGRSNVT YKARELGINT
DTEHFDVDRI LSEVKMLELE GFQFDAADGS FKIVMEKISG LYTPLFDLLS FRVTVEKEKD
RPCTAHATIR LGVGELETTT AAEGDGPVSA LDTALRMAIA EFYPDSLGLD AMQLVDFKVR
VLDGRDGTSA KVRVLIDSRD EDEVWGTIGV SEDIIEASWE ALADSCQYKL SKELNKKKKK
QD
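
As a spot check that the protein sequence is the translation of the gene sequence, the sketch below translates the first and last rows of the gene sequence and compares them with the corresponding residues of the protein. It assumes the standard codon assignments, which translation table 11 shares with table 1 for amino acid codons (the tables differ in permitted start codons). The helper name `translate` is hypothetical, not part of any library.

```python
# Codon table built in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces and newlines."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last rows of the gene sequence, copied from the listing above.
first_row = "ATGGAACAGG TACGCATATA CGACACAACC CTGCGGGACG GTATGCAGGG CGACACCATC"
last_row = "CAGGACTAG"

print(translate(first_row))  # MEQVRIYDTTLRDGMQGDTI
print(translate(last_row))   # QD*
```

The first row yields the first 20 residues of the protein sequence, and the last row yields the final two residues followed by the stop codon. Pasting in the full gene sequence would extend the same check to all 542 residues.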