Gene GSU2879 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2879 
SymbolleuB 
ID2688662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3154503 
End bp3155591 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content65% 
IMG OID637127572 
Product3-isopropylmalate dehydrogenase 
Protein accessionNP_953921 
Protein GI39997970 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0473] Isocitrate/isopropylmalate dehydrogenase 
TIGRFAM ID[TIGR00169] 3-isopropylmalate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCAGG TGTTCAAGGT GGCGGTTCTT CCCGGCGACG GCATCGGTCC CGAGGTGATG 
GCCGAGGCGC TCCGGGTGCT GGATGCCGTG GAGGCAAAAT ACGACGTGAA GTTCGAGCGG
ACCCACGCCA ACGTGGGTGG CGCCGGCATT GATATCGAGG GGAAAGCTCT CCCCGAAACC
ACCGTGAACA TCTGCAAGGC TGCCGACGCC ATCCTCTTCG GCTCCGTGGG CGGACCCAAG
TGGGAGTCGC TCCCTCCGGA CGAGCAGCCC GAGCGGGGCG CGCTCCTGCC GCTGCGGAAG
ATCTTCGGTC TCTACGCCAA CCTGCGGCCG GCCATCATTT TCCCGTCCCT CACCGGCGCC
TCCTCCCTCA AGGAAGAGGT AATAGCCGGG GGCTTCAACG TGCTGGTCAT CCGGGAGCTC
ACCGGCGGCA TCTACTTTGC CCAGCCCAAG GGGATCGAAG GCGAAGGGCG CGACCGGGTC
GGCTTCGACA CCATGCGCTA CAGCGTGCCC GAGATCGAGC GGATAACCCA TGTGGCCTTC
CAGGCTGCCC GCAAGCGGGG CAAGAAGGTC TGCTCCATCG ACAAGGCCAA CGTCCTCTCC
TCGTCGGTTC TCTGGCGCGA GGTGGTCACC GGCATTGCCA AGGAATACCC GGACGTGGAA
CTCTCCCACA TGTACGTGGA CAACGCCGCC ATGCAACTGG TCCGCTGGCC CAAGCAGTTC
GACGTGATCC TGTGCGAGAA CATGTTCGGC GACATCCTCT CGGACGAGGC GGCCATGCTG
ACCGGCTCCC TCGGGATGCT GCCTTCCGCG TCCCTGGCCG AGGGGACCTT CGGCATGTAC
GAGCCCTCCG GCGGCAGCGC CCCTGACATA GCTGGCCAGG GGATCGCCAA CCCCATCGCC
CAGATCCTCT CCATGGGGAT GATGCTCAAG TTCTCCTTCG GCATGGTCGA CGCGGCCGAC
GCCATCGACA ACGCGGTGGC AACGGTGCTT GACCAGGGCT TCCGGACCCG CGACATCTAC
CAGCAGAAAG ACGGCGAGAA ACTCGTCAAC ACCAAGGAGA TGGGCGACGC GATCATCGCA
GCCCTGTAA
 
Protein sequence
MAQVFKVAVL PGDGIGPEVM AEALRVLDAV EAKYDVKFER THANVGGAGI DIEGKALPET 
TVNICKAADA ILFGSVGGPK WESLPPDEQP ERGALLPLRK IFGLYANLRP AIIFPSLTGA
SSLKEEVIAG GFNVLVIREL TGGIYFAQPK GIEGEGRDRV GFDTMRYSVP EIERITHVAF
QAARKRGKKV CSIDKANVLS SSVLWREVVT GIAKEYPDVE LSHMYVDNAA MQLVRWPKQF
DVILCENMFG DILSDEAAML TGSLGMLPSA SLAEGTFGMY EPSGGSAPDI AGQGIANPIA
QILSMGMMLK FSFGMVDAAD AIDNAVATVL DQGFRTRDIY QQKDGEKLVN TKEMGDAIIA
AL