Gene GSU1903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1903 
Symbol 
ID2688457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2080770 
End bp2082053 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content59% 
IMG OID637126594 
Product3-isopropylmalate dehydratase large subunit 
Protein accessionNP_952952 
Protein GI39997001 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR00170] 3-isopropylmalate dehydratase, large subunit
[TIGR01343] homoaconitate hydratase family protein
[TIGR02086] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.473608 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGAAAA CTATTGCGGA AAAGATTTTT GCCAGTCATC TGGTTGATGA ACCCTTTGCC 
GGCACCAAGG TGCTGAGGCT CGATGTGGTG ATGTGTCACG AGATCACCAC GCCCATCGCC
ATTGCCGACC TGATGGCGCG CGGCAAGGAT CGGGTTTTCG ACCCGACAAA GATCAAGGCG
GTCATTGATC ACGTGACCCC CAGCAAGGAC TCCAAGACGG CCACCCAGGC GAAGATGTTG
CGCGATTGGG CGCGGCGCCA TGGCATCGTG GACTTTTTCG ACGTCGGAGC TAACGGCGTT
TGTCATGCAC TGTTCCCCGA GAAGGGGTTC ATCCGGCCCG GCTATACGGT GATCATGGGC
GATTCGCATA CCTGTACTCA CGGAGCGTTC GGCGCGTTTG CGGCAGGCAT CGGCACCACC
GATCTCGAGG TCGGCATTCT CAAGGGAGTC TGCGCCTTCC GCGAGCCGAA AACGATCCGC
ATCAACCTGA ACGGCTCTCT GCCTGAAGGG GTGTATGCCA AGGACGTAAT CCTCCATGTC
ATCGGCAGGA TTGGTGTGAA CGGGGCAACC GACCGGGTAA TGGAGTTCCG CGGATCGGTG
GTCGACACCA TGACCATGGA GTCGCGGATG ACACTGTGCA ACATGGCCAT CGAGGCGGGT
GGCACCTCGG GTATCTGTAT GCCTGACATG GTGACGGTCG ATTACCTCTG GCCCTTCCTC
AAGGATGAGT ACCAGTCCCG GGAGGCCGCC CTTGCGGCAT TCAGCCTGTG GCGGTCCGAC
GAAGACGCCG TCTACGAGCA GGTGCTTGAT TTTGATGTGT CGAGTCTCGA ACCCATTGTC
ACATTTGGCT ATAAGCCGGA CCAGGTGAAG CCGGTCTCTG AAATTGCCGG TACCCCCGTG
GACCAGGTTT ATCTCGGCTC CTGCACCAAC GGCCGGCTGG AGGATCTGCG CATTGCCGCA
CGGATTCTCA AGGGAAAGAA GATCGCACCC ACGGTACGCG GCATACTCTC TCCCGCCACG
CCGAAGATCT ACCAGGATGC CATGCGTGAG GGGCTGATCG ATATCTTCAT GGAAGCCGGC
TTCTGTGTGA CCAATCCCAC CTGCGGCGCC TGTCTCGGCA TGAGCAACGG CGTCCTGGCC
GAAGGAGAAG TCTGCGCATC GACCACAAAC CGTAACTTCA TGGGGCGGAT GGGCAAGGGG
GGGATGGTCC ACCTCATGTC TCCGGCAACG AGTGCTGCAA CCGCTATCGA GGGTAAGATA
GCCGACCCCC GGAAGTATCT GTAG
 
Protein sequence
MGKTIAEKIF ASHLVDEPFA GTKVLRLDVV MCHEITTPIA IADLMARGKD RVFDPTKIKA 
VIDHVTPSKD SKTATQAKML RDWARRHGIV DFFDVGANGV CHALFPEKGF IRPGYTVIMG
DSHTCTHGAF GAFAAGIGTT DLEVGILKGV CAFREPKTIR INLNGSLPEG VYAKDVILHV
IGRIGVNGAT DRVMEFRGSV VDTMTMESRM TLCNMAIEAG GTSGICMPDM VTVDYLWPFL
KDEYQSREAA LAAFSLWRSD EDAVYEQVLD FDVSSLEPIV TFGYKPDQVK PVSEIAGTPV
DQVYLGSCTN GRLEDLRIAA RILKGKKIAP TVRGILSPAT PKIYQDAMRE GLIDIFMEAG
FCVTNPTCGA CLGMSNGVLA EGEVCASTTN RNFMGRMGKG GMVHLMSPAT SAATAIEGKI
ADPRKYL