Gene Rsph17029_2523 details

Gene Information

Locus tag: Rsph17029_2523
Symbol:
ID: 4896667
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2660984
End bp: 2662411
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 68%
IMG OID: 640113122
Product: isopropylmalate isomerase large subunit
Protein accession: YP_001044397
Protein GI: 126463283
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0065] 3-isopropylmalate dehydratase large subunit
TIGRFAM ID: [TIGR00170] 3-isopropylmalate dehydratase, large subunit
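
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2660984, 2662411)
print(length_bp)                  # 1428
print(protein_length(length_bp))  # 475
```

Both values agree with the Gene Length and Protein Length fields of this record.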


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.520601
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCTC CGAGAACGCT CTATGACAAG ATCTGGGACG ACCATGTCGT CCACCAGTCC 
GAAGACGGCA CCTGCCTGCT CTATATCGAC CGCCATCTCG TCCACGAGGT GACGAGCCCG
CAGGCCTTCG AAGGCCTGCG CATGACGGGC CGCAAGGTGC GCGCCCCCGA GAAGACCATC
GCCGTGCCGG ACCACAACGT CCCCACGACG GAAGGCCGCG ACACGAAGAT CGACAACGAG
GAGTCGCGCA TCCAGGTCGA GGCGCTCGAC AGGAATGCCC GCGACTTCGG GATCAACTAC
TATCCCGTGT CGGACATCCG GCAGGGGATC GTCCATATCG TCGGGCCCGA GCAGGGCTGG
ACGCTGCCGG GCATGACGGT GGTCTGCGGC GACAGCCACA CGGCCACCCA CGGCGCCTTC
GGCGCGCTGG CCCACGGCAT CGGCACCTCC GAGGTCGAGC ATGTGCTGGC CACGCAGACG
CTGATCCAGA AGAAATCGAA GAACATGAAG GTGGAGATCA CCGGGAGCCT GCGCCCCGGT
GTCACCGCCA AGGACATCAC GCTGTCGGTC ATCGGTCTCA CCGGGACGGC GGGCGGCACC
GGCTATGTCA TCGAATATTG CGGGCAGGCG ATCCGCGAAC TGTCGATGGA AGGCCGGATG
ACCGTCTGCA ACATGGCGAT CGAGGGCGGC GCGCGCGCGG GCCTGATCGC GCCGGACGAG
AAGACCTTCG CCTATGTGAT GGGTCGTCCG CATGCGCCGA AGGGCGCAGC CTGGGAGGCG
GCGCTCGCCT ACTGGAAGAC GCTCTTCACC GACAAGGGCG CGCAGTTCGA CAAGGTCGTG
ACGATCCGCG GCGAGGACAT CGCCCCGGTC GTGACCTGGG GCACCTCGCC CGAGGATGTG
CTGCCGATCA CCGCGACCGT CCCCGCCCCG GAGGACTTCA CCGGCGGCAA GGTCGAGGCC
GCCCGCCGCA GCCTCGAATA CATGGGCCTG ACCCCGGGCC AGAAGCTGAC CGACATCAAG
ATCGACACGG TCTTCATCGG CTCCTGCACC AACGGCCGGA TCGAGGATCT GCGGGCCGCG
GCCGAGATCC TGAAGGGCAA GAAGGTGGCG CCGGGAATGC GGGCCATGGT CGTGCCGGGC
TCGGGCCTCG TGCGCGCGCA GGCCGAGGAA GAGGGGCTGG CGCAGATCTT CATCGACGCG
GGCTTCGAAT GGCGCCTCGC GGGCTGCTCG ATGTGCCTCG CGATGAACCC CGACCAGCTC
TCGCCGGGGG AACGCTGCGC CTCGACCTCG AACCGGAACT TCGAGGGCCG TCAGGGCCGC
AACGGCCGCA CCCATCTCGT CAGCCCCGGA ATGGCCGCCG CTGCGGCGAT CACCGGTCAC
CTGACCGACG TGCGCGACCT GATGATGGCG CCGGCCGAGC CGGCGTGA
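
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before any composition statistic such as the GC content field can be recomputed. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full sequence length; the helper names are hypothetical, and the example uses only the first two blocks of the sequence rather than the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence above.
fragment = clean_sequence("ATGACCGCTC CGAGAACGCT")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 60.0
```

Passing the full cleaned gene sequence to `gc_percent` in the same way gives a value that can be compared with the GC content field of the record.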
 
Protein sequence
MTAPRTLYDK IWDDHVVHQS EDGTCLLYID RHLVHEVTSP QAFEGLRMTG RKVRAPEKTI 
AVPDHNVPTT EGRDTKIDNE ESRIQVEALD RNARDFGINY YPVSDIRQGI VHIVGPEQGW
TLPGMTVVCG DSHTATHGAF GALAHGIGTS EVEHVLATQT LIQKKSKNMK VEITGSLRPG
VTAKDITLSV IGLTGTAGGT GYVIEYCGQA IRELSMEGRM TVCNMAIEGG ARAGLIAPDE
KTFAYVMGRP HAPKGAAWEA ALAYWKTLFT DKGAQFDKVV TIRGEDIAPV VTWGTSPEDV
LPITATVPAP EDFTGGKVEA ARRSLEYMGL TPGQKLTDIK IDTVFIGSCT NGRIEDLRAA
AEILKGKKVA PGMRAMVVPG SGLVRAQAEE EGLAQIFIDA GFEWRLAGCS MCLAMNPDQL
SPGERCASTS NRNFEGRQGR NGRTHLVSPG MAAAAAITGH LTDVRDLMMA PAEPA
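
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below illustrates this with a hand-built codon table, assuming that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which do not matter here because this gene starts with ATG). The `translate` function is an illustrative helper, not a library call.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order for each codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence (the first 10 codons).
print(translate("ATGACCGCTC CGAGAACGCT CTATGACAAG"))  # MTAPRTLYDK
```

The output matches the first block of the protein sequence, MTAPRTLYDK.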