Gene RPC_0227 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_0227 
Symbol 
ID3971137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp243589 
End bp245010 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content66% 
IMG OID637923340 
Productisopropylmalate isomerase large subunit 
Protein accessionYP_530121 
Protein GI90421751 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR00170] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00400027 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGATT CAAAATCCGC CGCCCCGACC ACCCTGTACG ACAAGATCTG GAACGACCAT 
CTGGTGCACG AGGCCGACGA CGGCACCTGC CTGCTCTATA TCGATCGCCA CCTGGTGCAC
GAGGTGACGT CGCCGCAGGC GTTCGAAGGG CTGCGCACCG CCGGCCGCAA AGTCCATGCG
CCGGAGAAGA CGCTGGCGGT GGTCGATCAC AACGTGCCGA CCACCGATCG CTCAAAACCC
AATCCGGATC CGGAAAGCGC CGACCAGATC GCGGCCTTGG CCGAGAACGC CCGCGAATTC
GGCGTCACCT ACTACAACGA ATTCGACAAG CGGCAGGGCG TGGTCCACGT CATCGGCCCG
GAGCAGGGCT TCACGCTGCC CGGCACCACC ATCGTCTGCG GTGACAGCCA CACCTCGACG
CATGGCGCGT TCGGCGCGCT GGCGCACGGC ATCGGCACCT CCGAGGTCGA GCACGTGCTG
GCGACGCAGA CGCTGATCCA GAAGAAGGCC AAGAACATGC GCGTCACCGT TGACGGCGCA
TTGCCGGACG GCGTCACCGC GAAAGATATC ATCCTGGCGA TCATCGGCGA GATCGGCACC
GCGGGCGGCA CCGGCTACGT GCTGGAATAT GCCGGCAGTG CGATCCGCGC GCTGTCGATG
GAAGGCCGCA TGACGGTGTG CAACATGTCG ATCGAAGGCG GCGCCCGCGC CGGCCTGATC
GCACCGGACG AAAAAGCCTA TGCGTACTTG AAGGGCCGGC CGATGGCGCC GACCGGCGCC
AATTGGGACG CCGCGATGCG CTATTGGGAA ACGCTACGCT CCGACGAAGG CGCGCATTTC
GACCACGAGC TCCGGCTCGA TGCGGCCGCA CTGCCGCCGA TCGTCACCTG GGGCACCTCG
CCCGAAGACG TGATCTCGAT CTTGGGCAGC GTGCCGAACC CGGCCGACAT CGCCGACGAG
GCCAAGCGGC TCTCCAAGGA GCGCGCGCTG GCCTATATGG GCCTCACTGC CGGCACCAAG
ATCACCGACA TCAAGATCGA CCGCGCCTTC ATCGGCTCCT GCACCAACGG CCGGATCGAG
GATCTGCGCG CCGCGGCGAA AGTCGCCGAG GGCAAGACCG TCAACGGCAA CGTCAACGCC
ATCATCGTGC CGGGCTCCGG CCTGGTGAAG GAACAGGCGG AAGCCGAAGG GCTCGACAAG
ATCTTCATTG CGGCGGGCTT CGAATGGCGC GAGCCGGGCT GCTCGATGTG CCTGGCGATG
AACCCCGACA AGCTGGCGCC GGACGAGCGC TGCGCCTCGA CCTCGAACCG CAATTTCGAA
GGCCGCCAGG GCTTCAAGGG CCGCACCCAT CTGGTGTCGC CGGCGATGGC GGCGGCGGCG
GCGATCGCCG GCCACTTCGT CGATATCCGC GACTGGCGCT GA
 
Protein sequence
MTDSKSAAPT TLYDKIWNDH LVHEADDGTC LLYIDRHLVH EVTSPQAFEG LRTAGRKVHA 
PEKTLAVVDH NVPTTDRSKP NPDPESADQI AALAENAREF GVTYYNEFDK RQGVVHVIGP
EQGFTLPGTT IVCGDSHTST HGAFGALAHG IGTSEVEHVL ATQTLIQKKA KNMRVTVDGA
LPDGVTAKDI ILAIIGEIGT AGGTGYVLEY AGSAIRALSM EGRMTVCNMS IEGGARAGLI
APDEKAYAYL KGRPMAPTGA NWDAAMRYWE TLRSDEGAHF DHELRLDAAA LPPIVTWGTS
PEDVISILGS VPNPADIADE AKRLSKERAL AYMGLTAGTK ITDIKIDRAF IGSCTNGRIE
DLRAAAKVAE GKTVNGNVNA IIVPGSGLVK EQAEAEGLDK IFIAAGFEWR EPGCSMCLAM
NPDKLAPDER CASTSNRNFE GRQGFKGRTH LVSPAMAAAA AIAGHFVDIR DWR