Gene RPD_2069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2069 
Symbol 
ID4022551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2315425 
End bp2317023 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content63% 
IMG OID637962262 
Productputative alpha-isopropylmalate/homocitrate synthase family transferase 
Protein accessionYP_569205 
Protein GI91976546 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.285589 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.580851 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCGGG AACGTTTGTA TCTCTACGAC ACCACGTTGC GCGACGGCGC GCAGACCAAC 
GGCGTCGATT TCACGCTGCA CGACAAGCGG CTGATCGCGG GGCTGCTCGA CGACCTCGGC
ATCGATTATG TCGAAGGCGG CTATCCCGGC GCCAATCCGC TCGACACCGA GTTCTTCGCC
ACCGAGCAGA AGCTCGAGCG CGCGACCTTC GCGGCGTTCG GCATGACGCG GCGGCCGGGC
CGCTCGGCCT CGAACGATCC CGGCGTCGCG CTGCTGCTCG ACGCCAAGGC GGATGCGATC
TGCTATGTCG CGAAATCGTC GGAGTATCAG GTCCGCGTCG CGCTCGAAAC CACCAACGAA
GAGAACATCG CCTCGATCCG TGACAGCGTC GCGATCGCCA AGGACAGAAG CCGCGAAGTT
CTGGTCGATT GCGAGCACTT CTTCGATGGC TACAAGGAGA ACCCGGCGTT CGCGCTGGAC
TGCGCCAAGG CGGCCTATGA GTCCGGCGCT CGCTGGGTGG TGTTGTGCGA TACCAATGGC
GGCACCATGC CCGACGAGGT CGAGGCGATC GTCGGCGAGG TGGTGAAACA CATCCCCGGC
AGCCATGTCG GCATCCACGC CCACAACGAC ACCGAACAGG CCGTGGCCGT GTCGTTCGCC
GCGGTGCGCG CCGGCGCACG ACAGATCCAG GGCACGCTGA ACGGGCTCGG CGAGCGTTGT
GGTAACGCCA ATCTGGTGTC GATGATCCCG ACGTTGAAGC TGAAGAAGGA ATTCGCCGAC
CGATTCGAGA TCGGCGTCTC CGACGACAAG CTGGCGACGC TGGTGCAGGT GTCGCGCGCG
CTCGACAATA TTCTCGACCG CGCACCCAAT CCGCACGCGC CCTATGTCGG CGGCAGCGCC
TTTGTCACGA AAACGGGGAT CCATGCCTCG GCGGTGATGA AGGACCCGCA CACCTACGAG
CACGTCACGC CGGAATCGGT CGGAAATCAT CGCAAGGTGC TGGTATCGGA TCAGGCCGGC
CGCTCCAACG TGGTGGCGGA ATTGTCGCGT ACTACGATCG AGTTCGACCG CAACGATCCG
AAGCTCGGCC GCCTGATCGA GAAGATGAAG GAGCGCGAGG CGGCCGGATA CGCCTACGAA
TCCGCCAACG CTTCGTTCGA TCTCCTGGCG CGCGGCACGC TCGGCAAGGT GCCGGAATTC
TTCCGCGTCG AGCAGTTCGA CGTCAATGTC GAGCAGCGCT ACAACTCGCA CGGCGAACGC
GTTACCGTGG CGATGGCGGT GGTCAAGGTC GAGGTCGACG GCGAGACGCT GATCTCGGCC
GCGGAAGGCA ACGGCCCGGT CAATGCGCTC GACGTCGCCT TGCGCAAGGA TCTCGGCAAG
TATCAGAAGT ACATCGAGAA CCTGAAGCTG ATCGACTATC GCGTCCGTAT CCTCAATGGC
GGCACTGAAG CGGTGACGCG CGTGCTGATC GAGAGCGAGG ACGAACTCGG CGAGCGCTGG
ACCACGATCG GCGTATCGCC GAATATCATC GACGCCTCGT TCCAGGCGCT GATGGATTCG
GTGGTCTACA AGCTGGTGAA GTCGAACGCG CCGGCGTGA
 
Protein sequence
MSRERLYLYD TTLRDGAQTN GVDFTLHDKR LIAGLLDDLG IDYVEGGYPG ANPLDTEFFA 
TEQKLERATF AAFGMTRRPG RSASNDPGVA LLLDAKADAI CYVAKSSEYQ VRVALETTNE
ENIASIRDSV AIAKDRSREV LVDCEHFFDG YKENPAFALD CAKAAYESGA RWVVLCDTNG
GTMPDEVEAI VGEVVKHIPG SHVGIHAHND TEQAVAVSFA AVRAGARQIQ GTLNGLGERC
GNANLVSMIP TLKLKKEFAD RFEIGVSDDK LATLVQVSRA LDNILDRAPN PHAPYVGGSA
FVTKTGIHAS AVMKDPHTYE HVTPESVGNH RKVLVSDQAG RSNVVAELSR TTIEFDRNDP
KLGRLIEKMK EREAAGYAYE SANASFDLLA RGTLGKVPEF FRVEQFDVNV EQRYNSHGER
VTVAMAVVKV EVDGETLISA AEGNGPVNAL DVALRKDLGK YQKYIENLKL IDYRVRILNG
GTEAVTRVLI ESEDELGERW TTIGVSPNII DASFQALMDS VVYKLVKSNA PA