Gene Rpal_5088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_5088 
Symbol 
ID6412782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5474407 
End bp5475600 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content66% 
IMG OID642714973 
Producthomocitrate synthase 
Protein accessionYP_001994052 
Protein GI192293447 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR02660] homocitrate synthase NifV 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGAGA TCAAGTCTGA GATCGTCCGG CCGGATCAGT CCTGCGGTTT CCAGTCCGCC 
CCAATCGTGC TCAACGACAC CACATTGCGC GATGGTGAGC AGGCGCCGGG TGTTGCCTTC
TCCACCGCCG AGAAGGTTGC GATCGCCCGA GCGCTGGCGC GCGCCGGCGT GCCGGAAATC
GAGGCGGGGA CGCCCGCGAT GGGCGTCGAT GAGATCGCGG CGATCCGCGC CATCGTCGAA
GCCGGCCTGC CGCTGACCAC GATTGCCTGG TGCCGGATGC GCACCGAAGA CGTCGATGCC
GCGCTGAAGG CCGGTGTGGC GATGGTCAAT GTCTCGGTGC CGGTGTCGGA CGTGCAGATC
GCTGCCAAGC TCGGCGGCAA GCGGTCGAAT GCGATCGAGA CCGTCAAGCG CGTGGTCGGC
TATGCCCGGG ACCGCGGCCT CGACGTCGCC GTCGGCGGTG AGGATTCCTC GCGAGCCGAT
CCCGAATTCC TCGCCGAGGT GATCGCCACC GCAAAGGCAT CCGGCGCGCG CCGGTTTCGG
ATCGCCGATA CGCTGAGTGT GCTCGACCCA TTCTCCAGCC ATGCGCTGCT GGCGACGCTT
CGCGCCTCGA CGGACCTCGA GCTCGAATTC CACGGCCATG ACGATCTCGG CCTCGCCACC
GCCAACACGC TGGCCGCGCT CCGCGCCGGT GCCACCCATG CCTCGGTGAC AGTGATCGGC
CTCGGCGAAC GGGCCGGCAA TGCGCCGCTT GAAGAGGTCG CGGTGGCGCT GAAGCAGCTC
TATGGCCGCG ACACCGGCAT CGTGCTGTCG GAGCTCGGCA ACGTCGCCGA TCTCGTTGCC
ACCGCAGCCG CCCGTACCAT TCCGCTCAAC AAGGCGATCG TCGGTGAGCA CGTCTTCACC
CATGAATCGG GAATACATGT CGATGGCCTG CTCAAGGATC AGCGCACCTA CCAGTCGCTC
GATCCGAACT TGTTCGGCCG CTCCAACCGC ATTGTCATCG GCAAGCACTC CGGGCTATCG
GCGATCACCT CGTCGCTCGC CAAGTTGGAT CTGCCGGCGA CCGCGGACGA GGCGCAGGGT
ATCCTGGCCA AGGTCCGCCA CTATGCAGTC ACCCACAAGG GCCCGGTCGG CAACGAGACA
TTGATTGCGA TTTGGCGCGA GGTCCGCGAG CGGACGCTCA CCAACTGCGC CTGA
 
Protein sequence
MSEIKSEIVR PDQSCGFQSA PIVLNDTTLR DGEQAPGVAF STAEKVAIAR ALARAGVPEI 
EAGTPAMGVD EIAAIRAIVE AGLPLTTIAW CRMRTEDVDA ALKAGVAMVN VSVPVSDVQI
AAKLGGKRSN AIETVKRVVG YARDRGLDVA VGGEDSSRAD PEFLAEVIAT AKASGARRFR
IADTLSVLDP FSSHALLATL RASTDLELEF HGHDDLGLAT ANTLAALRAG ATHASVTVIG
LGERAGNAPL EEVAVALKQL YGRDTGIVLS ELGNVADLVA TAAARTIPLN KAIVGEHVFT
HESGIHVDGL LKDQRTYQSL DPNLFGRSNR IVIGKHSGLS AITSSLAKLD LPATADEAQG
ILAKVRHYAV THKGPVGNET LIAIWREVRE RTLTNCA