Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_5088 |
Symbol | |
ID | 6412782 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5474407 |
End bp | 5475600 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642714973 |
Product | homocitrate synthase |
Protein accession | YP_001994052 |
Protein GI | 192293447 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR02660] homocitrate synthase NifV |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGAGA TCAAGTCTGA GATCGTCCGG CCGGATCAGT CCTGCGGTTT CCAGTCCGCC CCAATCGTGC TCAACGACAC CACATTGCGC GATGGTGAGC AGGCGCCGGG TGTTGCCTTC TCCACCGCCG AGAAGGTTGC GATCGCCCGA GCGCTGGCGC GCGCCGGCGT GCCGGAAATC GAGGCGGGGA CGCCCGCGAT GGGCGTCGAT GAGATCGCGG CGATCCGCGC CATCGTCGAA GCCGGCCTGC CGCTGACCAC GATTGCCTGG TGCCGGATGC GCACCGAAGA CGTCGATGCC GCGCTGAAGG CCGGTGTGGC GATGGTCAAT GTCTCGGTGC CGGTGTCGGA CGTGCAGATC GCTGCCAAGC TCGGCGGCAA GCGGTCGAAT GCGATCGAGA CCGTCAAGCG CGTGGTCGGC TATGCCCGGG ACCGCGGCCT CGACGTCGCC GTCGGCGGTG AGGATTCCTC GCGAGCCGAT CCCGAATTCC TCGCCGAGGT GATCGCCACC GCAAAGGCAT CCGGCGCGCG CCGGTTTCGG ATCGCCGATA CGCTGAGTGT GCTCGACCCA TTCTCCAGCC ATGCGCTGCT GGCGACGCTT CGCGCCTCGA CGGACCTCGA GCTCGAATTC CACGGCCATG ACGATCTCGG CCTCGCCACC GCCAACACGC TGGCCGCGCT CCGCGCCGGT GCCACCCATG CCTCGGTGAC AGTGATCGGC CTCGGCGAAC GGGCCGGCAA TGCGCCGCTT GAAGAGGTCG CGGTGGCGCT GAAGCAGCTC TATGGCCGCG ACACCGGCAT CGTGCTGTCG GAGCTCGGCA ACGTCGCCGA TCTCGTTGCC ACCGCAGCCG CCCGTACCAT TCCGCTCAAC AAGGCGATCG TCGGTGAGCA CGTCTTCACC CATGAATCGG GAATACATGT CGATGGCCTG CTCAAGGATC AGCGCACCTA CCAGTCGCTC GATCCGAACT TGTTCGGCCG CTCCAACCGC ATTGTCATCG GCAAGCACTC CGGGCTATCG GCGATCACCT CGTCGCTCGC CAAGTTGGAT CTGCCGGCGA CCGCGGACGA GGCGCAGGGT ATCCTGGCCA AGGTCCGCCA CTATGCAGTC ACCCACAAGG GCCCGGTCGG CAACGAGACA TTGATTGCGA TTTGGCGCGA GGTCCGCGAG CGGACGCTCA CCAACTGCGC CTGA
|
Protein sequence | MSEIKSEIVR PDQSCGFQSA PIVLNDTTLR DGEQAPGVAF STAEKVAIAR ALARAGVPEI EAGTPAMGVD EIAAIRAIVE AGLPLTTIAW CRMRTEDVDA ALKAGVAMVN VSVPVSDVQI AAKLGGKRSN AIETVKRVVG YARDRGLDVA VGGEDSSRAD PEFLAEVIAT AKASGARRFR IADTLSVLDP FSSHALLATL RASTDLELEF HGHDDLGLAT ANTLAALRAG ATHASVTVIG LGERAGNAPL EEVAVALKQL YGRDTGIVLS ELGNVADLVA TAAARTIPLN KAIVGEHVFT HESGIHVDGL LKDQRTYQSL DPNLFGRSNR IVIGKHSGLS AITSSLAKLD LPATADEAQG ILAKVRHYAV THKGPVGNET LIAIWREVRE RTLTNCA
|
| |