Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1703 |
Symbol | |
ID | 3908228 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1937547 |
End bp | 1939223 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637883597 |
Product | hypothetical protein |
Protein accession | YP_485322 |
Protein GI | 86748826 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.603628 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGTCGT CGACGAAGCC GACTAGCTCC AATCCAACAA CGAGACCCAA AATGAGCATC ATGACCGGCG GCGAAGCGAT CGTGCAGACC CTCGTCGCGC ACGGCGTCGA CACCGTGTTC GGCCTGCCCG GCGCGCAGAT CTACGGGTTG TTCGACGGCT TCGCAAAGGC GCAACTCCGC GTCGTCGGTG CGCGCCATGA GCAGGCCTGC GGCTACATGG CGTTCGGCTA TGCCCGCGCC TCGGGCCGCC CCGGCGTGTT CAGCGTCGTG CCCGGCCCCG GCGTGCTGAA TGCCGGCGCG GCGCTGCTCA CCGCCTTCGG CTGCAACGAG CCGGTGCTGT GCCTCACCGG TCAGGTGCCG AGCGCCTATC TCGGCAAGGG CCGTGGTCAC CTGCATGAGA TGCCGGATCA GCTCGCGACG TTGCGCAGCT TCATCAAATG GGCCGAGCGC ATGGAGTATC CCGGCAACGC GCCGGCGCTG GTCGCGCGCG CGTTCCAGGA GATGATGAGC GGCCGGCGCG GCCCGGTGGC GCTGGAAATG CCCTGGGAGG TGTTCACCCA GCGCGCCGAG ACCGCAGCGG CGATCAAGCT CGATCCGGTC GTACCGCCGC AGCCCGACCC GGACCGCGTC GCCGCCGCCG CGAAGCTGAT CGCCGCGAGC AAGACGCCGA TGATCTTCGT CGGCTCCGGT GCGCTCGACG CTGGCGACGA GATCCTCGAA CTGGCCGAAG CGATCGACGC GCCGGTCGTC GCATTTCGCA GCGGCCGCGG CATTGTCAGT AACCGGCACG ACCTCGGCCT GACCTTCGCC GCCGCCTATC GGCTGTGGCC GCAGACCGAT CTGATCATCG GCATCGGCTC GCGGATGGAA CTGCCGACGA CGTTCCGCTG GCCGTTCCGG CCGGACGGCC AGAAGTCGGT GCGGATCGAC ATCGATCCCG CCGAGATGCG CCGCTTTTCG CCGGACGCTT CGATCGTCGC CGATGCCAAG GCCGGCACTC GTGCGCTGGT CGACGCGGTG AGCAAGCGTG GCTACAACAA GACCCAAGGG CGGCGCGCGA CCATTCGCGA GGCGACCGCG CTCACGCTGG AAGCGATCCA GTCGGTGCAG CCGCAGATGG CCTATTTGAA GATCCTGCGC GAGGTGCTGC CGGACGACGC CATCGTCACC GACGAGCTGT CGCAGGTTGG ATTCGCCTCG TGGTACGGCT TCCCGATCTA CCAGCCCCGC ACCTTTCTCA CATCGGGCTA TCAGGGCACG CTCGGCTCCG GCTTCCCGAC CGCGCTCGGC GCCAAGGTCG CCTGCCCCGA CAAGCCGGTC GTCGCCATCA CCGGCGACGG CGGTTTCATG TTCGCTGTGC AGGAGCTCGC CACCGCGGTG CAGTTCAACA TCGGCGTGGT GACGCTGGTG TTCGACAATT CGGCCTATGG CAACGTCCGG CGCGACCAGG TCACCCAGTT CGAAGGCCGC GTGGTGGCGT CCGATCTGGT CAATCCGGAT TTCGTCAAGC TCGCGGAATC CTTCGGCGTC GCGGCGTCGC GGGTCGGCTC GCCCGATCAC TTCCGCGCCG CGCTGGAGAA GGCGCTGGCG CATGGCGGGC CGTATCTGAT CGCGATCGAC GTTCCGCGCG ACAGCGAAGC CAGCCCCTGG CCGTTCATCC ATCCGGCGAA GCCGTGA
|
Protein sequence | MASSTKPTSS NPTTRPKMSI MTGGEAIVQT LVAHGVDTVF GLPGAQIYGL FDGFAKAQLR VVGARHEQAC GYMAFGYARA SGRPGVFSVV PGPGVLNAGA ALLTAFGCNE PVLCLTGQVP SAYLGKGRGH LHEMPDQLAT LRSFIKWAER MEYPGNAPAL VARAFQEMMS GRRGPVALEM PWEVFTQRAE TAAAIKLDPV VPPQPDPDRV AAAAKLIAAS KTPMIFVGSG ALDAGDEILE LAEAIDAPVV AFRSGRGIVS NRHDLGLTFA AAYRLWPQTD LIIGIGSRME LPTTFRWPFR PDGQKSVRID IDPAEMRRFS PDASIVADAK AGTRALVDAV SKRGYNKTQG RRATIREATA LTLEAIQSVQ PQMAYLKILR EVLPDDAIVT DELSQVGFAS WYGFPIYQPR TFLTSGYQGT LGSGFPTALG AKVACPDKPV VAITGDGGFM FAVQELATAV QFNIGVVTLV FDNSAYGNVR RDQVTQFEGR VVASDLVNPD FVKLAESFGV AASRVGSPDH FRAALEKALA HGGPYLIAID VPRDSEASPW PFIHPAKP
|
| |