Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3930 |
Symbol | |
ID | 3911737 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4483569 |
End bp | 4485113 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637885834 |
Product | hypothetical protein |
Protein accession | YP_487534 |
Protein GI | 86751038 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.12121 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.390263 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGGTG CCGAAAGTCT GGTGCGGACG CTGGTCCATG GCGGCGTGGA CGTCTGCTTC ACCAATCCCG GCACCTCGGA AATGCACTTC GTCGCCGCGC TCGACCGCGT CGAGGGCATG CGCTGCGTGC TCGGCCTGTT CGAGGGCGTG GTCACCGGCG CCGCCGACGG TTATTTCCGG ATGAAAGGCA CGCCGGCCTC GACGCTGCTG CATCTCGGCC CCGGCCTCGC CAACGGCCTC GCCAATCTGC ACAACGCCAA GAAGGCGAGT TCCGGCATCG TCAACATCGT CGGCCAGCAC GCCACCTACC ACATCGACTA CAACGCGCCG CTGACCTCCG ACATCGAGGG CCTGGCGCGG CCGATGTCGG CCTGGGTCCG CACCTCGCCG GACGCGCAAT CGGTGGCGCG CGACGGCGCC GCCGCGATTG CCGCCGCGAA GAGCGCGCCG CCGCAGATCG CCACCCTGAT CCTGCCCGCC GACACCGCCT GGGGCGAGGC CGACGGCATC GCCGAGGTGC CGCAAGACAC CCAGCGCCCG AGCTATTCGC CGCACGCGGT GGAAGCCGCG GCGCGCGTGC TGCGCTCCGG CGAGCCGACG CTGCTGCTGC TGACCGGCGG CGCGCTCACC GAACACGGCC TCGAGCTCGC CGCGCGGATC GCCGGCAAGA CCGGCTGCCG TGTGATGGGC CAGACCTACA ATCCGCGGAT GGCGCGCGGT CGCGGCCGCT ATGCGATCGA GCGGATTCCC TATGTGATCG AAGCCGCGCT GCCGATCCTG AAGGACTTCC GCCACATCGT GCTGGTCGAG GCCAACGATC CGGTGGCGTT CTTCGCCTAT CCGAACAAGC CGAGCCTGCT GAAACCGGAC GGCTGCGAGG TACATCGCAT GACCGAGGGC GGCGAGAATT CCACCGCAGC GCTCGAAGCG CTGGCCGGCG CGCTCGGCGC CAAGGCGGCC GACGCCCAGC CGCAGACCCA TGTCGAGATC GCGCGGCCGA GCGGCGCGCT GACCCATGCC TCGATCGCCC AGGCGATCGC GATGGCGATC CCGGACAACG CCATCGTGAT CGACGAATCG ATCACTACCG GCCGCGGCTT CTTTCCGCCG ACGGCGGCGG CGGCGCCGCA CGACTGGCTG CAGAACATGG GCGGCTCGAT CGGGTTCTCG CCGCCGGTCG CGGTCGGCGC CGCGGTGGCG TGCCCGGATC GCAAGGTGAT CTGCCTGGTC GGCGACGGCA GCGCGATGTA CACGCTGCAG GCGCTGTGGA CCCAGGCTCG CGAAAATCTC GACGTCACCA CCGTGGTGTT CGCCAACCGC AAATATCAGA TCCTGCGCGG CGAGTTCGAC GGCGTCGGCG CCGGCGAGCC GGGCCAGCGC GCGCAGGACA TGCTGTCGCT GGATCGGCCG AACCTCGACT GGGTGTCGCT GGCCCGGGGC ATGGGCGTGC CGGCCCGCGC CGTGACCAGC GCCGATGAAC TCAACAAGGC GCTCGACGCC GGCGTCGCCG GCAGCGGTCC GAATTTGATC GAAGTGCAGA TGTAG
|
Protein sequence | MNGAESLVRT LVHGGVDVCF TNPGTSEMHF VAALDRVEGM RCVLGLFEGV VTGAADGYFR MKGTPASTLL HLGPGLANGL ANLHNAKKAS SGIVNIVGQH ATYHIDYNAP LTSDIEGLAR PMSAWVRTSP DAQSVARDGA AAIAAAKSAP PQIATLILPA DTAWGEADGI AEVPQDTQRP SYSPHAVEAA ARVLRSGEPT LLLLTGGALT EHGLELAARI AGKTGCRVMG QTYNPRMARG RGRYAIERIP YVIEAALPIL KDFRHIVLVE ANDPVAFFAY PNKPSLLKPD GCEVHRMTEG GENSTAALEA LAGALGAKAA DAQPQTHVEI ARPSGALTHA SIAQAIAMAI PDNAIVIDES ITTGRGFFPP TAAAAPHDWL QNMGGSIGFS PPVAVGAAVA CPDRKVICLV GDGSAMYTLQ ALWTQARENL DVTTVVFANR KYQILRGEFD GVGAGEPGQR AQDMLSLDRP NLDWVSLARG MGVPARAVTS ADELNKALDA GVAGSGPNLI EVQM
|
| |