Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3098 |
Symbol | |
ID | 3910899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3530560 |
End bp | 3531654 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637885002 |
Product | hypothetical protein |
Protein accession | YP_486707 |
Protein GI | 86750211 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.616034 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGGCC TGAATTTCGG ACGAACTCTG GTCGCTGTGG TCGCGAGCGT CGCCGTCTGC CTGGGCGGCG TCGACACCGC CGCAGCGCTC GAGAAGGTGC GGGTGCTGAT CCCGGTGAGG AACATCGACG AGGCGTTCTC ACCCTTCGTG GTCGCGAAGG AGAAGGGCTA CTTCGCCGAG GAGGGCCTGG ACGTGACCTT GATCGCGGTT GGCGGCTCCA ATGAGTCCGC CATTCAAGTC TCAGCCGGCA ATGGCGATGT CGGCGCGGCG TCGCCCGGCG AGGCGCTGGT CGGCATTCAG GCCGGCAAGC TCGACGTGCG CTATTTCTAC AGCCTTTACT ATCAGAATAT CTGGCACGTC GCCGTGCTGC CGGACAGTCC GATCAAGGCC ATCGCCGACC TCAAGGGCAA GAAGCTCGGC GTGCAGTCGC TCGGCAGCGC CGGCACGACG TTCGGCAAGG CGTTCGTGCA GCAGGCCGGC CTCGATCCGC AGACAGACGT CGCGTTCCTG CCGGTCGGCG TGGGTGCGCA AGCGGTGACG TCGGTCCGCC AGAAGTTCGT CGACGCGGTG GTGTATTGGG ACGCCGCGCT CGCCAAGTTC AAGTTCTCCG GGCTCGATCT GCGCGAGGTC CCGGCACCGG AAGGAATCCG TTCGCTACCG GACGTCGGGC TGCTCGCCAC CAGCGACACG ATCGCCAAGA AGCCGAAGAT GCTGATTGGC GTCTCCCGCG GCGTTGCCAA GGGTTACGAC TACTCGATGG CCAATCCGAG GGCCGCAGTG CTGATCACCT GGAAGGCGTA TCCGGAGGCG AAATCGAAGA ATCCCGACCC CGCGGCTGCG CTCGAGGAAG GCATCACCGT CAATCAGGCG CGGCTCCGCA TCTGGAATTC GCCGAAGACC GAGGATCAGC ACGGCCGCCT GATCGAAGCG GATTGGCAGC GCTTGGTCGA TTTTTTCGTC GCCCAGAAGG TGCTGCCGGG GGCAGTTCCG GTGGACCGCG TCATCACCAA CCAGTTCGTC AAGGATGCCA ACAGCTACGA CCGCCAGGCG GTGATCGCCG ACGCCAAGAA GACAGACGTC TCAAAGCTGG ACTGA
|
Protein sequence | MPGLNFGRTL VAVVASVAVC LGGVDTAAAL EKVRVLIPVR NIDEAFSPFV VAKEKGYFAE EGLDVTLIAV GGSNESAIQV SAGNGDVGAA SPGEALVGIQ AGKLDVRYFY SLYYQNIWHV AVLPDSPIKA IADLKGKKLG VQSLGSAGTT FGKAFVQQAG LDPQTDVAFL PVGVGAQAVT SVRQKFVDAV VYWDAALAKF KFSGLDLREV PAPEGIRSLP DVGLLATSDT IAKKPKMLIG VSRGVAKGYD YSMANPRAAV LITWKAYPEA KSKNPDPAAA LEEGITVNQA RLRIWNSPKT EDQHGRLIEA DWQRLVDFFV AQKVLPGAVP VDRVITNQFV KDANSYDRQA VIADAKKTDV SKLD
|
| |