Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1217 |
Symbol | |
ID | 3910152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1391472 |
End bp | 1392851 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637883111 |
Product | hypothetical protein |
Protein accession | YP_484838 |
Protein GI | 86748342 |
COG category | [S] Function unknown |
COG ID | [COG3864] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.555758 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAGA CCATCGCGCT GTATTGCCAT CGCGGAACGC GCGCGATCCA GCGCATGGTC GAGTTCGCGC CCTCCACCGG CGGGCTGGCA TTATGGGTGC GGCATCAGGA CCTGACCGCG GACAGCGATA CGGCGGCCGT GGTCGTACTC ACCGACGGCA CCACCGTGTA TTACGGCGCC GCCTTCGACA AGCTGCCGCT GCCCGAACAA GTCGGCCTCG TCGCCCACGA GGTGCTGCAC ATCGCGCTGC GCCATCCGCA GCGCTTCGTC GAACTGCAGC GCGTGATCGG CGACGTCGAC CTCGAATTGT TCAACATCTG CGCCGACGCC ATCGTCAATT CGACGCTGGC GCATCTGAGC TGGCTGACGC TGCCGGAAAA GTCCGTGATG CTGGAGCAGA TCCTCGCCAA GGCGCTGAGG CGCGAGCAGG ACGCCGAGGC GGCGCTGCTA GAATGGGACG TCGAGAAGCT GTATCGCGCG ATCGACGATC GCGACAGCGA CAGCAACAAC GGCAAGTCGA AGACCGGCAA CAAATCGCAA GCGGGCTCGC AGTCCGACGC CTCGGGCGCC GGCGGCGGCG ACCCGTCGCA GTCGCAATCC GAATCCGCGC AGGAGAGCGC CGAACAGCGC GCCGACGGCG CCCGCGCGTC CAAGGTGCGC GAGCTCGGCG CAGGCGGCGT CCGCGATCTG GTGCCGAATC CGGAATCACA ATCGGCGCCG GAACACGAAG CCGAGCACGC CCGCGAATGG AGCGAGCGGA TCCTGCGCGG CCACGCCGGC GACGGCGCGT TTTCGATGCT GCGGGCGCTG ATCGCAGACT TGCCGCGCAG CCGCACACCG TGGGCGCAGG TGCTGCGCGT GCAGCTTGCG CGCGGGCTGG CGCGAAAACC GTCGCTGACC TGGTCGCGGC CGGCGCGCTC CTACATCGCC AATCAGGGCC GCGCCGGCCA ACACCGGATG CCGTTCGAGC CGGGATTCTC CGCGACCAAG AACGAGCCGC GGCTGGCGCT GATCATCGAC GTCTCCGGCT CGATCGACGA CCAATTGATG GAGCGCTTCG CGCGCGAGAT CGAGACCATC ACCCGGCGGC AGGAGGCCGG GCTGGTGCTG ATCATCGGCG ACGAGCGCGT GCGGCAGGTC GAATTCTTCG AGCCGGGCCG GCGTTTCGTG CTAAGCGAGA TCGAATTCAC CGGCGGCGGC GGCACCGATT TCACCCCACT CCTCGCGGAG GCGGACCGGC ACAGGCCGGA TATCGCGGTG GTGCTGACCG ATCTCGAAGG TCCGGCGGAT TTCAAGCCGC GCTGGCCGGT GATCTGGGCG GTTCCGGAGA ACTACTCACA TGCGGTGCAG CCGTTCGGCC GGCTGCTGAC GTTGAACTAA
|
Protein sequence | MPETIALYCH RGTRAIQRMV EFAPSTGGLA LWVRHQDLTA DSDTAAVVVL TDGTTVYYGA AFDKLPLPEQ VGLVAHEVLH IALRHPQRFV ELQRVIGDVD LELFNICADA IVNSTLAHLS WLTLPEKSVM LEQILAKALR REQDAEAALL EWDVEKLYRA IDDRDSDSNN GKSKTGNKSQ AGSQSDASGA GGGDPSQSQS ESAQESAEQR ADGARASKVR ELGAGGVRDL VPNPESQSAP EHEAEHAREW SERILRGHAG DGAFSMLRAL IADLPRSRTP WAQVLRVQLA RGLARKPSLT WSRPARSYIA NQGRAGQHRM PFEPGFSATK NEPRLALIID VSGSIDDQLM ERFAREIETI TRRQEAGLVL IIGDERVRQV EFFEPGRRFV LSEIEFTGGG GTDFTPLLAE ADRHRPDIAV VLTDLEGPAD FKPRWPVIWA VPENYSHAVQ PFGRLLTLN
|
| |