Gene Information |
Locus tag | Rpal_4784 |
Symbol | |
ID | 6412470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5148123 |
End bp | 5149094 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642714662 |
Product | Rubrerythrin |
Protein accession | YP_001993749 |
Protein GI | 192293144 |
COG category | [S] Function unknown |
COG ID | [COG1633] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
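The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length_bp = gene_length(5148123, 5149094)
print(length_bp)                  # 972
print(protein_length(length_bp))  # 323
```

Both results match the Gene Length (972 bp) and Protein Length (323 aa) fields.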
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.667039 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGCGT TTTCCGACCT CACCGAGCGT GAAATCCTGG CCGTCGCCAT CTCGGGCGAG GAGGAAGACA GCCGGATCTA TCTCGCGTTC GCCGAGGATC TGGCCGAGCG CTACCCCGAT TCCGCGCGCG TGTTCACCGA GATGGCGCAG CAGGAGAAGG GCCACCGCCA CATGCTGCTC CGGATGTACG AACAGCGGTT CGGGCCGGAT CTGCCGCCGA TCCGCCGCGA GGACGTCAAA GGCTTCATCC GCCGCCGCCC GATCTGGCTC ACCCGCAATC TGCCGCTCGA CCGCATCCGC AAGGAAGCCG AGACGATGGA ATTCGAGGCG CAGCGCTTCT ACGAGCGCGC GGCCGAACGC GCCACCGACG TCCACATCCG CAAGCTGCTG TCGGATCTCG CCGAGTTCGA GAAGCGCCAC GAGCAGCGCG CCACCCAGCT CACCGACAAG ATCCTCACTC CGGATGCGCG CAGCGCCGAA GACCATGCCG CGCGGCGGAT GTTCGTGCTG CAATACGTCC AGCCTGGTCT GGCCGGCCTG ATGGACGGCT CGGTGTCGAC GCTGGCGCCG CTGTTCGCCG CGGCCTTCGC CACCCACCAG AACTGGCCGA CCTTCCTGGT CGGTCTCGCC GCCTCGATCG GCGCCGGCAT CTCGATGGGC TTTGCCGAAG CGCTGTCCGA CGACGGCTCG ATGACCGGGC GCGGCTCGCC GTGGCTGCGC GGCGGCATCT GCGGCCTGAT GACCACGCTC GGTGGCCTCG GTCACACGCT GCCCTATCTT GTGCCCGATA GCTGGGCGAA CGCGTTCTGG ATCGCGACCG GCATCGCCGG CCTGGTGGTG TTCGTCGAAT TGTGGGCGAT CGCCTACATC CGCGCCCGCT ACATGGACAC GCCGTTCCTG CACGCGGTGT TTCAGATCGT GCTCGGTGGC GTCATCGTGC TGGCGGTGGG CATCCTGATA GGCGGCGCGT AG
|
Protein sequence | MKAFSDLTER EILAVAISGE EEDSRIYLAF AEDLAERYPD SARVFTEMAQ QEKGHRHMLL RMYEQRFGPD LPPIRREDVK GFIRRRPIWL TRNLPLDRIR KEAETMEFEA QRFYERAAER ATDVHIRKLL SDLAEFEKRH EQRATQLTDK ILTPDARSAE DHAARRMFVL QYVQPGLAGL MDGSVSTLAP LFAAAFATHQ NWPTFLVGLA ASIGAGISMG FAEALSDDGS MTGRGSPWLR GGICGLMTTL GGLGHTLPYL VPDSWANAFW IATGIAGLVV FVELWAIAYI RARYMDTPFL HAVFQIVLGG VIVLAVGILI GGA
|
| |
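The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how the gene sequence could be cleaned up, checked for GC content, and translated follows. It assumes the gene sequence is already given in coding orientation (it starts with ATG and ends with TAG even though the gene lies on the minus strand), and it uses the standard amino-acid assignments, which translation table 11 shares with table 1. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, not part of any database API.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino-acid assignments (the same assignments are used by table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
fragment = "ATGAAGGCGT TTTCCGACCT CACCGAGCGT"
print(translate(fragment))  # MKAFSDLTER
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` in the same way would let the GC content and Protein sequence fields be cross-checked.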