Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4457 |
Symbol | |
ID | 3912273 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 5045112 |
End bp | 5046791 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637886360 |
Product | 5'-nucleotidase-like protein |
Protein accession | YP_488051 |
Protein GI | 86751555 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.25099 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.845616 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGACCC TTTTGAAACG TCGCCTCGCC CTGCTCGCGC CCGTATTGGC GCTGGCGCTG GCCTCGACCG CGCTCCGCGC CGACGATGCG GCGACGTCGG TCGAGCTGCG GATTCTCGCG ATCAATGATT TCCACGGCTA TCTGCAGCCG CCGCCGGGCG GCCTGGCGCT GCCCGACCCG GCCGATCCGG ACAAGAAGAC CAGCGTGCCG GCCGGCGGCG CCGAGCACAT GGCGACGCTG GTCCAACAGT TGCGCGCGGG ACACGCCCAC AGCATCTTCG TCGCCGCCGG CGACCTGATC GGCGCCAGCC CGTTTTTGTC GGCGATGTTT CACGACGAGC CGACGATCGA ATCGATGTCG CTGATGGGGC TGGCGCTGTC GGCGGTCGGC AATCACGAAT TCGACGAGGG CCGGACCGAG CTGCTGCGGA TGCAGCACGG CGGCTGCCAT CCGGTCGACG GCTGCCTCGG GCCGCATCCG TTTACCGGGG CGAAATTCCA GTATCTCGCC GCCTCGACGG TGGATACCGC GACCGGCAAG ACCCTGCTGC CGGCGACCGC GGTGCGTGAC TTCGGCGGCA TCCCGGTCGG CTTCATCGGC CTGACGCTGA AGACGACGCC GACCATGGTG TCGCCGCCGG GCGTCGCCGG GCTGCAGTTC AGGGACGAGG CCGAGACCGT CAACGAACAG GTCGCCGAGC TGAAGGCCCG CGGCGTCGAG GCCATCGTGG TGCTGATCCA CGAAGGCGGC TTTCCGACCG GCGGCATCAA CGACTGTCCG GGAATCTCCG GGCCGATCGT CGAGATTGTC AGGAAGTTCG ACAGGGCCGT CGACCTGGTG ATCAGCGGCC ACACCCACCG CGCCTACACC TGCACGATCG ACGGCCGGCT GGTCACCAGC GGCGACCGGT ACGGCACGCT GGTCACCGCG ATCGACCTCG TGCTCGATCC CAAGACCCGC GACGTCGTCA GCGCCAGGGC TGATAATGTC GTTGTGCGCA CCGAGACGCT GGCGAAGGAC CCGGCGCAGA CCGCGCTGAT CGACAGCTAC GACAGGCTCG CCGGGCCGAT CGCGGCGCGG CCCGCCGGCA GCGTCACCGC AGCGGTGTCG CGGATGCCGA ATGCGGCCGG CGAAAGCGCG CTCGGCGATC TGGTCGCCGA CGCTCATCTC GCCGCCACGC GCGACCGCGA GACCGGCGGC GCCGAGATCG CGATCACCAA TCCCGGCGGG CTGCGCGCCG ATATCAACGC CGCGGAAGAT GGAAAGGTGA CGTTCGGCGA TGTCTTCGCC GCGCAGCCGT TCCGCAATCA GCTGGTGACG ATGACGCTGA CCGGCGCGCA GCTCAAAGCG GCGCTGGAGC TGCAATGGCA GGATCCGGCG CGTCCGCGCA TCCTGCAGGT GTCGCGCGGC TTCAGCTATG CGTGGGACAA TACCGCCGCA CCGGGCGCGC GCATCGTCGC CGAGCGGATG CTGCTGCACG GCAAGCCGAT CACGGCCGAG CGCAGCTACC GCGTCACGAT CAACGCCTAT CTGGCCGCCG GCGGCGACGG CTTCACGCCG TTCACGCAGG GCACCGACCG CCTGACTGGC GTCTACGACG TCGACGCCCT GTTCGGCTAT TTCCGGGCGC ACAGCCCGAT CGCGCCGGCG CCGCTGGACC GGATCTCGCG CATGAACTGA
|
Protein sequence | MMTLLKRRLA LLAPVLALAL ASTALRADDA ATSVELRILA INDFHGYLQP PPGGLALPDP ADPDKKTSVP AGGAEHMATL VQQLRAGHAH SIFVAAGDLI GASPFLSAMF HDEPTIESMS LMGLALSAVG NHEFDEGRTE LLRMQHGGCH PVDGCLGPHP FTGAKFQYLA ASTVDTATGK TLLPATAVRD FGGIPVGFIG LTLKTTPTMV SPPGVAGLQF RDEAETVNEQ VAELKARGVE AIVVLIHEGG FPTGGINDCP GISGPIVEIV RKFDRAVDLV ISGHTHRAYT CTIDGRLVTS GDRYGTLVTA IDLVLDPKTR DVVSARADNV VVRTETLAKD PAQTALIDSY DRLAGPIAAR PAGSVTAAVS RMPNAAGESA LGDLVADAHL AATRDRETGG AEIAITNPGG LRADINAAED GKVTFGDVFA AQPFRNQLVT MTLTGAQLKA ALELQWQDPA RPRILQVSRG FSYAWDNTAA PGARIVAERM LLHGKPITAE RSYRVTINAY LAAGGDGFTP FTQGTDRLTG VYDVDALFGY FRAHSPIAPA PLDRISRMN
|
| |