Gene RPB_4457 details


Gene Information

Locus tag: RPB_4457
Symbol:
ID: 3912273
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 5045112
End bp: 5046791
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 69%
IMG OID: 637886360
Product: 5'-nucleotidase-like protein
Protein accession: YP_488051
Protein GI: 86751555
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases
TIGRFAM ID:
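
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5045112
end_bp = 5046791
gene_length_bp = 1680
protein_length_aa = 559

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
expected_protein_length = codons - 1

print("span (bp):", span)
print("codons:", codons, "remainder:", remainder)
print("expected protein length (aa):", expected_protein_length)
print("matches record:", span == gene_length_bp
      and remainder == 0
      and expected_protein_length == protein_length_aa)
```

Under these assumptions the span is 1680 bp, which is 560 codons, giving 559 amino acids plus one stop codon.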


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.25099
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.845616
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGACCC TTTTGAAACG TCGCCTCGCC CTGCTCGCGC CCGTATTGGC GCTGGCGCTG 
GCCTCGACCG CGCTCCGCGC CGACGATGCG GCGACGTCGG TCGAGCTGCG GATTCTCGCG
ATCAATGATT TCCACGGCTA TCTGCAGCCG CCGCCGGGCG GCCTGGCGCT GCCCGACCCG
GCCGATCCGG ACAAGAAGAC CAGCGTGCCG GCCGGCGGCG CCGAGCACAT GGCGACGCTG
GTCCAACAGT TGCGCGCGGG ACACGCCCAC AGCATCTTCG TCGCCGCCGG CGACCTGATC
GGCGCCAGCC CGTTTTTGTC GGCGATGTTT CACGACGAGC CGACGATCGA ATCGATGTCG
CTGATGGGGC TGGCGCTGTC GGCGGTCGGC AATCACGAAT TCGACGAGGG CCGGACCGAG
CTGCTGCGGA TGCAGCACGG CGGCTGCCAT CCGGTCGACG GCTGCCTCGG GCCGCATCCG
TTTACCGGGG CGAAATTCCA GTATCTCGCC GCCTCGACGG TGGATACCGC GACCGGCAAG
ACCCTGCTGC CGGCGACCGC GGTGCGTGAC TTCGGCGGCA TCCCGGTCGG CTTCATCGGC
CTGACGCTGA AGACGACGCC GACCATGGTG TCGCCGCCGG GCGTCGCCGG GCTGCAGTTC
AGGGACGAGG CCGAGACCGT CAACGAACAG GTCGCCGAGC TGAAGGCCCG CGGCGTCGAG
GCCATCGTGG TGCTGATCCA CGAAGGCGGC TTTCCGACCG GCGGCATCAA CGACTGTCCG
GGAATCTCCG GGCCGATCGT CGAGATTGTC AGGAAGTTCG ACAGGGCCGT CGACCTGGTG
ATCAGCGGCC ACACCCACCG CGCCTACACC TGCACGATCG ACGGCCGGCT GGTCACCAGC
GGCGACCGGT ACGGCACGCT GGTCACCGCG ATCGACCTCG TGCTCGATCC CAAGACCCGC
GACGTCGTCA GCGCCAGGGC TGATAATGTC GTTGTGCGCA CCGAGACGCT GGCGAAGGAC
CCGGCGCAGA CCGCGCTGAT CGACAGCTAC GACAGGCTCG CCGGGCCGAT CGCGGCGCGG
CCCGCCGGCA GCGTCACCGC AGCGGTGTCG CGGATGCCGA ATGCGGCCGG CGAAAGCGCG
CTCGGCGATC TGGTCGCCGA CGCTCATCTC GCCGCCACGC GCGACCGCGA GACCGGCGGC
GCCGAGATCG CGATCACCAA TCCCGGCGGG CTGCGCGCCG ATATCAACGC CGCGGAAGAT
GGAAAGGTGA CGTTCGGCGA TGTCTTCGCC GCGCAGCCGT TCCGCAATCA GCTGGTGACG
ATGACGCTGA CCGGCGCGCA GCTCAAAGCG GCGCTGGAGC TGCAATGGCA GGATCCGGCG
CGTCCGCGCA TCCTGCAGGT GTCGCGCGGC TTCAGCTATG CGTGGGACAA TACCGCCGCA
CCGGGCGCGC GCATCGTCGC CGAGCGGATG CTGCTGCACG GCAAGCCGAT CACGGCCGAG
CGCAGCTACC GCGTCACGAT CAACGCCTAT CTGGCCGCCG GCGGCGACGG CTTCACGCCG
TTCACGCAGG GCACCGACCG CCTGACTGGC GTCTACGACG TCGACGCCCT GTTCGGCTAT
TTCCGGGCGC ACAGCCCGAT CGCGCCGGCG CCGCTGGACC GGATCTCGCG CATGAACTGA
 
Protein sequence
MMTLLKRRLA LLAPVLALAL ASTALRADDA ATSVELRILA INDFHGYLQP PPGGLALPDP 
ADPDKKTSVP AGGAEHMATL VQQLRAGHAH SIFVAAGDLI GASPFLSAMF HDEPTIESMS
LMGLALSAVG NHEFDEGRTE LLRMQHGGCH PVDGCLGPHP FTGAKFQYLA ASTVDTATGK
TLLPATAVRD FGGIPVGFIG LTLKTTPTMV SPPGVAGLQF RDEAETVNEQ VAELKARGVE
AIVVLIHEGG FPTGGINDCP GISGPIVEIV RKFDRAVDLV ISGHTHRAYT CTIDGRLVTS
GDRYGTLVTA IDLVLDPKTR DVVSARADNV VVRTETLAKD PAQTALIDSY DRLAGPIAAR
PAGSVTAAVS RMPNAAGESA LGDLVADAHL AATRDRETGG AEIAITNPGG LRADINAAED
GKVTFGDVFA AQPFRNQLVT MTLTGAQLKA ALELQWQDPA RPRILQVSRG FSYAWDNTAA
PGARIVAERM LLHGKPITAE RSYRVTINAY LAAGGDGFTP FTQGTDRLTG VYDVDALFGY
FRAHSPIAPA PLDRISRMN
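
The sequence listings above are printed in blocks of ten characters, so the spaces and line breaks have to be stripped before any computation. The following is a minimal sketch of how the GC content field could be recomputed from the gene sequence and how the final stop codon could be checked. The helper names (`clean_sequence`, `gc_fraction`, `ends_with_stop`) are hypothetical. The stop codons listed are TAA, TAG and TGA, which is what NCBI translation table 11 uses. Only the first line of the gene sequence is pasted in as sample input, so the printed fraction describes that line only, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Return the fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Stop codons in NCBI translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def ends_with_stop(raw: str) -> bool:
    """Check that the cleaned sequence is whole codons and ends in a stop codon."""
    seq = clean_sequence(raw)
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Sample input: the first line of the gene sequence listing only.
first_line = "ATGATGACCC TTTTGAAACG TCGCCTCGCC CTGCTCGCGC CCGTATTGGC GCTGGCGCTG"
print(len(clean_sequence(first_line)), "bases in the sample")
print("GC fraction of the sample:", round(gc_fraction(first_line), 3))
```

To reproduce the value in the record, pass the full gene sequence text to `gc_fraction` and compare the result with the reported 69%.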