Gene Rpal_4958 details

Gene Information

Locus tag: Rpal_4958
Symbol:
ID: 6412650
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 5338389
End bp: 5340086
Gene Length: 1698 bp
Protein Length: 565 aa
Translation table: 11
GC content: 66%
IMG OID: 642714841
Product: 5'-Nucleotidase domain protein
Protein accession: YP_001993922
Protein GI: 192293317
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
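
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with one untranslated stop codon; the variable names are illustrative only.

```python
# Values copied from the Start bp and End bp fields of this record.
start_bp = 5338389
end_bp = 5340086

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1698
print(protein_length)  # 565
```

Both results agree with the Gene Length and Protein Length fields listed above.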


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.416285
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCTCGA GGCGGGAATT CCTGCAGGCG ACGGCCGCCG CATCGGCGCT GACGATCGCC 
GGCGGCCTGG GGCCGATCGG GCGGGTCGCG GCCCAGCAGC GGCTGACCCA GGGCGACATC
CTGAAATTCG ATCCGCTCGG CACGGTGACG CTGCTGCATA TCACCGACAC GCACGCGCAA
CTCGTGCCGC TGCATTTCCG CGAGCCCTCG GTCAATCTCG GTGTCGGCGA GGTCAAGGGC
AAGCCGCCGC ATCTCACGGA CGAGGAATTC CGCAAGTACT TCCATATCGC CACCGGCTCG
CCGGATGCGT TCGCGCTGAC CGCGGACGAC TTCACCGCGC TTGCCCGCAA CTACGGCAAG
ATGGGCGGCT TCGACCGGAT CGCCACGCTG GTCAAGGCCA TCCGCGCCGA GCGCGGCGCC
GACAAGGTGC TGCTGCTCGA CGGCGGCGAC GCGCTGCAGG GCAGCTGGAG CTCGCTGAAG
AGCAACGGTC AGGACATGAT CGACGCGCTC GCCGGGCTCA AAGTCGACGC GATGACCGGC
CATTGGGAGT TCACCTACGG CGCCGACCGC GTCAAGGAAA TCGCCGAGAA GGCGCCGTTC
GCGTTCCTGG CGCAGAACGT CCGCGATATC GAATGGCAGG AGCCGGTGTT CGAGGCCCGC
AAGATGTTCG AGCGCGGCGG CGTCAAGATC GCGGTGATCG GCCAGGCGTT GCCGCGCACC
GCGGTCGCCA ATCCGCGCTG GATGTTTCCG AACTGGGAGT TCGGCATCCG CGAGGAGGAC
ATGCAGAAAC AGGTCGACGA TGCGCGCGCC GAGGGCGCCG CGATCGTGGT GCTGCTGTCG
CACAACGGCT TCGACGTCGA TCGCAAGCTC GCCGGCCGGG TGAAGGGCCT CGACGTCATC
CTCACCGGCC ACACCCACGA CGCGATGCCG GGCGTGATCA AGGTCGGCGA AACCGTGCTG
GTGGCGTCGG GCTCGCACGG CAAGTTCGTG TCGCGGCTCG ACATCAAGGT CGACGGCGGC
AAGGTCGCGG ACATCCGCTT CAAGCTGATG CCGGTGTTTG CGGATGCGAT CACGCCAGAC
CCGGAGATGG CCAAGCTGGT CGAGAAGCTG CGCGAGCCTT ACGCCAAGGA TCTCGCCCGC
GTCGTCGGCA AGACCGACTC GCTCTTGTAT CGCCGCGGCA ATTTCAACGG CACCTTCGAT
GATTTGATCT GCGACGCGAT GCTGAAGCAG CGCGACACCG AAATCGCGCT GTCGCCGGGC
TTCCGCTGGG GCGGCACACT GCTGCCGGAA GAGGGCATCA CCTGGGAGGC GATCACCAAC
GCCACCGCGA TCACCTATCC GAACTGCTAC CGCACGGAGA TGACCGGCGA GCAGCTCAAG
AACGTGCTCG AAGACATCGC CGACAACATC TTCCATCCCG ACCCTTACTA TCAGGGCGGC
GGCGACATGG TGCGCACTGG CGGCATGGGC TACGCGATCG ACATCTCCAA GGAGATGGGC
TCGCGCATCT CCAACATGAC GCATCTGGCA ACCGGCAAGC CGATCGAGGC GTCGAAGAAG
TACACGGTGT CCGGCTGGGC CAGCGTCAAT CAGGGCACCG AAGGTCCGCC GATCTGGGAG
GTGCTGGAGA AGCACGTCGC CAGCGCCGGC CCGGTGAAGA TCGAACCGAA CAGCGCGGTC
AAAGTCTCCG GTGCCTGA
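
The gene sequence is printed in blocks of 10 bases, so it has to be joined before any analysis. As a minimal sketch, the helpers below (hypothetical names `clean`, `gc_percent` and `translate`) strip the spacing, compute GC content, and translate codons with the standard amino-acid assignments, which translation table 11 shares. Table 11 also permits alternative start codons; that case is not handled here because this gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order (also used by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence above.
first_bases = "ATGATCTCGA GGCGGGAATT CCTGCAGGCG"
last_line = "AAAGTCTCCG GTGCCTGA"

print(translate(first_bases))  # MISRREFLQA
print(translate(last_line))    # KVSGA
```

The two fragments reproduce the first ten and last five residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field.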
 
Protein sequence
MISRREFLQA TAAASALTIA GGLGPIGRVA AQQRLTQGDI LKFDPLGTVT LLHITDTHAQ 
LVPLHFREPS VNLGVGEVKG KPPHLTDEEF RKYFHIATGS PDAFALTADD FTALARNYGK
MGGFDRIATL VKAIRAERGA DKVLLLDGGD ALQGSWSSLK SNGQDMIDAL AGLKVDAMTG
HWEFTYGADR VKEIAEKAPF AFLAQNVRDI EWQEPVFEAR KMFERGGVKI AVIGQALPRT
AVANPRWMFP NWEFGIREED MQKQVDDARA EGAAIVVLLS HNGFDVDRKL AGRVKGLDVI
LTGHTHDAMP GVIKVGETVL VASGSHGKFV SRLDIKVDGG KVADIRFKLM PVFADAITPD
PEMAKLVEKL REPYAKDLAR VVGKTDSLLY RRGNFNGTFD DLICDAMLKQ RDTEIALSPG
FRWGGTLLPE EGITWEAITN ATAITYPNCY RTEMTGEQLK NVLEDIADNI FHPDPYYQGG
GDMVRTGGMG YAIDISKEMG SRISNMTHLA TGKPIEASKK YTVSGWASVN QGTEGPPIWE
VLEKHVASAG PVKIEPNSAV KVSGA
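
The TIGR01409 annotation refers to a Tat (twin-arginine translocation) signal sequence, which is typically marked by two consecutive arginines near the N-terminus. The sketch below searches the first 40 residues with a simplified regular expression. The pattern is an illustrative assumption and is much cruder than the profile model behind the TIGRFAM hit.

```python
import re

# Simplified twin-arginine pattern: S or T, two arginines, any residue, then F-L.
# Assumption: this regex only approximates the motif; it is not the TIGR01409 model.
TAT_MOTIF = re.compile(r"[ST]RR.FL")

# First 40 residues of the protein sequence above, with spacing removed.
n_terminus = "MISRREFLQATAAASALTIAGGLGPIGRVAAQQRLTQGDI"

match = TAT_MOTIF.search(n_terminus)
if match:
    # Report the 1-based start position and the matched residues.
    print(match.start() + 1, match.group())  # 3 SRREFL
```

The match at residue 3 is consistent with the Tat signal annotation, though a regex hit alone does not demonstrate Tat-dependent export.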