Gene Rpal_1647 details


Gene Information

Locus tag: Rpal_1647
Symbol:
ID: 6409304
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 1763992
End bp: 1765200
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 65%
IMG OID: 642711536
Product: cobalamin synthesis protein P47K
Protein accession: YP_001990651
Protein GI: 192290046
COG category: [R] General function prediction only
COG ID: [COG0523] Putative GTPases (G3E family)
TIGRFAM ID:
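
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (the record does not state this, but it matches the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1763992
end_bp = 1765200

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
assert gene_length % 3 == 0
codon_count = gene_length // 3

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = codon_count - 1

print(gene_length)     # 1209
print(protein_length)  # 402
```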


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATCACC CCCTTCTCCC GGTCACCGTT CTGTCCGGCT TTCTCGGCGC CGGCAAAACC 
ACCCTTCTCA ACCGGATGCT GTCCAACCGT GAAGGCCGGC GGCTGGCGGT GATCGTCAAT
GACATGAGCG AGGTGAACAT TGACGCCGAG CTGGTGCGCC AGGATGGCGG GTTGACGCGG
TCGGAGGAAA CATTGGTCGA GATGACCAAT GGCTGCATCT GCTGCACGCT GCGTGAGGAC
CTGCTGCTCG AAGTCCGCAA GCTCGCCGAG AGCGGCCGGT TCGACGGGTT GGTGATCGAA
TCCACCGGCA TCGCTGAGCC GCTGCCGATC GCCGCGACGT TCGAATTCCG CGACGAGAAC
GGCGCCAGTC TGTCCGACAT CGCGCGGCTC GACACCATGG TGACGGTGGT CGATACGGCC
AGCCTGCTGG CGAACTACTC CAGCCAGGAA TTCCTGCGCG ACCGCGGCGA AGTCGCCGGC
GCCGACGACG AGCGGACACT GGTCGATCTG CTGGTCGAAC AGATCGAATT CGCCGACGTC
ATCGTGCTCA ACAAGGTGTC GGCGACCTCG CCGAGCCGGC GCGAAACCGC GCGGTCGATC
GTGCGAGCGC TGAACTCGGA TGCCAGGATC ATCGAAGCCG ATTTCGGCGA CGTACCTCTG
TCGTCGGTCT TGCACACCGG ACTGTTCGAT TTCGAGCGCG CGCATCAGCA CCCGCTGTGG
TTCAAGGAAC TAAACGGCTT CGCCGACCAC GTGCCGGAAA CCGAAGAATA CGGCATCCGC
TCATTCGTGT ACCGGGCGCG GCGGCCGTTT CATCCGCTCC GCTTCCAGTC GTTCTGCAAC
CGGAGCTGGC CGGGCGTAAT CCGCGCCAAG GGCTTCTTCT GGCTGGCGAC GCGACCGCAG
CATGTCGGTG AAATCGCGCA GGCCGGCGCG ATGGTACGCA CCTCCAAGCG CGGACTGTGG
TGGTCGGCGG TGCCGAAGAC GCGCTGGCCC GACAACGCCG ACTGGCACGA GGCGATGAAG
CCGTATTTCG ATCCGGTGTG GGGCGACCGC CGCCAGGAAA TCGTGTTCAT CGGCACCGGC
GAGATGGACG AAGCCGCGCT GCGCCGCCAG CTCGATCTCT GTCTCGTCGG TGAAGATGCG
CAGTTCACGC CGGATGCGTG GCAGCAATTG CCCGATCCGT TCCCGAGCTG GGCGCCGGCA
CAGAACTGA
 
Protein sequence
MDHPLLPVTV LSGFLGAGKT TLLNRMLSNR EGRRLAVIVN DMSEVNIDAE LVRQDGGLTR 
SEETLVEMTN GCICCTLRED LLLEVRKLAE SGRFDGLVIE STGIAEPLPI AATFEFRDEN
GASLSDIARL DTMVTVVDTA SLLANYSSQE FLRDRGEVAG ADDERTLVDL LVEQIEFADV
IVLNKVSATS PSRRETARSI VRALNSDARI IEADFGDVPL SSVLHTGLFD FERAHQHPLW
FKELNGFADH VPETEEYGIR SFVYRARRPF HPLRFQSFCN RSWPGVIRAK GFFWLATRPQ
HVGEIAQAGA MVRTSKRGLW WSAVPKTRWP DNADWHEAMK PYFDPVWGDR RQEIVFIGTG
EMDEAALRRQ LDLCLVGEDA QFTPDAWQQL PDPFPSWAPA QN
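
As a rough illustration of how the gene sequence maps onto the protein sequence, the sketch below translates only the first 30 bases and the last 9 bases of the gene. The codon table is deliberately partial: it holds just the codons found in those two excerpts, and these assignments are the same in translation table 11 and the standard genetic code. The `translate` helper is hypothetical and is not part of any database tooling.

```python
# Partial codon table: only the codons present in the two excerpts below.
# "*" marks a stop codon.
CODONS = {
    "ATG": "M", "GAT": "D", "CAC": "H", "CCC": "P", "CTT": "L",
    "CTC": "L", "CCG": "P", "GTC": "V", "ACC": "T", "GTT": "V",
    "CAG": "Q", "AAC": "N", "TGA": "*",
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at a stop codon."""
    dna = dna.replace(" ", "").upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODONS[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

first_30 = "ATGGATCACC CCCTTCTCCC GGTCACCGTT"  # start of the gene sequence
last_9 = "CAGAACTGA"                           # end of the gene sequence

print(translate(first_30))  # MDHPLLPVTV
print(translate(last_9))    # QN
```

The two results match the first ten and last two residues of the protein sequence listed above. Translating the full gene would need the complete table 11 codon set.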