Gene Rpal_5041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_5041 
Symbol 
ID6412735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5422710 
End bp5423753 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content65% 
IMG OID642714926 
Productguanosine 5'-monophosphate oxidoreductase 
Protein accessionYP_001994005 
Protein GI192293400 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0516] IMP dehydrogenase/GMP reductase 
TIGRFAM ID[TIGR01305] guanosine monophosphate reductase, eukaryotic 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCACATCG ATTTGAACCC GAAGCTCGAC TTTCGCGACG TGCTGATCCG GCCCAAGCGT 
TCGGTGCTGT CGTCGCGTTT CGAGGCGAAT ATCAAGCGCA GCCTCCGCTT CCGCCATTCC
AGCCGGGCCT GGACCGGCTT TCCGCTGATC GCTTCGAACA TGGACACCAT CGGCACGCTG
GAGATGGCGA AGGCCTTCAA ACCGTTCGGC GCGCTGGTCG CCTTGCACAA GTTCCATCAT
CCGGATCGGC TGGCCGAGTA TCTTGCCGGC GACGAGGACG CCAACGTGTT CGTCACCGTC
GGTACCGGTT CGGCCGATTG GGAGCGGCTC GCTGCGGTGA AGGCCAAGGT GAAGGTGCCG
ATGCTGAACA TCGACGTGGC CAACGGCTAC ACCGAAGCCT TCGTGCGTGC GGTGGCCAGG
CTGCGCGACG AGAACCCGGA CGCGATCATC ATGGCCGGCA CCGTGGTCAC CGCCGAGATG
ACCGAGGCGC TGGTGCTCGC CGGCGCGGAC ATCGTCCGCG TCGGTATCGG CTCAGGTTCG
GTGTGTACCA CGCGTGATCT CACCGGCGTC GGCTATCCGC AGCTCTCGGC TGTGATCGAA
TGTGCCGATG CGGCACACGG GCTGAAGGGC CACGTCTGTT CGGACGGCGG TTGCGTGGTC
CCCGGCGATC TCGCCAAGGC CTATGGCGGC GGCGCGGATT TCGTGATGCT CGGCGGCATG
CTGGCGGGCC ATGACGAATG CGGCGGCGAG CTGCGCTACG CTGAGCAGAA CGGGCAGAAG
ACCCCGACCA GCATGGTGTT CTACGGCATG TCGTCGGAGA CCGCGATGAA CAAGTATCAC
GGCGGCGTCG CCGATTATCG CGCCGCCGAA GGCAAGACCG TCGAGGTGCC GTATCGCGGC
GAGGTGCATG CCACGGTCGA AAAGATCGCC GGCGGCCTGC GCTCGGCGAT GACCTATATC
GGCGCTGAGA ACCTGAAGGA AATTCCGAAG CGGACCACGT TCATCCTGGT CAACGCCCAG
CGCAACACGG TGTTCGACCG CTGA
 
Protein sequence
MHIDLNPKLD FRDVLIRPKR SVLSSRFEAN IKRSLRFRHS SRAWTGFPLI ASNMDTIGTL 
EMAKAFKPFG ALVALHKFHH PDRLAEYLAG DEDANVFVTV GTGSADWERL AAVKAKVKVP
MLNIDVANGY TEAFVRAVAR LRDENPDAII MAGTVVTAEM TEALVLAGAD IVRVGIGSGS
VCTTRDLTGV GYPQLSAVIE CADAAHGLKG HVCSDGGCVV PGDLAKAYGG GADFVMLGGM
LAGHDECGGE LRYAEQNGQK TPTSMVFYGM SSETAMNKYH GGVADYRAAE GKTVEVPYRG
EVHATVEKIA GGLRSAMTYI GAENLKEIPK RTTFILVNAQ RNTVFDR