Gene Rpal_2003 details


Gene Information

Locus tag: Rpal_2003
Symbol:
ID: 6409663
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 2167982
End bp: 2169232
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 65%
IMG OID: 642711889
Product: Radical SAM domain protein
Protein accession: YP_001991001
Protein GI: 192290396
COG category: [R] General function prediction only
COG ID: [COG4277] Predicted DNA-binding protein with the Helix-hairpin-helix motif
TIGRFAM ID:
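
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both assumptions are consistent with the values in this record, but the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2167982, 2169232)
print(length)                     # 1251
print(protein_length_aa(length))  # 416
```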


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACGTGC AGCGCAAGCT GGCCATCCTG GCGGACGCGG CGAAATACGA CGCCTCCTGC 
GCCTCCAGCG GCACGGAGAA ACGCGACAGC CAGGGCGGCA AGGGTCTCGG CTCCACTGCG
CCCGGCATGG GCATCTGTCA TTCATATGCG CCGGACGGGC GCTGCATCTC GCTGCTCAAG
GTGCTGCTCA CCAACGCCTG TGCTTATGAC TGCCTGTATT GCGTCAACCG GGCCTCTTCG
AACGTGCCGC GCGCCCGCTT CACCATCGAC GAGGTGGTGC AGCTCACGCT CGACTTCTAT
CGCCGCAACT ACATCGAGGG GCTGTTTCTA TCCTCGGGCA TCATCCGCAG CGCCGACTAC
ACCATGGAGC AGATCGTCGA AGTGGCGCGG CGCCTCCGGG AAGAGCATCA CTTCCGCGGC
TACATTCATC TGAAGACGAT CCCGGAAGCC GACGACGCGC TGATCGCCAA GGCCGGCCGC
TATGCGGACC GGCTCAGCAT CAATATCGAG ATGCCGGAGG AGACCAGCCT CGCTCAATTG
GCGCCGGAGA AGAACGTCCG CGCCATCCGC CGCACCATGG GGCGGCTGCG GCTGAAGCTG
GATGAAGCGA CCGAGGCCAA GGCCGAGGCG CGTAAAACCC CCACCCGCGC CAAGCCGCCG
CGCTTCGCAC CGGCCGGCCA GAGCACGCAG ATGATTGTCG GCGCCGACAG CGCCACCGAC
CAGACCATCC TCAATACATC CGCCAATCTC TACGGCTCGT ACAATCTGAA GCGGGTGTAC
TACTCGGCGT TCAGCCCGAT CCCGGATTCC AGCCGCGCGC TGCCGCTGCA GGCGCCGCCA
TTGGTGCGCG AACATCGGCT GTACCAGGCC GACTGGCTGA TACGGTTCTA CGGCTTCGAC
GCCGGCGAGA TCATCGATCC GGCCGCCGGC ATGCTGTCAC TGGAGATGGA TCCGAAGCTC
GCCTGGGCGC TGCGCCACCG CGAGCGCTTC CCGCTCGACG TCAACCGCGC CAGCCGCGAG
GAATTGCTGC GCATTCCCGG CTTCGGCCGC AAGGCGGTCG ACCGCATTAT CGATACGCGG
CGCTACAGCG CGATCCGCGC TGCGGACCTC GCCAAACTCC ACATCCCAAG GAACAAGGCG
CTGCCGTTCA TCGTTCTCCC CGACCACCGC CCGCCGACCT ATCTGCTCGA CGGAGCGCGG
CTGGCGGAGC GGTTTCAACC GAAAGCACAG CAACTGGGAT TTGGGTTTTG A
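
The gene sequence is displayed in blocks of 10 bases, so the spacing has to be removed before any computation. As a minimal sketch, the hypothetical helpers below strip that whitespace and compute the percentage of G and C bases; they illustrate how a figure like the GC content field could be recomputed from the sequence, not how the database itself derives it.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First display line of the gene sequence, copied from this record.
first_line = "ATGGACGTGC AGCGCAAGCT GGCCATCCTG GCGGACGCGG CGAAATACGA CGCCTCCTGC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

To check the whole gene, pass all of the sequence lines as one string to `clean_sequence` before calling `gc_percent`.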
 
Protein sequence
MDVQRKLAIL ADAAKYDASC ASSGTEKRDS QGGKGLGSTA PGMGICHSYA PDGRCISLLK 
VLLTNACAYD CLYCVNRASS NVPRARFTID EVVQLTLDFY RRNYIEGLFL SSGIIRSADY
TMEQIVEVAR RLREEHHFRG YIHLKTIPEA DDALIAKAGR YADRLSINIE MPEETSLAQL
APEKNVRAIR RTMGRLRLKL DEATEAKAEA RKTPTRAKPP RFAPAGQSTQ MIVGADSATD
QTILNTSANL YGSYNLKRVY YSAFSPIPDS SRALPLQAPP LVREHRLYQA DWLIRFYGFD
AGEIIDPAAG MLSLEMDPKL AWALRHRERF PLDVNRASRE ELLRIPGFGR KAVDRIIDTR
RYSAIRAADL AKLHIPRNKA LPFIVLPDHR PPTYLLDGAR LAERFQPKAQ QLGFGF