Gene RPB_2421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2421 
Symbol 
ID3909555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2775650 
End bp2776786 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content72% 
IMG OID637884320 
ProductDNA processing protein DprA, putative 
Protein accessionYP_486037 
Protein GI86749541 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.254094 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGGCGAGC GTAGCAGTGA TCAGGGGACG ACCGTTCTCA CCGAGGCCCA GCGGGTCGAC 
TGGCTGCGGC TGATCCGGGC CGACAATGTC GGGCCGCGCA CCTTCCGCTC GCTGATCAAT
CATTTCGGCT CGGCGCGCGC GGCGCTGGAG CGGTTGCCGG AACTGGCGCG GCGCGGCGGC
GCGGCGCGCG CCGGACGTAT CCCGTCCGAA GATGAGGCGC GGCGCGAGAT CGAGGGCGGG
CGGCGGCTCG GTGTCGAGCT GGTGGCGCCG GGCGAGCCTG GCTACCCGCC TCGCCTCGCG
ACGATCGACG ATGCGCCGCC GCTGCTCGGC GTCCATGCGC TGCCCGACGC GCTGGCGCTG
ATGCAGCGGC CGATGATCGC GATCGTCGGC TCACGCAACG CTTCCGGCGC CGGGTTGAAG
TTCGCCGGCC TGCTCGCCGG CGACCTCGGC GCGAGCGGCT TCGTGGTGAT CTCCGGGCTG
GCGCGCGGCA TCGACCAGGC GGCGCATCGC GCCAGTCTCG GCAGCGGCAC CGTCGCGGTG
CTGGCCGGCG GCCATGACAA GATCTATCCG GCAGAGCACG AGGATCTGCT GCTCGACATT
ATCGAAGCGC GCGGCGCGGC GATCTCGGAA ATGCCACTCG GCCATGTCCC ACGCGGCAAG
GATTTTCCCC GCCGCAACCG GCTGATCTCC GGCGCCGCGG TCGGCGTGGT GGTGATCGAG
GCGGCGTATC GCTCCGGCTC GCTGATCACC GCGCGGCGCG CCGCCGATCA GGGCCGCGAG
GTGTTCGCGG TGCCGGGCTC GCCACTCGAT CCGCGCGCCG CCGGCACCAA CGATCTGATC
AAGCAGGGCG CGACGCTGAT CACCTGCGCC GCCGACATCA TCCAGGCGGT GGCGCCGATC
CTCGATCGGC CGATCGAGCT GCCCGGCCGC GAACCCGACC ACCCGCCGCC GGCAAGCGAT
CCAGATCCCA GCGACCGCAG CCGGATCGTC AACCTGCTGG GGCCGAGCCC TGTCGGAATC
GACGATCTGA TCCGGCTGAG CGGTATCGCT CCCGCGGTGG TGCGCACCGT GCTGCTGGAG
CTGGAACTCG CCGGCCGGCT GGAGCGCCAT GGCGGCGGGC TGGTGTCGCT GCTGTGA
 
Protein sequence
MGERSSDQGT TVLTEAQRVD WLRLIRADNV GPRTFRSLIN HFGSARAALE RLPELARRGG 
AARAGRIPSE DEARREIEGG RRLGVELVAP GEPGYPPRLA TIDDAPPLLG VHALPDALAL
MQRPMIAIVG SRNASGAGLK FAGLLAGDLG ASGFVVISGL ARGIDQAAHR ASLGSGTVAV
LAGGHDKIYP AEHEDLLLDI IEARGAAISE MPLGHVPRGK DFPRRNRLIS GAAVGVVVIE
AAYRSGSLIT ARRAADQGRE VFAVPGSPLD PRAAGTNDLI KQGATLITCA ADIIQAVAPI
LDRPIELPGR EPDHPPPASD PDPSDRSRIV NLLGPSPVGI DDLIRLSGIA PAVVRTVLLE
LELAGRLERH GGGLVSLL