Gene Information |
Locus tag | RPB_2421 |
Symbol | |
ID | 3909555 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 2775650 |
End bp | 2776786 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637884320 |
Product | DNA processing protein DprA, putative |
Protein accession | YP_486037 |
Protein GI | 86749541 |
COG category | [L] Replication, recombination and repair [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | [TIGR00732] DNA protecting protein DprA |
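The coordinates and lengths in this table are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this, but the reported gene length only matches under that convention) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2775650, 2776786)
print(length_bp)                  # 1137
print(protein_length(length_bp))  # 378
```

Both values agree with the Gene Length and Protein Length rows above.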
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.254094 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGGCGAGC GTAGCAGTGA TCAGGGGACG ACCGTTCTCA CCGAGGCCCA GCGGGTCGAC TGGCTGCGGC TGATCCGGGC CGACAATGTC GGGCCGCGCA CCTTCCGCTC GCTGATCAAT CATTTCGGCT CGGCGCGCGC GGCGCTGGAG CGGTTGCCGG AACTGGCGCG GCGCGGCGGC GCGGCGCGCG CCGGACGTAT CCCGTCCGAA GATGAGGCGC GGCGCGAGAT CGAGGGCGGG CGGCGGCTCG GTGTCGAGCT GGTGGCGCCG GGCGAGCCTG GCTACCCGCC TCGCCTCGCG ACGATCGACG ATGCGCCGCC GCTGCTCGGC GTCCATGCGC TGCCCGACGC GCTGGCGCTG ATGCAGCGGC CGATGATCGC GATCGTCGGC TCACGCAACG CTTCCGGCGC CGGGTTGAAG TTCGCCGGCC TGCTCGCCGG CGACCTCGGC GCGAGCGGCT TCGTGGTGAT CTCCGGGCTG GCGCGCGGCA TCGACCAGGC GGCGCATCGC GCCAGTCTCG GCAGCGGCAC CGTCGCGGTG CTGGCCGGCG GCCATGACAA GATCTATCCG GCAGAGCACG AGGATCTGCT GCTCGACATT ATCGAAGCGC GCGGCGCGGC GATCTCGGAA ATGCCACTCG GCCATGTCCC ACGCGGCAAG GATTTTCCCC GCCGCAACCG GCTGATCTCC GGCGCCGCGG TCGGCGTGGT GGTGATCGAG GCGGCGTATC GCTCCGGCTC GCTGATCACC GCGCGGCGCG CCGCCGATCA GGGCCGCGAG GTGTTCGCGG TGCCGGGCTC GCCACTCGAT CCGCGCGCCG CCGGCACCAA CGATCTGATC AAGCAGGGCG CGACGCTGAT CACCTGCGCC GCCGACATCA TCCAGGCGGT GGCGCCGATC CTCGATCGGC CGATCGAGCT GCCCGGCCGC GAACCCGACC ACCCGCCGCC GGCAAGCGAT CCAGATCCCA GCGACCGCAG CCGGATCGTC AACCTGCTGG GGCCGAGCCC TGTCGGAATC GACGATCTGA TCCGGCTGAG CGGTATCGCT CCCGCGGTGG TGCGCACCGT GCTGCTGGAG CTGGAACTCG CCGGCCGGCT GGAGCGCCAT GGCGGCGGGC TGGTGTCGCT GCTGTGA
Protein sequence | MGERSSDQGT TVLTEAQRVD WLRLIRADNV GPRTFRSLIN HFGSARAALE RLPELARRGG AARAGRIPSE DEARREIEGG RRLGVELVAP GEPGYPPRLA TIDDAPPLLG VHALPDALAL MQRPMIAIVG SRNASGAGLK FAGLLAGDLG ASGFVVISGL ARGIDQAAHR ASLGSGTVAV LAGGHDKIYP AEHEDLLLDI IEARGAAISE MPLGHVPRGK DFPRRNRLIS GAAVGVVVIE AAYRSGSLIT ARRAADQGRE VFAVPGSPLD PRAAGTNDLI KQGATLITCA ADIIQAVAPI LDRPIELPGR EPDHPPPASD PDPSDRSRIV NLLGPSPVGI DDLIRLSGIA PAVVRTVLLE LELAGRLERH GGGLVSLL
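The sequences are displayed in space-separated blocks of ten characters. The sketch below shows one way such a row could be joined back into a contiguous string and sanity-checked (length, first and last codon, GC fraction). The helper names and the short demo string are hypothetical and not part of the record. The start-codon set is deliberately limited to three common bacterial start codons, which is an assumption; the displayed gene sequence begins with TTG and ends with TGA, so it would pass both checks.

```python
COMMON_STARTS = {"ATG", "GTG", "TTG"}  # assumption: common bacterial starts only
STOPS = {"TAA", "TAG", "TGA"}          # stop codons in translation table 11


def join_blocks(displayed: str) -> str:
    """Remove the whitespace between the displayed 10-character blocks."""
    return "".join(displayed.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (seq must be non-empty)."""
    return sum(base in "GC" for base in seq) / len(seq)


def check_cds(seq: str) -> dict:
    """Basic sanity checks on a coding sequence given in coding orientation."""
    return {
        "length_bp": len(seq),
        "multiple_of_3": len(seq) % 3 == 0,
        "start_ok": seq[:3] in COMMON_STARTS,
        "stop_ok": seq[-3:] in STOPS,
        "gc": gc_fraction(seq),
    }


# Hypothetical toy input laid out like the record (not the real gene).
demo = join_blocks("TTGGGCGAGC GTTGA")
print(check_cds(demo))
```

To apply this to the record, pass the full text of the Gene sequence row to `join_blocks` and compare the resulting length and GC fraction with the Gene Length and GC content rows.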