Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3721 |
Symbol | |
ID | 3911523 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4258053 |
End bp | 4259111 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637885623 |
Product | putative DNA-binding protein |
Protein accession | YP_487327 |
Protein GI | 86750831 |
COG category | [R] General function prediction only |
COG ID | [COG3943] Virulence protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.299367 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCCTG ACGACGGCAA GGGCGAAGGC GAACTGATCC TCTACCGCAC GGAGGATGGC CGCGATGCGT TGCAACTGCG GCTGGTGGAC GGGACGGTCT GGCTGACCCA GGCCGAAATT GCGCTGCTGT TCGACACCAC CAAGCAGAAC GTCAGCCTGC ACTTCAAGAG CATCTTTGCC GACGCCGAAC TGCCCGAGGG GGCAGTTGTC AAGGATTCCT TGACAACTGC GGCCGACGGC AAACGCTATA CGACAAAGCT CTACAACCTC AATGCCATCC TGGCGGTCGG CTACCGCGTG CGCAGCGCCC GCGGGGTGCA GTTCCGGCGC TGGGCCACCG AGGTCTTGAA CGACTATCTG GTCAAGGGCT TCGTCATCAA CGACGAGCGC CTGAAGGATC CTGACGGCTT CGATTATTTC GACGAGCTGC TCGAACGCAT CCGCGACATC CGGGCGTCCG AAAAACGGTT CTACCAGAAA GTCCGCGACC TGTTCGCCGC CACCAGCGCC GACTACGACC CCAAGGCGGA GGCCGCCAAG GCCTTCTTCG CCACGATCCA GAACAAGCTG GTGTTCGCCA TCACCGGGAT GAACGCCGCC GAACTGATCG TCACGAGGGC CGATCCTGCG CGGCCGAACA TGGCGCTGAC CAGCTGGAAG GGCGATCGCG TCCGCAAAAG CGACGTCACC ATCTCGAAAA ATTATCTCAC CGCCGACGAG ATCAGCGACC TCAACCTGCT GACCACGGCC TTCCTGGATT TTGCCGAGCT GCGCGCCCGC AACCGCCAGC CGACCACCAT GGCCGAATGG ATGGCGCAGA CCGATCGCTT CGTCGCCTTC AACGAACGCG GCGTGCTGCA GGGCGCCGGC CGCGTCTCCC ATACGAGCAT GGAGCAGGTC GTGGCCGAGC GCTTCGAGAC TTTCGACAAA CGCCGTCGCG CCGCCGAGAC CGATGCAGCC GAGGCGGAGG CGATCAGCGA GCTGACTGAA CTCGAGCAGC AGGCGCGATC GTCGAAGCCA CGCCCTGAGG TCAAAAAACC GCCGAAGAAG CTATCCTGA
|
Protein sequence | MNPDDGKGEG ELILYRTEDG RDALQLRLVD GTVWLTQAEI ALLFDTTKQN VSLHFKSIFA DAELPEGAVV KDSLTTAADG KRYTTKLYNL NAILAVGYRV RSARGVQFRR WATEVLNDYL VKGFVINDER LKDPDGFDYF DELLERIRDI RASEKRFYQK VRDLFAATSA DYDPKAEAAK AFFATIQNKL VFAITGMNAA ELIVTRADPA RPNMALTSWK GDRVRKSDVT ISKNYLTADE ISDLNLLTTA FLDFAELRAR NRQPTTMAEW MAQTDRFVAF NERGVLQGAG RVSHTSMEQV VAERFETFDK RRRAAETDAA EAEAISELTE LEQQARSSKP RPEVKKPPKK LS
|
| |