Gene Information |
Locus tag | RPD_0216 |
Symbol | |
ID | 4020674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 247862 |
End bp | 248824 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637960395 |
Product | transposase, IS4 |
Protein accession | YP_567357 |
Protein GI | 91974698 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3039] Transposase and inactivated derivatives, IS5 family |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.10412 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCAGG ACAGTTTCAT TGAGGCTTTG CTGCCGGCGG GCTTTGGGCG CAACGAGCGG CTGGAGCGCA TTTCCGGCCT GATCGATTGG GACCGTCTTG CCGTCCTGAT CCGCAAGGTG CGGCCGGGCG AGACCGGTCG CCCGCCCTAC GCGCCGCTGG CGATGTTCAA AGCCTTGCTG CTGCAGCAGT GGTACGGACT GTCCGACCCC GGCCTCGAAG AGGCCCTGCT CGACCGGGTT TCGTTTCGGC GCTTTTGCGG GTTCGCGCTG GATGCGCAGA CACCCGACGA GACGACGCTT TGCCGGTTTC GCAATGCCTT GCAGCAGGCC GGCCTGGGAG ACGCCCTGTT TCAGGAGGTC CTCCGGCAGC TCGAAGCGGC TGGATATGTC TTGAAGACAG GGACCTTGAT CGACGCCACC CTGGTTCAGA GTTCAGGGCG AACGCCCCCG TCGGGATCGA CACCGCGCGA GGTCGAGAGC CGTTCGTCAC ACGACCCGCA GGCCGACTGG ACACGGCACG GCGCCGGACG AAGGCTGTTC TTCGGCTACA AGGCGCACAT CGCGATCGAT CAGGGATCGG GGCTGATCCG GGCCCGCAAA CTGACCGGGG CGAAGACGTA TGAGAGCGAA GTCGCCGACG ATCTCGTCCT CGGCGACGAG AAGGCGGTCT ATGCCGACAA GGCCTACGAG AAGCGGGCGC GCCGCCAAGC TCTCAAAGCC CGCGGCATCA AGGATCGCAT CCAGCATCGT CGCAACAAAC ACCAGAAGGC GCTGCCGCGA TGGCAGGCAG TGCGCAACAA GCTGATCGGC CGTGTTCGGC AGGCGATCGA ACGCACCTTC GCCCAACTGA AAGGCCGCTA CGGCTTCACC CGCATGCGCT ACGCAGGCAT CACCGCCAAC GCATTCCACC TCGATCTGAT CTCGATCGCC TACAACCTGC GAACCGCAGC CGCCATCCGC TGA
Protein sequence | MAQDSFIEAL LPAGFGRNER LERISGLIDW DRLAVLIRKV RPGETGRPPY APLAMFKALL LQQWYGLSDP GLEEALLDRV SFRRFCGFAL DAQTPDETTL CRFRNALQQA GLGDALFQEV LRQLEAAGYV LKTGTLIDAT LVQSSGRTPP SGSTPREVES RSSHDPQADW TRHGAGRRLF FGYKAHIAID QGSGLIRARK LTGAKTYESE VADDLVLGDE KAVYADKAYE KRARRQALKA RGIKDRIQHR RNKHQKALPR WQAVRNKLIG RVRQAIERTF AQLKGRYGFT RMRYAGITAN AFHLDLISIA YNLRTAAAIR
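The coordinate and length fields in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the helper names (`gene_length`, `protein_length`, `strip_blocks`) are hypothetical and not part of any database API, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def strip_blocks(seq: str) -> str:
    """Remove the whitespace that splits a sequence into 10-character blocks."""
    return "".join(seq.split())


if __name__ == "__main__":
    # Values copied from the record: Start bp, End bp.
    length = gene_length(247862, 248824)
    print("Gene length (bp):", length)
    print("Protein length (aa):", protein_length(length))
    # First two blocks of the gene sequence, joined into one string.
    print(strip_blocks("ATGGCGCAGG ACAGTTTCAT"))
```

Under these assumptions the computed values agree with the Gene Length (963 bp) and Protein Length (320 aa) fields listed above.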