Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1641 |
Symbol | |
ID | 3909918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1871011 |
End bp | 1871940 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637883535 |
Product | protein tyrosine phosphatase |
Protein accession | YP_485260 |
Protein GI | 86748764 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG0394] Protein-tyrosine-phosphatase [COG0640] Predicted transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTCGT CATCTAATTC GTCGAGAATA ATCGACAGAT TGGATGGGGT CATGCTGGCG TCGAGCAACA CAACACGCGA AACGGACGCC GTCGAAGGCT TCGGATCTCT GGCGCAGCCG ACCCGGCTCG CCGCGGTCCG GCTCCTGTTG TCGGCGTATC CAGCATCGTT GTCGGCTGGA GAGATCGCCC GGCGATGCGA CGTGCCGCAC AACACGATGT CGACGCATCT CGGCATCCTG CAGCGCGCCG GGTTGATCGG CGTCGAAAAG ACCGGCCGCT CGATGAACTA CCGCGCCGAC CCCGCCGGCT TCCGCAGCCT GATCGCGTTC CTGGCCCGCG ACTGCTGCAG CGGCCGCCCC GACATCTGCG CCGACATCTT CGATGTTACC AAGCCCGCAC CTCCCCTGCC GATGGAGACG TTCATGACTC CCGCCTTCAA CGTCCTGTTC CTTTGCACGC AGAATTCCGC GCGCTCGATC ATCGCCGAAG CCCTGCTGGA GAAAGTCGGG CAAGGCCGCT TCCGCGGTTA CTCGGCAGGC TCCGCGCCGG CGCAGCAACC GCTGCCGCAA GTGATCGAGC GCCTGCAGGC GCTGGGTCAC GATGTGACGC GGCTCCATTC GAAATCCTGG GACGAATTCA AGCGGCCGGA TGCGCCGCGG ATGGATTTCA TCATCGCGCT TTGCGATACG CCGAGCGGTC AGATCTGCCC GGATTTCGGT GGACAATATG TCACCGCCGC GTGGCCGCTG CCCGATCCGG CGCAGTTTTC GGGCTCGGAG GTCGAACGCA CAACGCTGCT CAACGAGCTT TACGCGATGA TCCGCAGGCG TCTCGAAATC TTCACCAGCC TGCCGTTCGA GTCGCTCGAC CGAATGGCGG TGAAGGCCCG CCTCGATGAA ATCGGCGACA CCAACCTCGT CAAGCCCTGA
|
Protein sequence | MTSSSNSSRI IDRLDGVMLA SSNTTRETDA VEGFGSLAQP TRLAAVRLLL SAYPASLSAG EIARRCDVPH NTMSTHLGIL QRAGLIGVEK TGRSMNYRAD PAGFRSLIAF LARDCCSGRP DICADIFDVT KPAPPLPMET FMTPAFNVLF LCTQNSARSI IAEALLEKVG QGRFRGYSAG SAPAQQPLPQ VIERLQALGH DVTRLHSKSW DEFKRPDAPR MDFIIALCDT PSGQICPDFG GQYVTAAWPL PDPAQFSGSE VERTTLLNEL YAMIRRRLEI FTSLPFESLD RMAVKARLDE IGDTNLVKP
|
| |