Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3834 |
Symbol | |
ID | 3911637 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4376935 |
End bp | 4378965 |
Gene Length | 2031 bp |
Protein Length | 676 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637885734 |
Product | hypothetical protein |
Protein accession | YP_487438 |
Protein GI | 86750942 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0630158 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCACAC TGTCCCTGCT CCGTCTTCCG GCCCGCCTGG TCGCGCTCGC CGCGGCCGTT CTGCTGCCGT CCGTCGTCGC GCTCGCCCAG ACCGCGCCTC CGCCGCCGGG CCCCGAACAG CCCGGCGCCG CCCTACCGAT CATGCCTCCC GTCGCCGCGA TCCCGGTCAC CGACGGCTCG ATGCAGATCG TCTGGGAGGT CCGCAACCGC TTCCGGCTGT TTCGCGAGGA GCGCGACTTT CGCGAGCAGG CCGAAGCGCT GCGCGGGCTC ACCGTGCTCG CCTCCGAACA GGCGCTGGGG CTGCAGAGCG AGGGCCGCGG CTGGGCCCGC AACGTCGTCA ACCGGCTGTG CATCGATCTG ACCGGCCGGG TCAACGAGCC CTGCACCCGC GACGGCGTCA AGGAAAGCTA CCTGACGCCG ACCGAACACC CGGTCACGGT GCGGCTGACC GGCGCGATCC CGGTCGGCGC GATCTGCGCC TGGCTGTTCG ACGACGGCGA CGGCCCGCGG GCGTCGACGC TGGACTGCGC CGAGCCGATC AATTTCCGCG CCCGCTACGG CAAGCCGACG GTGGCGACCG TGGACGTCAC CAGCGGCGCC GACGCGCCGC TCCGGGTGAC CACCGAGATC ATGGTGCGCG ATTTCTTCAT CGCGGGGCTG GGCGACTCGA TCGCATCCGG CGAAGGCAAT CCCGACCGGC CGATCGCGCT GTCCGACGAC GGCTTCTGCT ATCGCTCCTA TCTCGGCATC GGGTCCGGCG CACGCCCCGG CCAGTATTAT CGGCCGAGCC GCGCCGGCTA CAAAGGCGGC CGCGCCTGCG AGGCGCCGGA CACGCTGGCC AACTGGCAGC GTTATTCGGC GACCTGGTTC AACGCCGCCT GCCATCGCTC GCTGTACAGC TACCAGACCC GCACCGCGCT GGCGCTGGCG GCGCGTCACC CGCACATCGC GGTGACCTAT CTGCCGCTGG CCTGCACCGG GGCGACCATC GCCGACGGGC TGTTCGGCTC GCAGCGGCCG CGCGAATGCT ATCGCACCAA GACCGGCGCC AATTGTCCCG GCAACGTCAA CAGCCAGCTC GCCGAGCTGC GCGAGGCGCT CGCCGCCGCG CGCAAACGCC AGCCGCAACG CGGGCTCGAT CTGGTGCTGC TGTCGGTCGG CGCCAACGAC ATCAACTTCT CCGGCCTGGT CGCCGACGTC ATCGTCGATA GCCCGACCGA GCGCGGCATC TTCCGCCGCT CCGGCGTGAT CGGCTCGATC GAGGAGTCGC GCAGCGCGCT GGCGCGAACC CTGCCGCAGA GCTTTTCGAA GATGCGTGAA GCGCTCAAGG GGCTGGTCGA CGACATGTCG CGCGTGGTCT ACGTCACCTA CGCCAATCCG GCGCTGGCGA ACCGAGGCGT CCCCTGCCCC GGCGGCCGCG CCGGCTTCGA CATCCATCCC TCGTTCGACG CCGACCCCAA CCGGCTGGCC GCGGTGGCCT CCTTCGTCGA CAATGAGTTC CTGCCGCGCC TGAAGGATCT GGCGCAATGC AGCGGCGGCG TGCTGTGCCG CAATCCGTCC GCCGACGCCA TGACCTTCGT CGACGCGCAT CAGCGCACCT TCGCCCATCA CGGCTTCTGC GCCCGTGCCG ACACCGATCC GGAATTCGAC CGCGCCTGCT TTTCGCCGCG CGGCGACAGC TTCACCAGCG ACATCGTCGC GGCGGCGAAT TCGCCGATGA GCTGCGGTGC CGGCGCCAGC AATTACCGCG CTTATCTGCC GCGCGCGCGC TGGATCCGCG ACGCCAATGA CAGTTACTTC GCGGCGATGA CGTTCCCGCA AGGCCTGCCC GCGGCGATCC AGCCCGCCGA CATTCACGAC GCCACCTGGG GCGTGGTGTC CGCGGTCTAT GGCGGCGCGG TCCACCCCTC CGCCGAAGGC CACGCCGCGA TGGCCGACGC CGCGGTGCCC GCCGCCGAAG CGGTGCTGCA ACTGGAGTCG GGACCGAACG TGATCAGCGC ACCGCTGCCG CCGCCGGGAG CAGTGGAGTA G
|
Protein sequence | MITLSLLRLP ARLVALAAAV LLPSVVALAQ TAPPPPGPEQ PGAALPIMPP VAAIPVTDGS MQIVWEVRNR FRLFREERDF REQAEALRGL TVLASEQALG LQSEGRGWAR NVVNRLCIDL TGRVNEPCTR DGVKESYLTP TEHPVTVRLT GAIPVGAICA WLFDDGDGPR ASTLDCAEPI NFRARYGKPT VATVDVTSGA DAPLRVTTEI MVRDFFIAGL GDSIASGEGN PDRPIALSDD GFCYRSYLGI GSGARPGQYY RPSRAGYKGG RACEAPDTLA NWQRYSATWF NAACHRSLYS YQTRTALALA ARHPHIAVTY LPLACTGATI ADGLFGSQRP RECYRTKTGA NCPGNVNSQL AELREALAAA RKRQPQRGLD LVLLSVGAND INFSGLVADV IVDSPTERGI FRRSGVIGSI EESRSALART LPQSFSKMRE ALKGLVDDMS RVVYVTYANP ALANRGVPCP GGRAGFDIHP SFDADPNRLA AVASFVDNEF LPRLKDLAQC SGGVLCRNPS ADAMTFVDAH QRTFAHHGFC ARADTDPEFD RACFSPRGDS FTSDIVAAAN SPMSCGAGAS NYRAYLPRAR WIRDANDSYF AAMTFPQGLP AAIQPADIHD ATWGVVSAVY GGAVHPSAEG HAAMADAAVP AAEAVLQLES GPNVISAPLP PPGAVE
|
| |