Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3451 |
Symbol | |
ID | 3911253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3957280 |
End bp | 3958680 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637885354 |
Product | hypothetical protein |
Protein accession | YP_487058 |
Protein GI | 86750562 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.139963 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGA CACCGACCGA ACGGCTGAGG GACTACCTCG CCCAGCTCCC GCCGCAGTCG CAGGCGTTGC TGATGCGCGA GTTCGAGCGC GCGCTGGAGC GCGGCGAAGA GGTCGCGGTG GCCAGCTTCG TGCTCGAGGA ACTGCGCAAG ATCGTCCGCG GCAACGGTGA AGACGAGGCC CCGCGCACCG ACGATCCGGC GCGGCTGATG TTCCGCGTGC TGGAGCCGTT CCTGATCGAC AACAGCCAGC AGCCGCGGCC GGGCCAGGTC CGCCGGGCGT CGCTGAGCTC GATCTGGCAA TGGCTGGTGC GCGAAGCGAT CCCGACCCAG GTCCGCGAAT TCGAGGGCGG GCTGATCAAC CTGCGCGGCG GCACCGCGGC GCAGATCGAC GATCTCGTCC GCAAGCTGCA GCTCGCGGCG GCCGAGGCGA TCGACAAGGT GATCAATCCG GAACAGGGCG TCGACCGGCA GCGCGCGATG TCGCGGGTCG GACCGCCCTC GACGGTCGAG GACCTACCCG GCGTCGGCGC CGTGCTCGCC AACCGCGAAG CGCTCGAAGC TTTCGACGGC AAGCTTTCGT CCAACCTGCG GACCTTCGGC GACTCGCAGG TGCATTCGAT GATCGCCGCG CTCAACGTGC CGGCGCTGCA GACCCCGATC CTGCTGCCGT TCGCGCTGAC GATGATCCTG CAGCATCTGA CCCAGCCGTG GCAGATCGTC CGGCTGGCGC TCAAGGTGGC CGGCTCCGAC GACGAGATCA GGGTCGCCGC CACCCCTTAC GGCGTCGCCG TCACCATGGC GATCCACGAC ATCGCCCAGC TCACCGCCGA TCTGCGCGAC GAGCTCAAGC GCGGCCATTA CAGCAACGTC GCCGAGAAGC TGAAGCTGGT CCATGACGGC GTTCGCGGCC TGCGCACCGA ACTCGACATC CGCAGCGATT CGGCCTGGGG CAAGCGGCTC GCCGCGATCC GCGTCGACAT TTCCAATGCG TTGAAATCCG AGATCGAGAG CGTCCCCGGC CGGGTGCGCC GGCTGCTGCG CCAGCGCCCC GACAAGGAGA TCTCGGCCAA CAGCCGGATC GATCAGATCG AGGTCGACGA GTCGGCGGCG CTGATCGATT TCGTCGCGGT GTGCCGCAAC TACGCCAGCG AACTCGCGAT CAACGAGATG ACGCTGCGGA CCTATTCCGA GCTGCAGCAA TATGTCGAGA AGTCCACCGA ATCGCTGGTG CAATCGCTGC GCGGCTGCGA ACCGCGGGTG AAGCCGTTCC GGCACATGCA GGCGCTCGCC GCGATCCGGT TCTGCGAAGT GCTGTTCGGC CACGATTATG CGCAACTGAT GCGCCGCGCG GCCGAGAGCG CGATGGTCGT GGTGGAGCGC AAGCCGGCGC GCGCTGGCTG A
|
Protein sequence | MSQTPTERLR DYLAQLPPQS QALLMREFER ALERGEEVAV ASFVLEELRK IVRGNGEDEA PRTDDPARLM FRVLEPFLID NSQQPRPGQV RRASLSSIWQ WLVREAIPTQ VREFEGGLIN LRGGTAAQID DLVRKLQLAA AEAIDKVINP EQGVDRQRAM SRVGPPSTVE DLPGVGAVLA NREALEAFDG KLSSNLRTFG DSQVHSMIAA LNVPALQTPI LLPFALTMIL QHLTQPWQIV RLALKVAGSD DEIRVAATPY GVAVTMAIHD IAQLTADLRD ELKRGHYSNV AEKLKLVHDG VRGLRTELDI RSDSAWGKRL AAIRVDISNA LKSEIESVPG RVRRLLRQRP DKEISANSRI DQIEVDESAA LIDFVAVCRN YASELAINEM TLRTYSELQQ YVEKSTESLV QSLRGCEPRV KPFRHMQALA AIRFCEVLFG HDYAQLMRRA AESAMVVVER KPARAG
|
| |