Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3833 |
Symbol | |
ID | 3911636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4375641 |
End bp | 4376702 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637885733 |
Product | hypothetical protein |
Protein accession | YP_487437 |
Protein GI | 86750941 |
COG category | [S] Function unknown |
COG ID | [COG0392] Predicted integral membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00492954 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATCGAC TGCTGAGCGC GCTGGGGCGC GGCTTCAAGA CGTATGTCGG GTGGAAACGG CTCGGCATCG TCGCGAGCAT TCTGATCATC GGCTTTGCGA TCACGTCGCT GGTCAACACC CTGAAGGGGG TCGACAGCGC CGTCATCCTG ACCGCGCTGA CGGAGAAGTC CCCAACTCAG ATCGGGCTCG CCGCGCTGTT CGTGGTCGGC GCGTTCTGCA CGCTGACCTT CTACGATTTC TTCGCCCTGC GAACGATCGG CAAGCTGCAC GTGCCGTACC GGATCGCGGC GCTGTCGGCC TTCACCTCTT ATGTCATCGG GCACAATCTC GGCGCCACGG TGTTCACCGG CGGCGCGATC CGGTTCCGGA TCTATTCCGA CTACGGCCTC ACCGCGATCG ACGTCGCCAA GATCTGCTTC ATCTCCGGCC TGACGTTCTG GCTCGGTAAC CTGTTCGTGC TCGGCATCGG CATGATCTGG CACCCCGCCG CCGCCAGCGC GATGGATCTG TTGCCGGACA GCGTCAACCA GCTGATCGGC GTCGCCTGCC TCACCGGCAT CGCGGCGTAT TTCGTGTGGC TGGCGACCGG CAAGAAGCGC CGCCAGCTCG GCCAGAACGG CTGGAAGGTG GTACTGCCGT CGGCCAAGCT GACGCTGGTG CAGGTGCTGA TCGGCGTGGT CGATCTCGGC TTCTGCGCCA TGGCGATGTA CATGCTGATG CCGTCCGAGC CCTATATCGA CTTCGTCTCG CTGGCGGTGG TGTTCATCCT CGCCACGCTA CTCGGCTTCG CCAGCCATGC CCCCGGCAGC CTCGGCGTGT TCGACGCCGC GATGCTGGTG GCGCTGCCGA TGTTCGCCCG CGAGGACATC ATCGCCACGC TGCTGATCTA TCGCGTGTTG TATTTCCTGC TGCCGTTCGG CGTCGCGATC TCGATCCTCG GCATGCGCGA GCTGTGGCTG AGCGTGATCA AGCCGTGGCA GGAGAAACGC GCCGGCAATG GCCACCCGGT CGCCGCGGCT CCGGTCCGGC AGATCGCGCA GCGCCCGCGC AAGCAAGGCT GA
|
Protein sequence | MYRLLSALGR GFKTYVGWKR LGIVASILII GFAITSLVNT LKGVDSAVIL TALTEKSPTQ IGLAALFVVG AFCTLTFYDF FALRTIGKLH VPYRIAALSA FTSYVIGHNL GATVFTGGAI RFRIYSDYGL TAIDVAKICF ISGLTFWLGN LFVLGIGMIW HPAAASAMDL LPDSVNQLIG VACLTGIAAY FVWLATGKKR RQLGQNGWKV VLPSAKLTLV QVLIGVVDLG FCAMAMYMLM PSEPYIDFVS LAVVFILATL LGFASHAPGS LGVFDAAMLV ALPMFAREDI IATLLIYRVL YFLLPFGVAI SILGMRELWL SVIKPWQEKR AGNGHPVAAA PVRQIAQRPR KQG
|
| |