Gene Information |
Locus tag | RPB_4165 |
Symbol | |
ID | 3911973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4736463 |
End bp | 4737593 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637886069 |
Product | hypothetical protein |
Protein accession | YP_487768 |
Protein GI | 86751272 |
COG category | [S] Function unknown |
COG ID | [COG1565] Uncharacterized conserved protein |
TIGRFAM ID | |
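The coordinate and length fields above are mutually consistent, and the relation can be checked with simple arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a final stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for RPB_4165.
length = gene_length_bp(4736463, 4737593)
print(length, protein_length_aa(length))
```

Run as written, this prints `1131 376`, matching the Gene Length and Protein Length rows.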
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.576249 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0725771 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGACG ACTCGCCGTT GCTTGCCGAG ATCAAGCGGC TGATCGAGAC CGCCGGCCCG ATGCCGGTGT GGCGCTATAT GGAGCTCTGC CTCGCCCACC CCGAATACGG CTACTACGTG TCGCGCGATC CGCTCGGCCG CGAGGGCGAT TTCACCACCT CGCCCGAGAT CAGCCAGATG TTCGGCGAAC TGATCGGGCT GTGGACCGCG TCGGTGTGGA AGGCTGTCGG CGAACCGGGC GTGCTGCGGC TGATCGAGAT CGGTCCCGGC CGCGGCACCA TGATCGCCGA CGCGCTGCGG GCGCTGCGGG TGCTGCCGCC GCTGTATCAG TCGCTCAGCG TGCATCTGGT CGAGATCAAT CCGGTGCTGC GCGCCAAGCA ACAGGCGACG CTGGCCGGCA TCCGCAACGT GCACTGGCAC GAGGATTTCG CCGAGGTGCC GGAAGGCCCG GCGGTGGTGC TCGCCAATGA ATATTTCGAC GTGCTGCCGA TCCACCAGGC GGTGAAGCGC GACGGCGGCT GGCACGAGCG CGTGATCGAG ATCAGCGCCA GCGGCGACCT GGTGTTCGGC GTCGCCGACG ATCCGATTCC GCGCTTCGAG GTGCTGCTGC CGCCGCTGGT GCAGATGGCC CCGGCCGGCA CGGTGTTCGA ATGGCGTCCG GACAACGAGA TCATGGCGAT CGCCGCGCGG CTGCGCGACC AAGGCGGGGC GGCGCTGATC ATCGACTACG GCCATGTCCG CAGCGACGTC GGCGACACCT TCCAGGCGAT CGCGCGCCAC TCTTTCGCCG ACCCACTGCA GCATCCCGGC GGCGCCGACC TCACCGCCCA TGTCGATTTC CAGGCGCTCG GCCGCGCCGC CGAGACGATC GGCGCCCGCA TCCACGGCCC GGTGACGCAG GGCGAATTCC TCAAGCGGCT GGGGATCGAG ACCCGCGCGC TGTCGCTGAT GGCCAAAGCG AGCGCGCAAG TCTCCGAGGA CATCGCCGGC GCGCTGAAGC GACTGACCGG CGAAGGCCGC GGCGGGATGG GCGCGATGTT CAAGGTGATC GGCGTTTCGG ATCCGAGCAT CACCTCGCTG GTGGCGCTGA GCGACGACGC CGAACGCGCC GCGGAGGGAC AAAAGGCATG A
|
Protein sequence | MNDDSPLLAE IKRLIETAGP MPVWRYMELC LAHPEYGYYV SRDPLGREGD FTTSPEISQM FGELIGLWTA SVWKAVGEPG VLRLIEIGPG RGTMIADALR ALRVLPPLYQ SLSVHLVEIN PVLRAKQQAT LAGIRNVHWH EDFAEVPEGP AVVLANEYFD VLPIHQAVKR DGGWHERVIE ISASGDLVFG VADDPIPRFE VLLPPLVQMA PAGTVFEWRP DNEIMAIAAR LRDQGGAALI IDYGHVRSDV GDTFQAIARH SFADPLQHPG GADLTAHVDF QALGRAAETI GARIHGPVTQ GEFLKRLGIE TRALSLMAKA SAQVSEDIAG ALKRLTGEGR GGMGAMFKVI GVSDPSITSL VALSDDAERA AEGQKA
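The gene begins with GTG while the protein begins with M, which is expected under translation table 11, where GTG can act as an initiator codon read as methionine. The following is a minimal Python sketch of that translation, assuming the reading frame starts at the first base and that table 11 uses the standard amino acid assignments for internal codons. The names `CODON_TABLE`, `START_CODONS`, and `translate_cds` are hypothetical helpers written for illustration.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (bases T, C, A, G).
# Table 11 is assumed to share these and to differ only in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiator codons listed for table 11; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("GTGAACGACG ACTCGCCGTT GCTTGCCGAG"))
```

Run as written, this prints `MNDDSPLLAE`, the first ten residues of the protein sequence above. Passing the full gene sequence should reproduce the whole protein under the same assumptions.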