Gene Information |
Locus tag | RPB_2386 |
Symbol | |
ID | 3909386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2738107 |
End bp | 2739144 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637884285 |
Product | hypothetical protein |
Protein accession | YP_486002 |
Protein GI | 86749506 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
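The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2738107
end_bp = 2739144

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1038 345
```

Both results agree with the Gene Length (1038 bp) and Protein Length (345 aa) fields.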
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000815162 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTGGG CCTCATGGTC GGCGGCAGCC GACCGCATTC TGCATTCGCC GAATTTGCCG ATCTGGGCCG CGATGGCCGC TGCGGCTGTT TTCGTCCTGA TTCTCGTCAT CGCGCTGGTG CGGGCCGACA GATCGGTGGC GAATGCCGTG CTCGGCGTCA TCACGCTGCT CGCCGTGGGC GTTGCCGGCC TCGCATCTTG GCGGCTGTTC GTCGACAGCG CGGGTCCGGT GGTGCGCAAC GCGGACGCGG GGCGGGCCAT CAACACCATG CCGGCGCTGG CCTGCGTCGA CGACCTCGCC GGCGATGCCG TGCAGGCCGC ATGCGAACGC GCGCTGTTCT CGTCGCCCGA GACGGTGGCG GCGGCGGTGT CCTATGCGTC GGCGCGAATG GCGCAACTGA CGGCGCAGGG CGATGTCGCT TCGGCCAGCA AGCGGATGAC GCCCGAACTC GACCGGCTGC GGCGTTCGAT CGAGCGTGAC CGCTACGGCC TGATCGCCTA TGTGCTGACG GCGCAGGATC GTTGCGTGCC TGATCAGTGC GCCGCCTTCC AGTCGCTCAC CGATCACAGC CGGATCGCGG CCAACATGAA CGAACACGTC TACGAGACCA CCGTCACCCG GGCAGCGCCC GCGTGGGGCG CGGGAAGTGG GCCGCTCGCA GCGGCCGCTC CGGTGCCTGG CGGCGTGCTC GGCGTCGCCG AGGCGCCGAG CGGCAAGCCG GTCAATATCG ATTTCGCGAC CTCGTCCTCG ATTCCGCCGA TCTCGATCAT GAACGAGCCG GCGCCGGCCG CGCCGAAGCC TGCCGCGACG GCCGCCGGCA ATGCCGCCGG TCACGCGGCG CCGTCGGCCG CCGCCAAGCG GCCGCCCGCA GCCCCGAGCG CCCAGGCTCC GGCCAACGGC CAGGCAGCCG CCGCCGGCGG GCAGGCTCCG GCGAATGCCC AGGCTGCGGT CAGGCGACCG CCGCCGCCGC CGAAACCGAA GCCGCCGGTG GCGGCGCCGG TCCAGCTTGC GCCCGCGGAG GCAAATCCGG ATAACTGA
|
Protein sequence | MEWASWSAAA DRILHSPNLP IWAAMAAAAV FVLILVIALV RADRSVANAV LGVITLLAVG VAGLASWRLF VDSAGPVVRN ADAGRAINTM PALACVDDLA GDAVQAACER ALFSSPETVA AAVSYASARM AQLTAQGDVA SASKRMTPEL DRLRRSIERD RYGLIAYVLT AQDRCVPDQC AAFQSLTDHS RIAANMNEHV YETTVTRAAP AWGAGSGPLA AAAPVPGGVL GVAEAPSGKP VNIDFATSSS IPPISIMNEP APAAPKPAAT AAGNAAGHAA PSAAAKRPPA APSAQAPANG QAAAAGGQAP ANAQAAVRRP PPPPKPKPPV AAPVQLAPAE ANPDN
|
| |
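As a rough check that the protein sequence matches the gene sequence, the first and last codons can be translated by hand. This is a minimal sketch, not the database's own pipeline: the `translate` helper is hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons).

```python
# Amino acids for the 64 codons in NCBI order (bases T, C, A, G).
# Translation tables 1 and 11 share these codon assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * b1 + 4 * b2 + b3])
    return "".join(protein)


# First and last 18 bases of the gene sequence in the record above.
print(translate("ATGGAGTGGG CCTCATGG"))  # prints: MEWASW
print(translate("GCAAATCCGG ATAACTGA"))  # prints: ANPDN*
```

The two results match the start (MEWASW) and end (ANPDN) of the listed protein sequence, with the final TGA codon acting as the stop.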