Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2195 |
Symbol | |
ID | 3907935 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 2488020 |
End bp | 2489195 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637884088 |
Product | hypothetical protein |
Protein accession | YP_485811 |
Protein GI | 86749315 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.175958 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0733507 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAATG ATAATGATGA CTGCGAAGAG CAGCTCACTT TTTTTACGAC AGAGAAACCG CACCGTACGC AACGAGAGAT CGAAGCCGCA GAGAGCCAAA TTGTCGCCCA GTTGCGAGAC GTACGATACG TAGTTCGCGA ATATCCGATC GAAGTCGTAG TTCAAATGTA CCTAAGCGGT AGAAGCGAAG ATCGCAACGA AATATATGTG CCAGACTATC AACGAGACCT AATCTGGTCG GAGAGACACA AATCTCGCTT CGTCGAGTCC CTTTTGATTG GACTCCCGAT CCCTTTTCTT TTCGTTGCAG ACGTCGGGGA CGAAGAAGAT CCAGACAAAG CTGGCAGACT AGAAATCGTT GACGGCGTCC AACGCATTCG AACCCTTGCG GAATTCCTAA CCGGCAGGCT AACACTAAGT TGTCTTGATC GCCTTGATCG TCTGAACGGT TTTCGTTTCA ATGATCTTCC AATTTCCAGA CAGCGACGCT TCCGCAGAGC TACACTCAGA TTGATTGAAC TGACAGAGGC CGTTACCGAA GACGTGCGTC GTGAGATGTT CGACCGGATC AATAGCGGAT CAGTCAACCT AAAAGCGGTC GAAGTCAGAA GGGGGATGCA ACGCGGCCCA TTTCTTGATC TTGTTACCGA ACTCGCGGCA GCCCCCCTCC TACACCAACT AGCGCCAATT TCGGACGGAC TTCGAAAGCG ATTTGAGTAT GAAGAATTAG TCACTCGCTT TTTCGCATTT CTTTACCGTT ACGAAGACTA CGGGAAAGGA GGAAAGGTCG TCTCCGAATT CTTGCTCAAC TATGTACGCG ACACGAACAA GAAATTAAGC TCTCCAGAAG GCGACAGAAT TGCTGAAGAA ATGAAGCGCC AATGGCACGA AATGCTAGAG GTAGTACGAG GCTATTTCCC CGACGGCTTC AAGAAGCGCG GCCCCGGCCG CAAGGTTCCT CGGGTACGTT TCGAGGCAAT CGCAGTGGGC ATCGGATTAG CGATAAGAGC ACTTAAGGAC GACGGAGACA GCTTTAAGAT TCTCCACATA GACAACATTG ATGAATGGCT TGAAAGTGAC GAATTCAAAG AATGGACCAC CAGTGACGCC TCGAACAACA AATCGAATCT TGTAGGACGC CTCGAATTTG TCCGTAACAA AATGCTAGGC CGATGA
|
Protein sequence | MANDNDDCEE QLTFFTTEKP HRTQREIEAA ESQIVAQLRD VRYVVREYPI EVVVQMYLSG RSEDRNEIYV PDYQRDLIWS ERHKSRFVES LLIGLPIPFL FVADVGDEED PDKAGRLEIV DGVQRIRTLA EFLTGRLTLS CLDRLDRLNG FRFNDLPISR QRRFRRATLR LIELTEAVTE DVRREMFDRI NSGSVNLKAV EVRRGMQRGP FLDLVTELAA APLLHQLAPI SDGLRKRFEY EELVTRFFAF LYRYEDYGKG GKVVSEFLLN YVRDTNKKLS SPEGDRIAEE MKRQWHEMLE VVRGYFPDGF KKRGPGRKVP RVRFEAIAVG IGLAIRALKD DGDSFKILHI DNIDEWLESD EFKEWTTSDA SNNKSNLVGR LEFVRNKMLG R
|
| |