Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4522 |
Symbol | |
ID | 3912339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 5109767 |
End bp | 5111212 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637886426 |
Product | hypothetical protein |
Protein accession | YP_488116 |
Protein GI | 86751620 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.909404 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTATTTT CGACTCATGG AGATCAAAAC GTTCATGTTC CTGGAGACAG GTTCGAGGCT TTAGGCTTGG CGGACCTAAG GCGAATTCGC TTCTCAGGCA GGGCTCAATC AGAACAGACC TTCGCAGGCA ACTTCGTTGA GTGCCTTTTC GACGGCGTCG AGCTTGACAA GATCAACTTC TCGAATTGCG ACTGGAAGGA TTGTCGAGTT TACAATACCA CTTTTATCGA CTGCGAACTG GGCGACGCGT CATTCATTAC AAACCTGTTC GATCAGTGCA GCTTTATCCA ATGCAAGTTT CCAAATACTG GGATCAGCGA TTGCAGCTTC CGCAATTGTG TTTTCGAGGA CTGTGATCTC AGCAACATCG TAATGAAGTC AAATCGAATC GAGCGATCTC GATTCAGTCG ATGCTCAACT TCAAACAGGG TCATTGAGAG TAGCTTACTC ATTGACACCT CTTGGAGTGG TATGAATTTG GAAGTTGGTC TGATCCTTGG AAACTTCGGT CTCGAGCGGT CTAATGTTCA GGCCTGCGTA CTGTTCGAAA AACAGCCTGA CCGCAGCTTG CGGGAAATCA CTTGGTCTGA CCTCGGTCGT ATAGGAGGTC ACCGGCCACT GAGCCCCATC GAAACATTCC GTTTGGCATT CTTTGAGACC GGGGACGCAG ACGGAGATCC CGACGCACTT GAACAGGCAA TCAATCCACG AAATTGGACC AGCGAGGCGA TTATAGAGTC GAGCTTCGGA GCTCTACTGA ATTCGTTTGC TCAATTTTTG CTGGCGCTAT TCGCCGCGAA CCGTTTACCC GTCTATGGTC TACTCCGCTT TCACACTCAT AATTTTGCAC TACTGGAATG GCTGTCAGGG AAGCCCGAGT TTTCATCCCT TTATCAATCT GCCGCGGGCG TCCATCTTTC TGTGACTCGG GAAGTTGACG CTTACGCTAG AGCAGTTCAG GAAATAGTCG ACGCTCATAC AAACTTGCTC CATATTCGAC TTGAAGCAGA TGGCCCCCTG GATCCTAGCT ATTACGAATC GCTCTTTCGT GAAGCCGATG GCGGGCAGAT TCGGGTAGAT TGGGTTCGCC CGCGCAATTC TCCGGTAGAA GTCTCCTTAA ATTTCTTCGA CTATGCAACT TTGCTGTCGA TAGTCGCTCT CGTACTCGCA ACCCGAACGA AATTTGAACT GTCAAAAATA CAGTCAATGG GATGGATAGC GTCTCCACCG ACAGGCGATC GAAATGGAGA AGCCGCCAAC GGCAAGCAAT TGATCGCATT TCGAACAGGC TTCTCTCTCG ATCGTCCCTC TGAGTATGAA ATTAATGTGC GAACATTACT GCCGCGCTCC TTCCTTTTAG ATCTACACCT TTGCTTGAAC ATCTCTCTGT TCAAGAGGGT CAGAGGGGTG TTGATTGGAT TGCTACTGCC ACCTAAAGAC CCTTGA
|
Protein sequence | MLFSTHGDQN VHVPGDRFEA LGLADLRRIR FSGRAQSEQT FAGNFVECLF DGVELDKINF SNCDWKDCRV YNTTFIDCEL GDASFITNLF DQCSFIQCKF PNTGISDCSF RNCVFEDCDL SNIVMKSNRI ERSRFSRCST SNRVIESSLL IDTSWSGMNL EVGLILGNFG LERSNVQACV LFEKQPDRSL REITWSDLGR IGGHRPLSPI ETFRLAFFET GDADGDPDAL EQAINPRNWT SEAIIESSFG ALLNSFAQFL LALFAANRLP VYGLLRFHTH NFALLEWLSG KPEFSSLYQS AAGVHLSVTR EVDAYARAVQ EIVDAHTNLL HIRLEADGPL DPSYYESLFR EADGGQIRVD WVRPRNSPVE VSLNFFDYAT LLSIVALVLA TRTKFELSKI QSMGWIASPP TGDRNGEAAN GKQLIAFRTG FSLDRPSEYE INVRTLLPRS FLLDLHLCLN ISLFKRVRGV LIGLLLPPKD P
|
| |