Gene RPB_4522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4522 
Symbol 
ID3912339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp5109767 
End bp5111212 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content49% 
IMG OID637886426 
Producthypothetical protein 
Protein accessionYP_488116 
Protein GI86751620 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.909404 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTATTTT CGACTCATGG AGATCAAAAC GTTCATGTTC CTGGAGACAG GTTCGAGGCT 
TTAGGCTTGG CGGACCTAAG GCGAATTCGC TTCTCAGGCA GGGCTCAATC AGAACAGACC
TTCGCAGGCA ACTTCGTTGA GTGCCTTTTC GACGGCGTCG AGCTTGACAA GATCAACTTC
TCGAATTGCG ACTGGAAGGA TTGTCGAGTT TACAATACCA CTTTTATCGA CTGCGAACTG
GGCGACGCGT CATTCATTAC AAACCTGTTC GATCAGTGCA GCTTTATCCA ATGCAAGTTT
CCAAATACTG GGATCAGCGA TTGCAGCTTC CGCAATTGTG TTTTCGAGGA CTGTGATCTC
AGCAACATCG TAATGAAGTC AAATCGAATC GAGCGATCTC GATTCAGTCG ATGCTCAACT
TCAAACAGGG TCATTGAGAG TAGCTTACTC ATTGACACCT CTTGGAGTGG TATGAATTTG
GAAGTTGGTC TGATCCTTGG AAACTTCGGT CTCGAGCGGT CTAATGTTCA GGCCTGCGTA
CTGTTCGAAA AACAGCCTGA CCGCAGCTTG CGGGAAATCA CTTGGTCTGA CCTCGGTCGT
ATAGGAGGTC ACCGGCCACT GAGCCCCATC GAAACATTCC GTTTGGCATT CTTTGAGACC
GGGGACGCAG ACGGAGATCC CGACGCACTT GAACAGGCAA TCAATCCACG AAATTGGACC
AGCGAGGCGA TTATAGAGTC GAGCTTCGGA GCTCTACTGA ATTCGTTTGC TCAATTTTTG
CTGGCGCTAT TCGCCGCGAA CCGTTTACCC GTCTATGGTC TACTCCGCTT TCACACTCAT
AATTTTGCAC TACTGGAATG GCTGTCAGGG AAGCCCGAGT TTTCATCCCT TTATCAATCT
GCCGCGGGCG TCCATCTTTC TGTGACTCGG GAAGTTGACG CTTACGCTAG AGCAGTTCAG
GAAATAGTCG ACGCTCATAC AAACTTGCTC CATATTCGAC TTGAAGCAGA TGGCCCCCTG
GATCCTAGCT ATTACGAATC GCTCTTTCGT GAAGCCGATG GCGGGCAGAT TCGGGTAGAT
TGGGTTCGCC CGCGCAATTC TCCGGTAGAA GTCTCCTTAA ATTTCTTCGA CTATGCAACT
TTGCTGTCGA TAGTCGCTCT CGTACTCGCA ACCCGAACGA AATTTGAACT GTCAAAAATA
CAGTCAATGG GATGGATAGC GTCTCCACCG ACAGGCGATC GAAATGGAGA AGCCGCCAAC
GGCAAGCAAT TGATCGCATT TCGAACAGGC TTCTCTCTCG ATCGTCCCTC TGAGTATGAA
ATTAATGTGC GAACATTACT GCCGCGCTCC TTCCTTTTAG ATCTACACCT TTGCTTGAAC
ATCTCTCTGT TCAAGAGGGT CAGAGGGGTG TTGATTGGAT TGCTACTGCC ACCTAAAGAC
CCTTGA
 
Protein sequence
MLFSTHGDQN VHVPGDRFEA LGLADLRRIR FSGRAQSEQT FAGNFVECLF DGVELDKINF 
SNCDWKDCRV YNTTFIDCEL GDASFITNLF DQCSFIQCKF PNTGISDCSF RNCVFEDCDL
SNIVMKSNRI ERSRFSRCST SNRVIESSLL IDTSWSGMNL EVGLILGNFG LERSNVQACV
LFEKQPDRSL REITWSDLGR IGGHRPLSPI ETFRLAFFET GDADGDPDAL EQAINPRNWT
SEAIIESSFG ALLNSFAQFL LALFAANRLP VYGLLRFHTH NFALLEWLSG KPEFSSLYQS
AAGVHLSVTR EVDAYARAVQ EIVDAHTNLL HIRLEADGPL DPSYYESLFR EADGGQIRVD
WVRPRNSPVE VSLNFFDYAT LLSIVALVLA TRTKFELSKI QSMGWIASPP TGDRNGEAAN
GKQLIAFRTG FSLDRPSEYE INVRTLLPRS FLLDLHLCLN ISLFKRVRGV LIGLLLPPKD
P