Gene RPB_3532 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3532 
Symbol 
ID3911334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4042224 
End bp4044602 
Gene Length2379 bp 
Protein Length792 aa 
Translation table11 
GC content50% 
IMG OID637885434 
Producthypothetical protein 
Protein accessionYP_487138 
Protein GI86750642 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.263961 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAATAC CCACCAACTT TGACCCATCG AGTTCAGTTC TATTCCTCGG TTCGGGATTT 
AGCGCAAGTG GGACGGCAAT CTATGGCGGA CATCCACCGG CAGGAGACCA ATTGTGCGAC
CTGTTGGCGG ACGAACTGAA AGTGGCTCGC GGGAAATACG ATTTGCAGGC GCTCGCAGAC
GCCTTTCGCA GGCGACCTGA GCTGAATATG TATCAATCAC TGCGTCGGCT ATTTACGATA
AGCAATCTGA GCGTCGATCA GCGCGAAATT TTGAAGCTTC GCTGGCAACG AATTTATACG
ACGAACTACG ATGATAGTGT AGAACTGGCG TTTCATGAGA ACGGCATCAA GTCGCCTTCT
TACAACTACG ACGACCCGAA GCCCGCACGC GTTCCACGCG GTGCTGTGAT TCACCTTCAC
GGTGTGATCA AGAAAGCCAC CGAAGAGAAT ATTCACCAAC AGCTCGTGCT CGGGCGGCAG
TCCTATATAA GGCAATTCTT TGCGAAATCT CCTTGGTATG ATGAATTCTT GCGAGACATA
AGATTTTGCG AAGCCATTTT TTTTGTTGGC TACAGCCTTG CTGATCCACA CGTCACAGCT
CTTTTCGTTA ATCCCGAGCA GTCCAAACTC CGGACTTATT TCGTGCTTCG ACCGCCTCTG
GACTCCCTAT TGGTGGAGCA CATCGAGGAG TATGGAGAAG CCCATCCGAT AGAAACGAAA
GGGTTTGCTC AGATATGTCG GTCGCTTAGC GCGCCACCGC CGCTTGCAGA CCTAAATAAT
CTCCGAATTC TTCGTTGGAT CGATCCATTC AAAGATCAAA AGACAGTAAT TCAACCGACC
TCGCTGGAGG TAATCAACTT AGTGGCCTTC GGTGCTTTTG ACTCACAGCG TGCTTTATCA
ACTCTGCCCA ATGGGAAGTA TGTAATACCT CGCCAGAAGA TGGCGCGGCG GGCTGTTCAG
CAAGTGGTAC AAAATCGCAC GACGCTTCTT CATGGCCGCC TGGGAAATGG GAAATCTATA
TTCCTTTGGA TACTCGCCTT TCATCTGATG GCTCTAGATT ATCAATGTTT CCGATGCAGT
GCCATGTCGC CAGCCATTGA CCGCGAGGCT AAAGCGCTAG TGGATCATCC AAAAGTGGCG
ATTCTATTTG ACAGTTATGA TGTTGCGATT GATTCCGTCG ATCGCCTGTA TGAGCTTTTG
CCGCACGCGC GATTTATAGT TTGCGTGCGG AGCGGCGTTC AGGACGTTAG GCTGCACGAA
ATTGCAACGC GGTTTCCGTC GTCGATTGCG CGCGTGAATC TCAATGAGTT TGATGCCGAA
GATCGTTCGG ACTTCATTGA TCTCTTGGTG CCGGCCGGAG CTCTTAAGGA CGATCTTGAA
AATAGGATAC GCAGTTGTGC CGACATCCGT GAAGTCGTTG CCACGATCTA CGAGAACGAA
TTCATTCAGC GTCGGATTCG AGAATCGCTA GCTCCGCTGC GCTCTGATCG ATCTGCTGCA
GCGGTCACCA TTCTGGGATT GCTGTTATCT TGGATCAATC AGACTGGTGA TCCATCGTTA
TATCTTGAAG CGCTCGATGC GGACCCGCAC ACCACGCTTG CAAAATACCG AGAGGTCGCC
ATCGATATTT TTCGGCTTGA CGATGATCAG ATTCAAGCAA GATCGCCGGT GTTTTCGGAT
TACATTCTCC GTCGTCTTTT CTCGGTTGAT GAGATATTCC CCGTTGTCGA GAAGGTATTG
ATCGCGGCCG TTCAGCGCAA AAAGGAAAGA AAATACCGTG CGATACTAAG TAATATAATG
CGTTACTCAG CGCTCTTGAG CTTGTCCAAA GAGGCTCCCG ATGGCGCGAA CAAGATCATT
GGTTTGTATG GCCGACTGCA GCGAGATGTC GGCATTCAAG AGGAGCCGCT GTTTTGGTTG
CAGTACGCAA TCGCTATGAC TGAAGCGGAT TCAGCCGAAA TTGCGGAAGG TTTTCTTAGG
ACGGCGTACC GCAAGGCCGC CGAAGCTGGA GATTTTGCGA CCTATCAGTT GGACACTTTT
GCCCTTCGCC TGTATTTGAA GCTGGAGGAA AAGGCTGAAG TGGGTAGGTC TGTAAGCCGT
ATTAAGATGA TTCTGTACTC GACTAAATTA GTTTCGGGAA TGATTGGTGA TCAAAACCAT
CGGGCTTATG CAGTAAGAGT CCTAGAGGGC TGGTTGCCGT TTGTCGCTTC AAGGGTTGCG
GATCTCACCG GTTCGCAGAA AACGAAGTGC TTGGCTGCGG TGGACGATCT GTTGCATAAG
ATTTCGGGGC TGAGCGCTGC AGTTAGAGCG GAGACAGGAT CTGACCAAGT TAAGTCCGAC
CTTGAAGCTG CTAAGCGAAC GTTGCTGCTC GGAGCATAG
 
Protein sequence
MPIPTNFDPS SSVLFLGSGF SASGTAIYGG HPPAGDQLCD LLADELKVAR GKYDLQALAD 
AFRRRPELNM YQSLRRLFTI SNLSVDQREI LKLRWQRIYT TNYDDSVELA FHENGIKSPS
YNYDDPKPAR VPRGAVIHLH GVIKKATEEN IHQQLVLGRQ SYIRQFFAKS PWYDEFLRDI
RFCEAIFFVG YSLADPHVTA LFVNPEQSKL RTYFVLRPPL DSLLVEHIEE YGEAHPIETK
GFAQICRSLS APPPLADLNN LRILRWIDPF KDQKTVIQPT SLEVINLVAF GAFDSQRALS
TLPNGKYVIP RQKMARRAVQ QVVQNRTTLL HGRLGNGKSI FLWILAFHLM ALDYQCFRCS
AMSPAIDREA KALVDHPKVA ILFDSYDVAI DSVDRLYELL PHARFIVCVR SGVQDVRLHE
IATRFPSSIA RVNLNEFDAE DRSDFIDLLV PAGALKDDLE NRIRSCADIR EVVATIYENE
FIQRRIRESL APLRSDRSAA AVTILGLLLS WINQTGDPSL YLEALDADPH TTLAKYREVA
IDIFRLDDDQ IQARSPVFSD YILRRLFSVD EIFPVVEKVL IAAVQRKKER KYRAILSNIM
RYSALLSLSK EAPDGANKII GLYGRLQRDV GIQEEPLFWL QYAIAMTEAD SAEIAEGFLR
TAYRKAAEAG DFATYQLDTF ALRLYLKLEE KAEVGRSVSR IKMILYSTKL VSGMIGDQNH
RAYAVRVLEG WLPFVASRVA DLTGSQKTKC LAAVDDLLHK ISGLSAAVRA ETGSDQVKSD
LEAAKRTLLL GA