Gene RPB_2793 details

Gene Information

Locus tag: RPB_2793
Symbol:
ID: 3910586
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3182228
End bp: 3184033
Gene Length: 1806 bp
Protein Length: 601 aa
Translation table: 11
GC content: 69%
IMG OID: 637884693
Product: hemagluttinin-like protein
Protein accession: YP_486406
Protein GI: 86749910
COG category:
COG ID:
TIGRFAM ID:
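
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this, but it is consistent with the listed length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 3182228
end_bp = 3184033

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Split the gene into codons; a complete CDS should leave no remainder.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1806 bp) and Protein Length (601 aa).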


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAATGC TGGCCGTATC CAAGCAACAA ACATGTCGCC CGAAAATCGG CTGGCGCGCC 
GCATGTTCGG CCGGACTGCT GACGGCAACG GCCCTGACGC TCTGGACCGG AGGCGCCTCG
GCTGCGGATT ATGCAGCGGG CGGCGGCACG ATCAACGCGC CGTCCGGGTT TGCAACGGCG
GTTGGGGACA ACGCGCAGAC TACAGGCGAA GCTGCAACGG CGACAGGCGC GAACAGCGCC
GCCACCGGCA ACTATGCCAC TGCGATGGGC ACGAGCAGCA TCGCCACAGG TGGCTATGCC
ACTGCGAGCG GTTCCTACAG TTCAGCTCAG GGCTCGCAGG CAACCGCGAC GGGAGCGAAC
AGTAGCGCTA CTGGTATAAA CGCGACCGCG AACGGTGCTT TTGCCATCGC CAACGGCGAC
AGCGCCACCG CGACCGGCGC GAGCGCCAAT GCCGACGGCG CGACCGCGAC GGCGACCGGC
GCGGTCAGCA ATGCCCTCGG AGCCTCGGCG ACCGCGACTG GCTGGCGGAG TGCCGCTACC
GGCGATTCAG CAACGGCGAC CGGCGCGGCC AGCAATGCCG CCGGCACATT CGCGACCGCC
GCTGGCGTGA GCAGCGCCGC CACCGGCAAC TACGCCACCG CTACGGGCGC ATATAGCGTC
GCACAAGGCT CGAACGCGAC CGCGACCGGC CAGGCGAGCA ACGCCATCGG CCAATTCGCC
ACCGCCACCG GCGAAAGCAG CAGAGCGACC GGTTCGAACG CGACTGCGAC CGGCCAGAAC
AGTCTTGCAA CTGGAAATAG AGCCACTGCG ACGGGCGGCG ATTCGAACGC GGACGGCGCC
TTTGCGACCG CGACAGGCAA TGAAGCTCAA GCCCTTGGCA TCCGGGCGAC AGCGACAGGT
GCGGGGAGCA GGGCAACTGG CGACGATGCC TCTGCGATGG GAATGAGCAG CCTCGCCACC
GGCGCCGGCG CAACGGCGGT GGGTGCGAAC ACCACTGCAA CGGGCGGCTC CGCCAGCGCG
TTCGGGTTCG GCAGCATCGC CGACGGGGAA GCCACCACGG CGCTCGGCGA GACCAGCCTC
GCTTCGGCGA CCGGGGCCAC AGCCGTCGGA CGGCGAAGTG CGGCGCAGGC TGTCGCGGCA
ACCGCGCTCG GCAACGCGGC AGTGGCCACC GGGGTCAACG CCACCGCGCT GGGCGAGACC
AGCGTCGCCT CGGCGACCGG GGCGACAGCC GTCGGGCAGG GAAGTGCGGC GCAGGCGGTC
GGGGCGACTG CACTCGGCAA CGCGGCGATG GCGAACGGCC TCAATGCGAT CGCGCTCGGC
GCCAATTCGC AGGCGCTCGG CGTCAACTCC GTGGCGATCG GCAGCGGCTC GGTGGCGACA
CTCGCCGACA CCGTGTCGTT CGGCACAGCA GGCAACGAGC GGCGCCTCAC CAATGTCGCG
GCCGGTATTA ATCCCACCGA CGCCGTCAAC GTCAGTCAGC TCAGCGGCAT CACCTCGGCG
ATCCAATCCC AGATCGGGTC GCTGCAATCT CAGGTCGGCT CGCTGCAGTC ACAGATCGGC
GAAAACCGGA CGGAGGCACG GCGGGGCATT GCCGCGGCGG TCGCGGCCGC CAATGCGCCG
ATGCCATCCG GTCCGGGCAA GACAACCTGG CAGATGCGCG GCTCGACCTT CCATGGCGAA
GGCGGCTTCG GCTTCGGCTT CGCTCACCGC TTCAACACCT CGATGCCGCT CGCCGTCGTC
GCCGGCTACG GTAATGGCGG CGGCACCGAG CACACGGCCT ATGTCGGCAT CGGCGGGGAA
TTCTGA
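
A GC content figure such as the one in the gene record can be recomputed from a nucleotide sequence. The sketch below is one straightforward way to do so, not necessarily the method the database used; the helper name `gc_fraction` is hypothetical. It ignores the spaces that separate the 10-base groups in the listing. For brevity it is applied only to the first row of the gene sequence, so its result will differ from the whole-gene value.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc_count = sum(1 for b in bases if b in "GC")
    return gc_count / len(bases)


# First row of the gene sequence above, copied with its spacing.
first_row = "ATGGAAATGC TGGCCGTATC CAAGCAACAA ACATGTCGCC CGAAAATCGG CTGGCGCGCC"

print(f"GC fraction of first row: {gc_fraction(first_row):.2f}")
```

To check the whole-gene figure, concatenate all rows of the gene sequence into one string and pass it to the same function.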
 
Protein sequence
MEMLAVSKQQ TCRPKIGWRA ACSAGLLTAT ALTLWTGGAS AADYAAGGGT INAPSGFATA 
VGDNAQTTGE AATATGANSA ATGNYATAMG TSSIATGGYA TASGSYSSAQ GSQATATGAN
SSATGINATA NGAFAIANGD SATATGASAN ADGATATATG AVSNALGASA TATGWRSAAT
GDSATATGAA SNAAGTFATA AGVSSAATGN YATATGAYSV AQGSNATATG QASNAIGQFA
TATGESSRAT GSNATATGQN SLATGNRATA TGGDSNADGA FATATGNEAQ ALGIRATATG
AGSRATGDDA SAMGMSSLAT GAGATAVGAN TTATGGSASA FGFGSIADGE ATTALGETSL
ASATGATAVG RRSAAQAVAA TALGNAAVAT GVNATALGET SVASATGATA VGQGSAAQAV
GATALGNAAM ANGLNAIALG ANSQALGVNS VAIGSGSVAT LADTVSFGTA GNERRLTNVA
AGINPTDAVN VSQLSGITSA IQSQIGSLQS QVGSLQSQIG ENRTEARRGI AAAVAAANAP
MPSGPGKTTW QMRGSTFHGE GGFGFGFAHR FNTSMPLAVV AGYGNGGGTE HTAYVGIGGE
F