Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2793 |
Symbol | |
ID | 3910586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3182228 |
End bp | 3184033 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637884693 |
Product | hemagluttinin-like protein |
Protein accession | YP_486406 |
Protein GI | 86749910 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAATGC TGGCCGTATC CAAGCAACAA ACATGTCGCC CGAAAATCGG CTGGCGCGCC GCATGTTCGG CCGGACTGCT GACGGCAACG GCCCTGACGC TCTGGACCGG AGGCGCCTCG GCTGCGGATT ATGCAGCGGG CGGCGGCACG ATCAACGCGC CGTCCGGGTT TGCAACGGCG GTTGGGGACA ACGCGCAGAC TACAGGCGAA GCTGCAACGG CGACAGGCGC GAACAGCGCC GCCACCGGCA ACTATGCCAC TGCGATGGGC ACGAGCAGCA TCGCCACAGG TGGCTATGCC ACTGCGAGCG GTTCCTACAG TTCAGCTCAG GGCTCGCAGG CAACCGCGAC GGGAGCGAAC AGTAGCGCTA CTGGTATAAA CGCGACCGCG AACGGTGCTT TTGCCATCGC CAACGGCGAC AGCGCCACCG CGACCGGCGC GAGCGCCAAT GCCGACGGCG CGACCGCGAC GGCGACCGGC GCGGTCAGCA ATGCCCTCGG AGCCTCGGCG ACCGCGACTG GCTGGCGGAG TGCCGCTACC GGCGATTCAG CAACGGCGAC CGGCGCGGCC AGCAATGCCG CCGGCACATT CGCGACCGCC GCTGGCGTGA GCAGCGCCGC CACCGGCAAC TACGCCACCG CTACGGGCGC ATATAGCGTC GCACAAGGCT CGAACGCGAC CGCGACCGGC CAGGCGAGCA ACGCCATCGG CCAATTCGCC ACCGCCACCG GCGAAAGCAG CAGAGCGACC GGTTCGAACG CGACTGCGAC CGGCCAGAAC AGTCTTGCAA CTGGAAATAG AGCCACTGCG ACGGGCGGCG ATTCGAACGC GGACGGCGCC TTTGCGACCG CGACAGGCAA TGAAGCTCAA GCCCTTGGCA TCCGGGCGAC AGCGACAGGT GCGGGGAGCA GGGCAACTGG CGACGATGCC TCTGCGATGG GAATGAGCAG CCTCGCCACC GGCGCCGGCG CAACGGCGGT GGGTGCGAAC ACCACTGCAA CGGGCGGCTC CGCCAGCGCG TTCGGGTTCG GCAGCATCGC CGACGGGGAA GCCACCACGG CGCTCGGCGA GACCAGCCTC GCTTCGGCGA CCGGGGCCAC AGCCGTCGGA CGGCGAAGTG CGGCGCAGGC TGTCGCGGCA ACCGCGCTCG GCAACGCGGC AGTGGCCACC GGGGTCAACG CCACCGCGCT GGGCGAGACC AGCGTCGCCT CGGCGACCGG GGCGACAGCC GTCGGGCAGG GAAGTGCGGC GCAGGCGGTC GGGGCGACTG CACTCGGCAA CGCGGCGATG GCGAACGGCC TCAATGCGAT CGCGCTCGGC GCCAATTCGC AGGCGCTCGG CGTCAACTCC GTGGCGATCG GCAGCGGCTC GGTGGCGACA CTCGCCGACA CCGTGTCGTT CGGCACAGCA GGCAACGAGC GGCGCCTCAC CAATGTCGCG GCCGGTATTA ATCCCACCGA CGCCGTCAAC GTCAGTCAGC TCAGCGGCAT CACCTCGGCG ATCCAATCCC AGATCGGGTC GCTGCAATCT CAGGTCGGCT CGCTGCAGTC ACAGATCGGC GAAAACCGGA CGGAGGCACG GCGGGGCATT GCCGCGGCGG TCGCGGCCGC CAATGCGCCG ATGCCATCCG GTCCGGGCAA GACAACCTGG CAGATGCGCG GCTCGACCTT CCATGGCGAA GGCGGCTTCG GCTTCGGCTT CGCTCACCGC TTCAACACCT CGATGCCGCT CGCCGTCGTC GCCGGCTACG GTAATGGCGG CGGCACCGAG CACACGGCCT ATGTCGGCAT CGGCGGGGAA TTCTGA
|
Protein sequence | MEMLAVSKQQ TCRPKIGWRA ACSAGLLTAT ALTLWTGGAS AADYAAGGGT INAPSGFATA VGDNAQTTGE AATATGANSA ATGNYATAMG TSSIATGGYA TASGSYSSAQ GSQATATGAN SSATGINATA NGAFAIANGD SATATGASAN ADGATATATG AVSNALGASA TATGWRSAAT GDSATATGAA SNAAGTFATA AGVSSAATGN YATATGAYSV AQGSNATATG QASNAIGQFA TATGESSRAT GSNATATGQN SLATGNRATA TGGDSNADGA FATATGNEAQ ALGIRATATG AGSRATGDDA SAMGMSSLAT GAGATAVGAN TTATGGSASA FGFGSIADGE ATTALGETSL ASATGATAVG RRSAAQAVAA TALGNAAVAT GVNATALGET SVASATGATA VGQGSAAQAV GATALGNAAM ANGLNAIALG ANSQALGVNS VAIGSGSVAT LADTVSFGTA GNERRLTNVA AGINPTDAVN VSQLSGITSA IQSQIGSLQS QVGSLQSQIG ENRTEARRGI AAAVAAANAP MPSGPGKTTW QMRGSTFHGE GGFGFGFAHR FNTSMPLAVV AGYGNGGGTE HTAYVGIGGE F
|
| |