Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3661 |
Symbol | |
ID | 3911463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4202981 |
End bp | 4204237 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637885563 |
Product | hemagluttinin-like protein |
Protein accession | YP_487267 |
Protein GI | 86750771 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0725058 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGCCC GGATGAACGA CAGAGTTGGC GCGGCGTGGA TTGGCGCATC TCTCATTCTC GCGGCGTTTC TGCTGCAGCC GGGCCGCGCG ATGGCGCAAA CCACCGACCC GGTTCAGGTC GTGTCGGGTT GCGTCGCCAA TGTCGCAGAC CGGCAATTGG CCTGCGGCCC GGGCGCCAGC ACTGCCGGAA GCAATGATCG GTCGACGGCC ATCGGCAGCA ACGCTCAATC CGACGGCAGC TCTGTCGCCA TCGGTAGCAG TTCCATAGCA ACCGGCAATA ACTCGACCGC TATCGGCGAC AACGCCAACG CGGTTGGATT TGGGGATTCG ACAGTGATCG GCTCCGGCGC CGGATCTGGA GGAGCCCGCA GCACGGTTAT CGGCAGCGGG GCGGCCACCG GCAACGAAGG AGCCATCGCC GTGGGACATC GCGCCGGCGT CGGACTGGGC TCGGGTCAGT ACAGCATCGC GATGGGCGCG GGCGGCGATA CCGCGCAGTC CGCGTCGCAT GCAATCGGCA ATTTCAGCAT CGCGATCGGC GGCGGCGACG GCCTATCGGC CAACGGTGCC ATTTCCAATG CGGCGTTCGG CACCGCGGTC GGCGCATCCA GCATCGCGGC GAACCAGTTC GACGCGGCGT TTGGCGCCTT CTCGATCGCC AGCGGGGCGC GTAGCGCCGC ATTCGGCGCC AACAGCGTCG CCGCCGGCGC GTCGTCGGTG GCGCTGGGAG ACGGCTCGTT TGCCCAAGGG ACCCACGCTG TGTCCACCGG CTTCAATTCA GCGGCCACCG GTGTCAACAG CGTGGCGCTC GGTGCCGAGG CTTCGGCGAC GGCTTCCAAC TCGGTGGCGA TCGGCTCGCG TTCGGTCACG AGCGCGCCCA ACACGGCATC GTTCGGGACG CCCGGCAATG AGCGCCGGCT CACCAACGTG GCCGCCGGCA TCAGCCAGAC CGACGCGGTC AATGTCGGGC AGCTCGCGGC AGTCACCAGC GGCCTTCAGT CGCAGATCAC CAACAACCGC TCCGAAGCGC GGCGCGGCAT CGCCGCGGCG GTCGCCACCG CCAGCGCGCC GATGCCCTCG GCGCCAGGCA AGACGACGTG GCAGATCCGC GGCTCCACCT TCCAGAACGA ATATGGCATC GGCGTCGGTT TTGCCCACCA ACTGCGGACG GCGATGCCGC TCAACATCGT CGGCGGCTAC GGCAATGGCG GCGGCGCCGA GCACACCGCC TATGTCGGCG TCGGCGGCGA GTTCTGA
|
Protein sequence | MTARMNDRVG AAWIGASLIL AAFLLQPGRA MAQTTDPVQV VSGCVANVAD RQLACGPGAS TAGSNDRSTA IGSNAQSDGS SVAIGSSSIA TGNNSTAIGD NANAVGFGDS TVIGSGAGSG GARSTVIGSG AATGNEGAIA VGHRAGVGLG SGQYSIAMGA GGDTAQSASH AIGNFSIAIG GGDGLSANGA ISNAAFGTAV GASSIAANQF DAAFGAFSIA SGARSAAFGA NSVAAGASSV ALGDGSFAQG THAVSTGFNS AATGVNSVAL GAEASATASN SVAIGSRSVT SAPNTASFGT PGNERRLTNV AAGISQTDAV NVGQLAAVTS GLQSQITNNR SEARRGIAAA VATASAPMPS APGKTTWQIR GSTFQNEYGI GVGFAHQLRT AMPLNIVGGY GNGGGAEHTA YVGVGGEF
|
| |