Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3481 |
Symbol | |
ID | 3911283 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3985787 |
End bp | 3987040 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637885383 |
Product | Phage major capsid protein, HK97 |
Protein accession | YP_487087 |
Protein GI | 86750591 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.137838 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTACG ACATCACCCA CGCCCCCGAG ACCAAGGCCG GCATTGCCGG CGACGATGCG CAGCAGGTCT ACGACACGCT GATGCGCACC TTCGAGGACT ACAAGGCGGA GAACGATACC CGGCTGCAGG CGATCGAGAA GCGGCGCGGC GACGTGCTGG CGGAGGAGAA GGTGGCGCGG ATCGACGCGG CGCTGGATGC GCAGCAGCGC AAGCTCGACG AACTGGCGCT GAAGGCGGCG CGGCCGCAGC TCGGCAGCGC GATCACGCCG GTTCCGGCGG CGCGCGAGCA CAAGAGTGCG TTCGACGCCT ATATCCGCTT CGGCGACACC GCGGGGCTGC GCGCGCTCGA AACCAAGGCG ATGTCGATCG GCTCCAATCC CGACGGCGGC TACCTGGTTC CGGACGAGCT GGAGCATACG ATCGGCGAGC GGCTAGCGGT GGTCTCGCCG ATCCGCGCGA TCGCCGCCGT GCGGCAGATC TCCGGCAACG TCTACAAGAA GCCGTTCATG ATCACCGGCC CCACCACCGG CTGGGTCGGC GAGACCGCGG CGCGGCCGCA GACCGGCTCG CCGCAGCTCG ACGCGCTGTC GTTCCCGGCG ATGGAGCTGT ATGCGATGCC GGCGGCGACC GCGAATCTGC TGGAAGACGC CGTCGTCAAT CTCGACCAGT GGATCGCCGG CGAGGTCGAA TTGGTGTTCT CGGTGCAGGA GGGGACGGCC TTCATCACCG GCGACGGCCT CGGCAAGCCG AAGGGCTTTC TCGCCTACCC GACGGTGGCG AATGCCTCCT GGAGCTGGGG CAATCTCGGC ACCATCGCCT CCGGCGCCGC CGGCGCGTTC GCCGCATCGA GCCCGTCCGA CGTGCTGATC GACCTGATCT ACGGGCTGAA GCCGGGCTAC CGCCAGAACG CCTCGTTCGT GATGAACCGG CGCACCCAGG CGGCGGTCCG CAAGTTCAAG GACTCCACCG GCGTCTATCT GTGGCAGCCG CCGGCGACCG TCTCCGGCCG CGCCAGCCTG ATCGGCTTCC CGCTGGTCGA TGCCGAGGAC ATGCCGGACA TCGCCGCGAA CTCGCTCAGC ATCGCGTTCG GCGACTTCCA GCGCGGCTAT CTGATCGTCG ACCGCCAGGG TATCCGCGTG CTGCGCGACC CGTATTCCGC CAAGCCCTAC GTGCTGTTCT ACACCACCAA GCGCGTCGGC GGCGGCGTGC AGGACTTCGA CGCGATCAAG CTGTTGAAGT TCGCGGCGAG TTGA
|
Protein sequence | MDYDITHAPE TKAGIAGDDA QQVYDTLMRT FEDYKAENDT RLQAIEKRRG DVLAEEKVAR IDAALDAQQR KLDELALKAA RPQLGSAITP VPAAREHKSA FDAYIRFGDT AGLRALETKA MSIGSNPDGG YLVPDELEHT IGERLAVVSP IRAIAAVRQI SGNVYKKPFM ITGPTTGWVG ETAARPQTGS PQLDALSFPA MELYAMPAAT ANLLEDAVVN LDQWIAGEVE LVFSVQEGTA FITGDGLGKP KGFLAYPTVA NASWSWGNLG TIASGAAGAF AASSPSDVLI DLIYGLKPGY RQNASFVMNR RTQAAVRKFK DSTGVYLWQP PATVSGRASL IGFPLVDAED MPDIAANSLS IAFGDFQRGY LIVDRQGIRV LRDPYSAKPY VLFYTTKRVG GGVQDFDAIK LLKFAAS
|
| |