Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_2096 |
Symbol | |
ID | 6409756 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 2269153 |
End bp | 2270409 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 642711981 |
Product | phage major capsid protein, HK97 family |
Protein accession | YP_001991093 |
Protein GI | 192290488 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.225363 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGACCA CGTTCGACCA TGCCCCCGAG ACCAAGGCCG GCTTTGCCGG CGACGATGCG CAGCAGGTGT TCGATGCCTT GATGCGCGCG TTCGAGGACT ACAAGGCCGA GAACGACGCG CGGCTGAAGG CGATCGAGAC CGGCAAGGGC GACGTGCTGG CCGAGGAGAA GCTGGCGCGG ATCGACGCCG CGCTGGATCA TCAGCAGCGC CGGCTCGACG AGCTCGTCCT GAAGGCGGCG CGGCCGCAGC TCGCCGCGGA GCGGACGCCG CCGGCCGCCG ATCAGCGCGA GCACAAGGGC GCGTTCGAGG CCTATGTCCG CACCGGCGAG GCCGCCAGCC TGCGCCGGCT GGAGACCAAG GCGCTGTCGG TGGGCTCCGA TCCCGACGGC GGCTATCTGG TGCCGACCGA GCTCGAGCGC GCGATCTCGG CGCGGCTCGC CGCGATCTCG CCGATCCGTT CGCTCGCTAC CGTGCGCGAG ATCTCCGGCG GGGTGTACAA GAAGCCGTTC GTGACCGCCG GGCCGGCGAC CGGCTGGGTC GGCGAGAGTG ACGCCAGGCC GCAGACCAAT GCGCCGTCGC TCGATGCGCT GTCGTTTCCG GCGATGGAGC TGTACGCGAT GCCGGCCGCG ACCGCGACGC TGCTCGACGA CTCTGCGGTG AATATCGACG ACTGGCTCGC CGGCGAGATC GACCAGGCGT TCGCCGAGCA GGAGGGCATC GCCTTCGTGT CCGGCGACGG CGTCGCTAAG CCGAAGGGCT TCCTCGCCGT GCCGACCGCG GCCAACAGCG CCTGGAGCTG GGGCAAGCTC GGCACCATCT CGACCGGCGC CGCCGGCGCG TTCGCCGCCA CCGCGCCCGG CGACGTGCTG ATCGACGTGA TCTACGCGCT GCGGCCGGGC TATCGCCAGA ACGCCAGCTT CGTGATGAAC CGGCGGACGC AGGCGGCGAT CCGCAAGTTC AAGGATTCCT CCGGCGCCTA TCTGTGGCAG CCGCCGGTCA CCGCCGGCGG CCGCGCCAGC CTCGCCGGCT TCCCGCTCGC CGATGCCGAG GACATGCCGG ACATCGCGGC GAATTCGCTG TCGATCGCGT TCGGCGACTT CCGCCGCGGC TATCTGATCG TCGATCGCCA GGGCGTGCGC GTGCTGCGCG ATCCGTATTC GTCGAAGCCC TACGTGCTGT TCTACACCAC CAAGCGCGTC GGCGGCGGCG TCCAGGATTT CGACGCCATC AAGCTGGTGA AGTTCGCCGC GAGCTGA
|
Protein sequence | MMTTFDHAPE TKAGFAGDDA QQVFDALMRA FEDYKAENDA RLKAIETGKG DVLAEEKLAR IDAALDHQQR RLDELVLKAA RPQLAAERTP PAADQREHKG AFEAYVRTGE AASLRRLETK ALSVGSDPDG GYLVPTELER AISARLAAIS PIRSLATVRE ISGGVYKKPF VTAGPATGWV GESDARPQTN APSLDALSFP AMELYAMPAA TATLLDDSAV NIDDWLAGEI DQAFAEQEGI AFVSGDGVAK PKGFLAVPTA ANSAWSWGKL GTISTGAAGA FAATAPGDVL IDVIYALRPG YRQNASFVMN RRTQAAIRKF KDSSGAYLWQ PPVTAGGRAS LAGFPLADAE DMPDIAANSL SIAFGDFRRG YLIVDRQGVR VLRDPYSSKP YVLFYTTKRV GGGVQDFDAI KLVKFAAS
|
| |