Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_0686 |
Symbol | |
ID | 6408339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 721826 |
End bp | 723085 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642710601 |
Product | phage tail protein I |
Protein accession | YP_001989721 |
Protein GI | 192289116 |
COG category | [R] General function prediction only |
COG ID | [COG4385] Bacteriophage P2-related tail formation protein |
TIGRFAM ID | [TIGR01634] phage tail protein, P2 protein I family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGCCG ATTTCACCCC GATCCAGCCG ACGCTCAACG ATGCCGAGCG CTGCCACAGC CTGGTCAACG CCGCGCGCTG GCCGGTGCTG TTCGCCGAGG CCGCCAAGCT GCGCACGCTG TGGGACCCGT GGGCATGCCC GGTCGATCAG TTGCCGCTGC TCGCGTGGGC ATGGTCGGTC GACTTCTGGA AGGCTGAATG GCCGGAGCAC CGCAAGCGGC AAGTGATCGC CGAGTCGCGC GCTTATCACG AAGCTAAGAC CACGGTGCTC GGCTATCGGA TGGCTCTTGG CTACGTCGAT GCCGAGTTCG TTCGGGCCCG GCTGCCGCGA CACGCGTTCT TCGCCGGTGC TGCGCCGACC CAGGAGTCGC ATGATCGCTG GCTCGCTGGC CTGCCGGAGA TCCGCATCTA CACGGCGGTG TTTCCGATCA AGCGCCGGCC AGTGCGGTTC TTGCAGGCCG ACGGAACGAT CGTTCAACGC ACCCTTAAAG GGCGCTTTGT CGGCCGGCGC TTCCGGGCGC ATTCGGCGAA GTCGATCGTG CTCGACGGTC GGCGGCCCGA ATTGCGGCGG CCCGATGGAT CGGTGCAGCG CCTCGTGTTC GAAGGCGTGC GGATCGACAG CTTTGGACGG CTGCTGAGCG ATCCGGAGCG CCTGGTCATT CCGGCGCCGC TGCGCAACAC CTTCCAGGTC GGAAAGCGGC TCCGCGGTCG GTTCGTCGGC GACGGCAAGG CGAACGGCAA GCGCGTTATA TCGCTGAGTT TCTCGACGGG CGTCGATGTA TTCCGGCCGA ATGCCGTGTC GCCGGGCCTC ACGGCGATCG ACGTCGTGCC GCGGCAGATG TGGGACGTGA TGCCGCCCGG CCGCGGCTGG TTCATCAGCC GGGCTCGTAA GGGCCGCGGC ATCCAGCGGA ACAGGGCGGA CGAGCTGGCC TATCTGTCGA TCCGGCTCGC TGACGGCAGT CGTCCGGAGT ACGGCGGGCG CATGCGCAAC CGCATCGGCC GCACGCGGAT CAGGCGCGCA CCGTTCACCG CGACAGTGCT GGCGCATCTT GCAGGACCGC CCAAGCACGG CTTTCCGGTT GGTCGCTTCG TCAAGGTCGG CCCGGAAGCG CGCATTGCCG AAGCAATCAA AGCGCTGGCG GTCACACAGG CGGCTCGCGA CACCGTCTAT CTCGACATCA ATTCGGTCCG TCCGATCACC TACGGCGATC TTGCTCGATT GCCGGAGAAC GTTCGCGCCG GTGCCATCAT CCGCATCTGA
|
Protein sequence | MTADFTPIQP TLNDAERCHS LVNAARWPVL FAEAAKLRTL WDPWACPVDQ LPLLAWAWSV DFWKAEWPEH RKRQVIAESR AYHEAKTTVL GYRMALGYVD AEFVRARLPR HAFFAGAAPT QESHDRWLAG LPEIRIYTAV FPIKRRPVRF LQADGTIVQR TLKGRFVGRR FRAHSAKSIV LDGRRPELRR PDGSVQRLVF EGVRIDSFGR LLSDPERLVI PAPLRNTFQV GKRLRGRFVG DGKANGKRVI SLSFSTGVDV FRPNAVSPGL TAIDVVPRQM WDVMPPGRGW FISRARKGRG IQRNRADELA YLSIRLADGS RPEYGGRMRN RIGRTRIRRA PFTATVLAHL AGPPKHGFPV GRFVKVGPEA RIAEAIKALA VTQAARDTVY LDINSVRPIT YGDLARLPEN VRAGAIIRI
|
| |