Gene Information |
Locus tag | Rsph17025_1359 |
Symbol | |
ID | 5083130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 1391121 |
End bp | 1392278 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640482916 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_001167561 |
Protein GI | 146277402 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
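The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in Protein Length; both are assumptions about how this record is built, not statements taken from it. The function names are illustrative only.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 1391121
END_BP = 1392278


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1158
print(protein_length(length_bp))  # 385
```

Both results agree with the Gene Length (1158 bp) and Protein Length (385 aa) fields.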
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.504084 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.308637 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCACA TGAACAAGAG GCAACTGTTT GGCGGCACGA TGCTCGTCGT GAAGGGCGAC GATGACGAAC CGGCCGAGCT GGTGACGAAG GCCATTGCCG ACCTCACGAA GACGGTGAAT GATCGCCTGG ATGCGCTCGA ACAGAAGGCC GATACGACGC AGATCGTGGC CCGGCTCGAC AAGGTAGAAG CGAAGGTCAA CCGTCCTGGT GGGGCGGATC CGAAGCCCGA GGCGTCGATC GAGCGCAAGG CCTTCGGTAC CTACCTTCGC GCTGGCAATG CCGCACCTGC CGACGAGCTG AAGGCACTGA ACGTGTCGAG CGATCCGCAG GGGGGGTATC TCGCGCCGGC CGAGATGAGC ACCGAGTTCA TCCGCGACCT TGTCGAGTTC TCCCCCGTGC GGGCCGTTGC GAGCGTTCGG CAGACCGGCT CCCCGAGCAT CATCTATCCC GCGCGAACCG GCATCACGAA CGCACGATGG AAAGGGGAGG CTCAGGCGCA GGAAGGGTCT GAGCCCGGCT TCGGCCAGGC CGAGGTGGTG GTCAAGGAGG TCAACACGTT CGTTGACATC TCGAACCAGC TCCTTGCCGA CAGTGCGGGG CAGGCGGAGG CGGAAGTGCG CATGGCCTTG GCTGAGGACT TCGGCCAGAA GGAAGGAGCC GCCTTCGTAT CCGGCGACGG CATCCTTGAG CCGGCAGGCT TCATGACCCA TGCAGGCATC GCCCATACGG TGAGCGGCGC CGCTGCCGGG ATCACGGCCG ACGCCCTGGT GAAGCTGCTC TATGCGCTTC CCGCAACCTA TCGCGGCCGC GGTGCCTGGG CGATGAACGG CACCACTCTC GGCGCTGTGC GTCTCCTGAA GGATGGTGAC GGGCGCTTCC TCTGGCAGCC TTCCTATCAG GCCGGCCAGC CCGAAACGCT CCTTGGGCGT CCTGTTGTGG AGATGGTGGA CATGCCCGAC GTAGAGGCCG GCGCGTTTCC GATCATCTAC GGCGACTGGT CGGGATACCG AATCGTGGAC CGCATTGCGC TGAGCGTCCT GGTGAACCCC TACATCCGGG CGACCGAGGG TATCACCCGC ATCCATGCGA CGCGGCGGGT CGGTGGCCGG GTCTTGCAGG CTGCGAAGTT CCGCAAGCTC AAGATCGCCG GGGCCTGA
Protein sequence | MRHMNKRQLF GGTMLVVKGD DDEPAELVTK AIADLTKTVN DRLDALEQKA DTTQIVARLD KVEAKVNRPG GADPKPEASI ERKAFGTYLR AGNAAPADEL KALNVSSDPQ GGYLAPAEMS TEFIRDLVEF SPVRAVASVR QTGSPSIIYP ARTGITNARW KGEAQAQEGS EPGFGQAEVV VKEVNTFVDI SNQLLADSAG QAEAEVRMAL AEDFGQKEGA AFVSGDGILE PAGFMTHAGI AHTVSGAAAG ITADALVKLL YALPATYRGR GAWAMNGTTL GAVRLLKDGD GRFLWQPSYQ AGQPETLLGR PVVEMVDMPD VEAGAFPIIY GDWSGYRIVD RIALSVLVNP YIRATEGITR IHATRRVGGR VLQAAKFRKL KIAGA
|
| |
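The sequence fields are shown in space-separated blocks of 10 characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own tooling: it strips the block spacing, computes GC content, and translates a CDS fragment. The codon-to-amino-acid assignments used here are those of the standard genetic code, which translation table 11 shares for internal codons; table 11 additionally permits alternative start codons, which this sketch does not handle (this gene starts with ATG, so that does not matter here). All function names are hypothetical.

```python
# Standard codon assignments, built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three and last two blocks of the Gene sequence field above.
first_blocks = "ATGCGGCACA TGAACAAGAG GCAACTGTTT"
last_blocks = "AAGATCGCCG GGGCCTGA"

print(translate(clean_sequence(first_blocks)))  # MRHMNKRQLF
print(translate(clean_sequence(last_blocks)))   # KIAGA
```

The two fragments translate to the first ten and last five residues of the Protein sequence field. Passing the full Gene sequence field through clean_sequence and gc_percent gives a value that can be compared against the GC content field, which appears to be rounded to a whole percent.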