Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_0642 |
Symbol | |
ID | 4897247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 656750 |
End bp | 657961 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640111225 |
Product | phage major head protein |
Protein accession | YP_001042527 |
Protein GI | 126461413 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAAGG AAATCGCCGC GATCCTCGAG CGGGTCGTGA AGGACGTCGC TCGTATCGAC AACGAGCTTT CGAAGAAGGC CGAGGCCGCC TTCGCCGAGG TCAAGAACAT GGGCAACCTG TCGACCGAGA CGAAGGCTTC GGTCGATCAG CTGCTCACGG CGCAGACCAC GCTTGTCAGC GTCGTGGATG ACCTGAAGGC CCGCCTGGGC GAGGTCGAGC AGAAGGGCGC TCGTCGCTCG GCGCCGACTT CGGCGCAGTC GTGGGGCCAG CAGGCGGTGC AGGCCGAGAA ACTGATCGCC TTTGCGGCGG CGGTGGAAGG CGGCCGGCGC GTCTCGGTGC CCGTGGTCAA GAACGTGGTC ACCTCGGCGG ATGTGGCCGA AGGCGTCGTC GAGCCCCAGC GCCTGCCGGG CATCGACGTC GCGCCGAAGC AACGGCTGTT CATCCGCGAC CTGATCGCGC CGGGCAGCAC CGAGTCGCCC GCGATCTTCT GGGTGCAGCA GACCGGCTTC ACGAATGCTG CCCGGGTGGT GCCCGAGGGC ACGGCGAAGC CCTACTCGGA TATCGAGTTC GCGACCAAGA TCACGCCGGT CGTGACCGTT GCGCACATGT TCAAGGCGTC GAAGCAGATC CTCGACGACT TCCGCCAGCT GCAGTCCATG ATCGATGCCG AGATGCGGTA TGGCCTGAAG TATGTCGAGG AGCAGGAGAT CCTGTTCGGC GCGGGCGGCG CGGGCAACAT CGAGGGCATC GTCCCGCAGG CGTCGGCCTT CGCTCCCGCC TTCGCGCCGG AAATGCGGAC GCCGATCGAC GATCTTCGCC TCGCGCTCCT GCAGGCGCAG CTGGCCCGTC TGCCGGCCTC GGGCTTCGTG CTTCACATGA TGGATTGGGC CAAGATCGAG CTCACGAAGA ACACGGTTGG CGATTACGTC CTTGCCAACC CGCTCCGCCT CGCCGGGCCG ACGCTCTGGG GTAAGCCCAT CGTCGAAACG GAGATCCCGG AGTTCGAGGG CGAGTTCCTC GCGGGCGCCT TCTCCACCGG CGCGCAGATC TTCGATCGCG AGGACGCGAA TGTCGTCATC TCGACCGAGA ACGCCGACGA CTTCGAGAAG AACATGATCT CGATCCGCTG CGAGGAGCGT CTGGCGCTCG CCGTGAAGCG TCCGGAGGCG TTCGTCACCG GCGAGTTCGG CACCGCAGTC GCCGCGCCGT GA
|
Protein sequence | MDKEIAAILE RVVKDVARID NELSKKAEAA FAEVKNMGNL STETKASVDQ LLTAQTTLVS VVDDLKARLG EVEQKGARRS APTSAQSWGQ QAVQAEKLIA FAAAVEGGRR VSVPVVKNVV TSADVAEGVV EPQRLPGIDV APKQRLFIRD LIAPGSTESP AIFWVQQTGF TNAARVVPEG TAKPYSDIEF ATKITPVVTV AHMFKASKQI LDDFRQLQSM IDAEMRYGLK YVEEQEILFG AGGAGNIEGI VPQASAFAPA FAPEMRTPID DLRLALLQAQ LARLPASGFV LHMMDWAKIE LTKNTVGDYV LANPLRLAGP TLWGKPIVET EIPEFEGEFL AGAFSTGAQI FDREDANVVI STENADDFEK NMISIRCEER LALAVKRPEA FVTGEFGTAV AAP
|
| |