Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sala_1999 |
Symbol | |
ID | 4082164 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingopyxis alaskensis RB2256 |
Kingdom | Bacteria |
Replicon accession | NC_008048 |
Strand | - |
Start bp | 2109026 |
End bp | 2110168 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638010375 |
Product | phage major capsid protein, HK97 |
Protein accession | YP_617043 |
Protein GI | 103487482 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00431544 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAAGTGG ATATGGAAGT GAAGGCCGAT GCGCTGGACG GCGCGTTCGA TGCGGTGCTG GCGGCGGAGG CCGTCGATGA GCTGAAGGCG TCGGTTGCGG CGCTGAAGGC ACAGGTCGAT GCCCAGGCGG TCGCGGCGGC GCGGTTGCCG CTCGACGGGG CGAAGGCGGC CGATCCGGCG CTGGACGCCT TTGTCGAACG TTATCTGCGG CGCGGGATCG ACGCCGGGGT GGAGATGAAG AGCCTGTCGG GGGCGAGCGG AGCCGAGGGC GGCTATGCGG TGCCGCGCGA GATCGACACC AGCATCGCCG CGACGCTGAA ATCGCTGTCG CCGATCCGCA GCATCGCGAC CGTGGTGCAG ACAGGGACGA GCGGGTACCG GAAGCTGATC GCGACGGGCG CGACGGGCGC GGGCTGGGTC GGCGAAAGCG ACGCGCGGCC CGAGACGGCG ACGCGCAGCT TTGCCGAGAT CGCGCCGCCG TCGGGCGAGC TTTACGCCAA TCCGGCGGCG AGCCAGGCGA TGCTCGACGA TGCGATGTTC AACGTCGAGG CCTGGCTGGC CGACGAGATC GGGCGCGAGT TCGCGGTCGC CGAAGGGGCG GCGTTCGTGA CCGGCAACGG CACGAACCGG CCCAGGGGAT TCCTGACCTA TGCGACGAGC GACGAGGGTG ACGGTGCGCG GCCGTTCGGC ACGTTGCAGC ATCTGGCGAC GGGCAGCGCG GGCGCCTTTC CGGCGGTGAA CCCTGAGGAC AGACTGGTCG AGCTGGTCCA TGCGCTGAAA GCTCCGTACC GGCAGGGCGC GGTGTGGGTG ATGAACAGCG ATACGCTGGC GCGCATCCGC AAGTTCAAGA CGTCGGACGG CGCCTTCGTC TGGCAGCCGG GGCTGGTCGA GGGACAGGCG GCGAGCCTGC TCGGCTATCC GGTCGTCGAG GCCGAGGACA TGCCCGATAT TGCGGCCGAC AGCCTGTCGA TCGCCTTCGG CAATTTCCGC GCGGGCTATC TGATCGCCGA CCGCGGCGAG ACGCGCATCC TGCGCGATCC GTTCAGCAAC AAGCCCTTCG TGCATTTCTA TGCAACCAAA AGGGTCGGCG GCGCGATCAT CGATTCGCAG GCGATCAAGC TGATGAAATT CGCCGCCAGC TGA
|
Protein sequence | MEVDMEVKAD ALDGAFDAVL AAEAVDELKA SVAALKAQVD AQAVAAARLP LDGAKAADPA LDAFVERYLR RGIDAGVEMK SLSGASGAEG GYAVPREIDT SIAATLKSLS PIRSIATVVQ TGTSGYRKLI ATGATGAGWV GESDARPETA TRSFAEIAPP SGELYANPAA SQAMLDDAMF NVEAWLADEI GREFAVAEGA AFVTGNGTNR PRGFLTYATS DEGDGARPFG TLQHLATGSA GAFPAVNPED RLVELVHALK APYRQGAVWV MNSDTLARIR KFKTSDGAFV WQPGLVEGQA ASLLGYPVVE AEDMPDIAAD SLSIAFGNFR AGYLIADRGE TRILRDPFSN KPFVHFYATK RVGGAIIDSQ AIKLMKFAAS
|
| |