Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nham_0337 |
Symbol | |
ID | 4030493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter hamburgensis X14 |
Kingdom | Bacteria |
Replicon accession | NC_007964 |
Strand | - |
Start bp | 366230 |
End bp | 367378 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637968871 |
Product | phage major capsid protein, HK97 |
Protein accession | YP_575693 |
Protein GI | 92115964 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.139658 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTTTC ATTTTCTTGA GACCAAATCT GCGGCTGACG TCGACGAAGG CGATCCGTCC ATCGTTGAAG TCAAGTCGGC GCTGACCGCT CTTACGGAAG ACGTAAAGAA GGCCACCGCA CCGGTTGCCG ATCTGACCAA GCGCCTCGAC GAAATCGAGA CGAAGATCAA TCGTCCGGCC ATTCACACTG AGAAAAAGGA CGAGATCAGC GACGAGCGCA AAGCGTTCAC CGGCTATCTT CGCCGCGGCA AGGAAACGCT CCAGCCGGAC GAGATCAAGT CGCTGCGCGT TGCCGATGAT ACCTCGGGCG GCTATCTGGC GCCTGCGGAG TTCAGCGCCG AAGTGGTCAA GGGCATCGTG GAAATGTCGC CGATCCGTCA GGCGGCTCGC GTTGGCTCTA CGTCCAGCGG TGAAGTTCTG CTGCCGAAAC GTACCGGCCG TCCGACCGGA TCGTGGGTTG GCGAAACCGA TGCGCGTCCG GGCACGGAAT CGAGCTATGG TCAGATCGAA GTGCCGATCC ATGAAATGGC TTGCTACGTT GACGTGTCAC AGCGCCTGCT TGAGGACGCG GCAGTCAATG TCGAGTCCGA AGTTGCTTCT GACTTGTCCG AGGAATTCGG TCGGCTTGAA GGTCTCGGCT TCTCGCAGGG CGATGGCGTA AAGAAGCCGA TTGGCATCAT GGAAGCGGCT GGCGTTGCCT ATACCGCGAC CGGCAATGCT TCGACGCTTG GCACCGCGCC GGCCGACACC CTGATCGACG TTTTCTATTC GCTCCCGGCG TACTATCGCA ATCGCGGCGT CTGGCTGATG AATTCGAAGA CGATCGCAGC GGTTCGCAAG CTGAAGGACG GTTCTACCGG TGCCTACCTG TGGCAGCCTG GCCTTGCGCA GGGTGACCCG GCGACGATCC TTGGCCGTCC GCTGATTGAA GATCCGACCA TGGATGACAT CGGCTCCGCT GCCGAGCCTA TCCTGTTCGG TTCGGTTTCC GATGCCTATC GCATCTATGA CCGACTGAAT CTTTCGATCA TGCGCGACCC GTACTCGCAG GCAACGTCTG GCGTTGTCCG CTTCCATGCG CGTCGTCGCA CTGGCGGTGC GTTGGTTCTT GCCGATGCGC TCCGCAAGAT CAAGTGCGCG ACCAGCTAA
|
Protein sequence | MTFHFLETKS AADVDEGDPS IVEVKSALTA LTEDVKKATA PVADLTKRLD EIETKINRPA IHTEKKDEIS DERKAFTGYL RRGKETLQPD EIKSLRVADD TSGGYLAPAE FSAEVVKGIV EMSPIRQAAR VGSTSSGEVL LPKRTGRPTG SWVGETDARP GTESSYGQIE VPIHEMACYV DVSQRLLEDA AVNVESEVAS DLSEEFGRLE GLGFSQGDGV KKPIGIMEAA GVAYTATGNA STLGTAPADT LIDVFYSLPA YYRNRGVWLM NSKTIAAVRK LKDGSTGAYL WQPGLAQGDP ATILGRPLIE DPTMDDIGSA AEPILFGSVS DAYRIYDRLN LSIMRDPYSQ ATSGVVRFHA RRRTGGALVL ADALRKIKCA TS
|
| |