Gene Information |
Locus tag | Nham_1719 |
Symbol | |
ID | 4030319 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter hamburgensis X14 |
Kingdom | Bacteria |
Replicon accession | NC_007964 |
Strand | + |
Start bp | 1921475 |
End bp | 1922629 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637970191 |
Product | phage major capsid protein, HK97 |
Protein accession | YP_576995 |
Protein GI | 92117266 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
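The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration that assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1921475, 1922629)
print(length)                     # 1155
print(protein_length_aa(length))  # 384
```

Both values agree with the Gene Length and Protein Length rows of the record.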
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTTCC ACACTGCTAC TGCTCTTGAG ACCAAGAACG CAGCCGAACT TCCTGCGGAA GACGGCGCGG CCGAAATCAA GTCGGCGCTT GAAGCGCTGA CCGCTGACGT CAACACGAAG ACGGCACCGG TCGCCGATCT TGAAAAGCGA CTTGCCGCGG CCGAAGTCAA ACTCGCCCGC CCTGCCATTC ATACTGAGAA GAAGGACGAG ATCAGCGCTG AACGCAAAGC ATTCACCGGC TACCTGCGCA ACGGCAAAGA GACGTTGACC GCGGATGAGG TGAAGTCGCT CACGATCGCC CCAGATAGCT CTGGAGGTTA CCTCGCTCCG ATTGAGTTCA GCGCGGAAGT CGTGAAGGGT ATTGTCGAGC AATCGCCCGT TCGCCAGGCT GCTCGCGTGG GCAACACTTC GAGCGGCGAA GTTCTAATTC CTAAGCGCAC GGGTCGCCCG ACTGGCAAGT GGGTCGGTGA AACCGAGACC CGCACCGGAA CGGAGTCCAG TTACGGTCAG GTTGAGATCC CGATCCATGA AATGGCGTGC TATGTGGATG TTTCGCAGCG CCTCTTGGAA GACGCCGCGG TCAACGTTGA AGCCGAAGTT GCCTTCGACC TCGCCGAGGA ATTCGGACGT CTGGAGGCTC TTGGTTTCCA GCGTGGTGAT GGCGTGAAGA AACCGCTCGG CGTCATGGCG TCTGCGGGTA TCGCCTACAC GCCAACGGGC AACGCTTCGA CGCTCGGCAC GAATCCTGCC GACACTATCA TCGATGCGTT CTACGCCTTG CCGGCGTTCT ATCGTGGCCG GTCGGTTTGG ATGATGAACT CCAAAACCAT AGCCACCGTT CGCAAGCTGA AGGATGGCAC CACGGGCAGT TACCTGTGGC AGCCCGGTTT GGCCGCGTCT GATCCTTCGA CCATCCTCGG GCGTCCGGTT ATCGAAGATA ATACGCTCGA CGATGTTGGC AGCGCAGCAG AGCCGATCGT GTTCGGCGAC TTCGCCTCTG CGTACCGCAT TTATGATCGC GTCGCATTGA GCCTGCTTCG TGACCAGTAC AGCCAAGCTG CAAATGGTCT GGTGCGTTTC CACGCGCGTC GTCGTGTTGG TGGTGCGTTG GTTCTCGCCG ATGCGCTGCG CAAGATCAAG TGCGCGACCA GCTAA
Protein sequence | MTFHTATALE TKNAAELPAE DGAAEIKSAL EALTADVNTK TAPVADLEKR LAAAEVKLAR PAIHTEKKDE ISAERKAFTG YLRNGKETLT ADEVKSLTIA PDSSGGYLAP IEFSAEVVKG IVEQSPVRQA ARVGNTSSGE VLIPKRTGRP TGKWVGETET RTGTESSYGQ VEIPIHEMAC YVDVSQRLLE DAAVNVEAEV AFDLAEEFGR LEALGFQRGD GVKKPLGVMA SAGIAYTPTG NASTLGTNPA DTIIDAFYAL PAFYRGRSVW MMNSKTIATV RKLKDGTTGS YLWQPGLAAS DPSTILGRPV IEDNTLDDVG SAAEPIVFGD FASAYRIYDR VALSLLRDQY SQAANGLVRF HARRRVGGAL VLADALRKIK CATS
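The sequences above are printed in space-separated blocks of ten. The sketch below shows one way to strip that formatting, compute GC content, and translate the start of the gene sequence so it can be compared with the protein sequence. It is an illustration under stated assumptions: the helper names are hypothetical, the codon-to-amino-acid assignments are those of the standard genetic code (which translation table 11 shares; table 11 differs in which codons may act as starts), and special handling of alternative start codons is omitted because this gene begins with ATG.

```python
# Standard genetic code in NCBI ordering (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def strip_blocks(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = strip_blocks(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = strip_blocks(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 bases of the gene sequence, copied from the record with its block spacing.
head = "ATGACTTTCC ACACTGCTAC T"
print(translate(head))  # MTFHTAT
```

The printed residues match the first seven residues of the protein sequence. Passing the full gene sequence to `gc_percent` would allow a comparison with the GC content row.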