Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nham_1415 |
Symbol | |
ID | 4032333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter hamburgensis X14 |
Kingdom | Bacteria |
Replicon accession | NC_007964 |
Strand | + |
Start bp | 1599481 |
End bp | 1600734 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637969889 |
Product | phage major capsid protein, HK97 |
Protein accession | YP_576697 |
Protein GI | 92116968 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTTCG ATATCACTGA CACCGCGCCG GAGCACAAGT CCGGCATCGC CGCGCGCGGC GACTACGACG ACTTCCGCAC GACCTTCGAG GAGTTCAAAT CCGCTAACGA CGAACGTCTG GCGCGGCTGG AGAAAAAGCG CGGCGACGTG CTGCTGGAGG AAAAGGTCGA CCGCATCAAT GCCGCGCTCG ACGCCCAGCA CAAGCGCATG GATGAGCTGG CTCTGAAGCA CGCGCGCCCG GCGCTCGAAG GCCGCAGCCG CATAGCCAGC GACGCGGCGT CCCGCGAGCA CAAGAGCGCG TTCGAGGCTT ATGTGCGCGG TGGCGAGGCG GGTGCCTTGC GCGACCTGGA GACCAAGGCG ATGTCGGCCG GCTCCAATGC GGATGGCGGC TATCTCGTTC CGGTCGAACT CGAACACGAG ATCGGCGAGC GACTGGCGGC AATCTCGCCG ATCCGCGCGC TCGCGTCGGT GCGCACCATC TCGGGCAATG TCTACAAGAA GCCGTTCATG ACCGCAGGGC CGGCCACCGG CTGGGTCGGC GAGACGGACT CGCGGACGCA GACCACCTCG CCGACGCTGG ACGCGCTGAG CTTCCCGGCG ATGGAGCTTT ACGCCATGCC AGCGGCGACC GCGACGCTGC TCGACGATAG CGCCGTCAAC ATCGACGAGT GGATCGCGCA GGAGGTGGAG CTGACGTTTG CGGTGCAGGA AGGCGCGGCC TTCGTCAACG GCGACGGCAC CAACCAGCCG AAGGGTTTTC TGCAATCGGA TACGGTGGCG AACGGCTCGT GGGTGTGGGG CAAGCTCGGC ACTATCGCCA GCGGCGGCGC GAGCGGTTTC GCGGCGTCGA ATCCGTCCGA TGCGCTGGTG GACCTGATCT ACGCGCTGAA GGCCGGCTAT CGCCAGAACG CCACCTTCGT GATGAACCGC AAGACGCAAG CCGCGATCCG TAAGTTCAAG GACACCGGCG GGGCGTATCT GTGGCAGCCG CCGGCGCAGG CGGGCGGGCG CGCCTCGCTG ATGACGTTCC CGCTGGTCGA GGCCGAGGAC ATGCCGGACG TCGCGGCGAA TTCGCTGTCG ATTGCGTTCG GCGATTTCCG CCGCGGTTAC CTCGTGGTGG ATCGCGCCGG CGTGCGCGTG CTGCGCGATC CGTACTCGGC CAAGCCTTAC GTGCTGTTCT ACACGACGAA GCGCGTCGGC GGCGGCGTGC AGGACTTCGA CGCCATCAAG TTGATGAAGT TCGCAGCGAG TTGA
|
Protein sequence | MDFDITDTAP EHKSGIAARG DYDDFRTTFE EFKSANDERL ARLEKKRGDV LLEEKVDRIN AALDAQHKRM DELALKHARP ALEGRSRIAS DAASREHKSA FEAYVRGGEA GALRDLETKA MSAGSNADGG YLVPVELEHE IGERLAAISP IRALASVRTI SGNVYKKPFM TAGPATGWVG ETDSRTQTTS PTLDALSFPA MELYAMPAAT ATLLDDSAVN IDEWIAQEVE LTFAVQEGAA FVNGDGTNQP KGFLQSDTVA NGSWVWGKLG TIASGGASGF AASNPSDALV DLIYALKAGY RQNATFVMNR KTQAAIRKFK DTGGAYLWQP PAQAGGRASL MTFPLVEAED MPDVAANSLS IAFGDFRRGY LVVDRAGVRV LRDPYSAKPY VLFYTTKRVG GGVQDFDAIK LMKFAAS
|
| |