Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nwi_1482 |
Symbol | |
ID | 3675571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter winogradskyi Nb-255 |
Kingdom | Bacteria |
Replicon accession | NC_007406 |
Strand | - |
Start bp | 1610566 |
End bp | 1611723 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637713034 |
Product | Phage major capsid protein, HK97 |
Protein accession | YP_318095 |
Protein GI | 75675674 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.353157 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGGTAAGC ATTTCTCTGA TCTCGAATTC AAAGATACCG ACGACGCCGA TCCGGTCGAT CTGGTGACGA AGGCGGTCAA TGATCTGACT ACCAACGTTG ACGCGCGCTT GAAGGACATC GAGACCAAAG CCGACACGAC CAAGCTCGTG GACCGCATGG ACAAGCTCGA AGCCAAGATT AACCGGCCGG GCAGTGGCGA TCACAAGGAT CCGGACGCCG CGCTCGAAAT CAAGGCGTTC GGCGCTTATG TCCGTTCCGG CATCACGCCG CCTGATCCGC TCGAACTCAA GACGCTGATC GTTTCCAGCG ATCCGCAGGG CGGCTATCTG GCGCCGACCG AGATGGCGAC CGAGTTTATT CGTGATCTGG TGCAGTACTC GCCGATCCGC GGTCTCGCTT CGGTCCGATC CACATCGGCG CCTGCCGTGT CCTATCCCAA GCGCATCGGC CGCACGAATG CTCAATGGCG TGGTGAGACG CAAGCGCAGA CGACTGGTGA GCCGACCTTT GGTCAGCTTG AAATCCCGGT TCGCGAGATC AACACTTACG TCGATATCTC GAACCAGCTT CTCGCCGACA GCGCCGGCGC AGCCGAGTCC GAAGTTCGGA TGGCACTCGC GGAGGACTTC GGCCTGAAAG AGGGACTGTC GTTCCTGAAG GGCACCGGCC CGCTGCAGCC GGAAGGTTTG CTCATCAACG CGGACATCAG CATCGTCGCG ACCGGCAACG CTTCCACGCT TGGCAGCGGA CCGGCCGACA TGCTCATCGA CACGTTCTAT TCGCTTCCGG CTGCTTATCG GAACGCAGGC ACTTGGTTGA TGAATTCCAC GACGCTCGCG GCCATCCGGA AGCTGAAGGA CGGCACGACC GGGACATACC TTTGGTCGCC CGGCTTCCAG GGGCAGGCGG ACACCATCCT GGGTCGTCCA GTCATCGATT GCCCCGACAT GGATGATGTC GGCAGCGGCA CCACGCCGAT CGCATTCGGC GACATCGCCG CGACCTATCG GATCCTCGAC CGTATTGGTT TGTCGATTTT GGCCAATCCA TATTTGCTGG CGACCACCGG CACGACCAGA ATTCATGCCA CGCGGCGTGT CGGCGGTGCG GTCGTTCAGC CAGCGGCGAT GAAGAAGATC GTCTGCAAGA CCTCGTAA
|
Protein sequence | MGKHFSDLEF KDTDDADPVD LVTKAVNDLT TNVDARLKDI ETKADTTKLV DRMDKLEAKI NRPGSGDHKD PDAALEIKAF GAYVRSGITP PDPLELKTLI VSSDPQGGYL APTEMATEFI RDLVQYSPIR GLASVRSTSA PAVSYPKRIG RTNAQWRGET QAQTTGEPTF GQLEIPVREI NTYVDISNQL LADSAGAAES EVRMALAEDF GLKEGLSFLK GTGPLQPEGL LINADISIVA TGNASTLGSG PADMLIDTFY SLPAAYRNAG TWLMNSTTLA AIRKLKDGTT GTYLWSPGFQ GQADTILGRP VIDCPDMDDV GSGTTPIAFG DIAATYRILD RIGLSILANP YLLATTGTTR IHATRRVGGA VVQPAAMKKI VCKTS
|
| |