Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nwi_1167 |
Symbol | |
ID | 3675802 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter winogradskyi Nb-255 |
Kingdom | Bacteria |
Replicon accession | NC_007406 |
Strand | + |
Start bp | 1276327 |
End bp | 1277580 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637712717 |
Product | Phage major capsid protein, HK97 |
Protein accession | YP_317781 |
Protein GI | 75675360 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTTCG ATATCACCGA CATGGCTCCG GAGCATAAGT CCGGAGGCGC CGTCCGCGGC GGCTATGACG ACTTTCGCGT CACCTTCGAG GAATTCAAGG CGGCCAACGA CGAACGCCTG GCGCGGCTGG AGCAAAAGCG CGGCGACGTG CTGCTGGAGG AGAAGGTCGA TCGCATTAAT GCCGCGCTCG ACGCCCATCA CAGGCGCATG GATGAACTTG CGCTCAAGCA TGCCCGCCCG GCGCTCGAAG GCCGCTCCGG TATCGTCGGC GGCGCGGCCG CTCTCGAACA CAAAAGCGCC TTCGAGGCTT ATGTGCGGGG CGGCGAGACC GGCGCCTTGC GCGCGCTGGA GACCAAGGCG ATGTCGGCCG GCTCCAATCC GGATGGGGGC TATCTCGTTC CTGTCGAACT GGAGCACGAG ATCGGGCAGC GGCTGGCGGC GATCTCGCCG ATCCGCGCGC TGGCGTCGGT TCGCACCATC TCGGGCAACG TCTACAAGAA GCCGTTCATG ACCGCGGGCC CCGCCACGGG GTGGGTCGGT GAGACGGATT CGCGGACGCA AACCACCTCG CCGACGCTGG ATGCCCTGAG CTTTCCGGCA ATGGAGCTTT ACGCCATGCC GGCGGCGACC GCGACGCTGC TCGATGATTC CGCCGTCAAC ATCGACGAGT GGATCGCGCA GGAAGTTGAG CTGACATTCG CGGTTCAGGA AGGGGCTGCG TTCGTGAACG GCGACGGCAC CAACAAGCCG AAGGGCTTTC TGCAATCCGA AACCGTCGCC AACGGCTCCT GGGCGTGGGG CAAGCTGGGC TTCGTCGCCA GCGGCGGCGC GAGCGGCTTT GCGTCGTCCA ATCCTTCCGA TGCGCTGGTC GATCTCGTCT ATGCGCTGAA GGCAGGCTAT CGCCAGAATG CCGCGTTCAT CATGAACCGC AAGACGCAGG CCGCGATCCG CAAGTTCAAG GACACCGGCG GGTCCTATCT GTGGCAGCCG CCGGCGCAGG CTGGCGGGCG CGCCTCGCTG ATGACCTTCC CGCTGGTCGA GGCCGAGGAT ATGCCGGATG TCGCCGAGAA TTCGCTGTCG ATCGCGTTCG GCGATTTCCG TCGCGGCTAT CTCGTGGTGG ACCGCGCCGG CGTCCGCGTG CTGCGCGATC CGTATTCGGC CAAGCCCTAC GTGCTGTTCT ACACGACCAA GCGGGTCGGC GGCGGCGTGC AGGACTTCGA CGCCATCAAG CTGATGAAGT TCGCGGCGAG CTGA
|
Protein sequence | MDFDITDMAP EHKSGGAVRG GYDDFRVTFE EFKAANDERL ARLEQKRGDV LLEEKVDRIN AALDAHHRRM DELALKHARP ALEGRSGIVG GAAALEHKSA FEAYVRGGET GALRALETKA MSAGSNPDGG YLVPVELEHE IGQRLAAISP IRALASVRTI SGNVYKKPFM TAGPATGWVG ETDSRTQTTS PTLDALSFPA MELYAMPAAT ATLLDDSAVN IDEWIAQEVE LTFAVQEGAA FVNGDGTNKP KGFLQSETVA NGSWAWGKLG FVASGGASGF ASSNPSDALV DLVYALKAGY RQNAAFIMNR KTQAAIRKFK DTGGSYLWQP PAQAGGRASL MTFPLVEAED MPDVAENSLS IAFGDFRRGY LVVDRAGVRV LRDPYSAKPY VLFYTTKRVG GGVQDFDAIK LMKFAAS
|
| |