Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nwi_1542 |
Symbol | |
ID | 3676370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter winogradskyi Nb-255 |
Kingdom | Bacteria |
Replicon accession | NC_007406 |
Strand | - |
Start bp | 1687660 |
End bp | 1688919 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637713097 |
Product | Phage major capsid protein, HK97 |
Protein accession | YP_318155 |
Protein GI | 75675734 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATCG CAATGAAATC CCGCGCCCGC GGGCTCGTCG CCGCGCGCGC CGATGCCGGC AGCGCAACGG CTATCCTCAA CGAGCTGCGC CAGACATTCG AAACGTTCAA AGCCGAGCGC GAGAAAGAAA TCGCTGACCT TAAGAAGGGT TTGGGCGATG TCGTGCAGTC AGAAAAGGTT GATCGCATCA ACGCCGAGAT CACGAAGCTG CAGGAACAGC TGGACCAGGT GAACTCGTCG ATCGCCGCTC TCAAGGTCGG CGGCGCTGGC GACGACAAGC CGCTGGCCGC CGAGCGTCGC GAGCATGCGA GAGCCTTCAA CCAGTTCTTC CGCAATGGCG CGGAAAACGG CTTGCGGGAT CTGGAAGTGA AAGCCGCGTT GCGCACCGAC AGTGATCCGG ATGGCGGCTT CGTCGTTCCG GACCAGATGG AAGCGGCCAT TGACCGCGTG CTCGGCACAG TGTCGGCGAT GCGCGCGATC TCGCGCGTCA TGTCGATTTC GTCCGGCACC TATAAGAAGT TGGTCAATCA GGGCGGCGCG GTCGGCGGTT GGGTCGGCGA GCGGCAGGCT CGCCCAGCGA CCGCCACTCC AACGTTGGTG GAATTGGCCT TCCAGGCGAT GGAACTCTAC GCTAACCCGG CGGCGACCCA GACGCTCCTC GATGATTCCC GCGTCAATAT CGAGCAGTGG CTCGCCGATG AGGTATCGAT CACCTTCGCT GAAATGGAAG GCGCGAGCTT CATCATCGGC GACGGCGTCG GCAAGCCGCG TGGGCTTCTT TCCTACGACA CGGTAGCCGA TACCTCTTAT GCGTGGGGCA AGCTCGGCTA TGTCGTCTCC GGAGTGGCGG CAGCGATGAC CGACTCGTCG CACAACGGCG CGGATGCGCT TACCGATCTG GTTTACTCGA TCAAGCAGGG CTACCGGCAG AATGCGCGGT TCCTCATGAA TCGGAAGACG CAGGCCGCGA TCCGCAAGTT CAAGTCGAAA ACCGAGGAAT TGTATCTGTG GCAGCCGTCG ATTCAGGCCG GTCAGCCGGC GACGATCCTC GGATATCCGG TGACGGATGA CGATAACATG CCGGACGCGA CCGCCGGCGG GAATTTCCCG ATCGCCTTCG GCGACTTCCA GCGCGGCTAC CTGATCGTTG ACCGCATGGG CGTGCGCGTG CTGCGCGACC CGTTCACCAA CAAGCCTTAC GTGCACTTCT ATACCACTTG TCCGTCGTCA GATAATTTGA GACTGATCAG ACATTTCTAG
|
Protein sequence | MTIAMKSRAR GLVAARADAG SATAILNELR QTFETFKAER EKEIADLKKG LGDVVQSEKV DRINAEITKL QEQLDQVNSS IAALKVGGAG DDKPLAAERR EHARAFNQFF RNGAENGLRD LEVKAALRTD SDPDGGFVVP DQMEAAIDRV LGTVSAMRAI SRVMSISSGT YKKLVNQGGA VGGWVGERQA RPATATPTLV ELAFQAMELY ANPAATQTLL DDSRVNIEQW LADEVSITFA EMEGASFIIG DGVGKPRGLL SYDTVADTSY AWGKLGYVVS GVAAAMTDSS HNGADALTDL VYSIKQGYRQ NARFLMNRKT QAAIRKFKSK TEELYLWQPS IQAGQPATIL GYPVTDDDNM PDATAGGNFP IAFGDFQRGY LIVDRMGVRV LRDPFTNKPY VHFYTTCPSS DNLRLIRHF
|
| |