Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_4361 |
Symbol | |
ID | 4649381 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 4676582 |
End bp | 4677751 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639807833 |
Product | phage major capsid protein, HK97 |
Protein accession | YP_955144 |
Protein GI | 120405315 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.455164 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.407566 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACTC GCATTGTTGA CGACCCCTGG GACACGACGC AGATCCGGGG CTCCGGCGAC GATTACGTGA CCCAAATGCG CTCCCATGCT TGCGCCGCGA TCGAGAAGAT GCCGTTCGCT GACGACAAGG TTCGCGAGGT GGCCACCCGG TTCGTCGAAC GTGACGGCGA AGAGCACGCT CCCGTTGCCA ATCTCGTTCT GGCCACGACG TCGCCCGACT ACAGCAGGGC GTTCACGAAG ATGATCCGGT CTCGCGGTAA CCCGACCGTG CTGTCCGGTC GGGAAGTTCA GGCCTACCAG CGGGCCATGT CCCTGACCGA CAATCAGGGT GGCTTCCTGG TGCCTATGCA GCTCGATCCG ACGATCATCT TGACCGCCAA CGGGTCGTTC AATCAGGTGC GCCAGATCTC ACGCGTGGTG CAGGCCACCG GCAAGTCGTG GACCGGTGTC ACCTCGGCCG GCGTGTCCGG GTCGTGGGAC GGGGAAGCCG TTGAGGTCTC CGACGACTCG CCAGAGCTGC AGCAGCCGGA GATCCCGGTG CACAAGCTGC AGATCTGGGT CGAGTTCTCC CACGAGCTCC AGCACGACGC GGCGGGTCTG GCTGATGACA TCGCCAAGAT GATCGCCTTC GAGAAGGACG TGAAGGAGTC GATCGCGTTC GCGACGGGTT CGGGCGTCGG CCAGCCCAGG GGCGTCATCA CCGCTCTGAT GGGCAGCGAC TCCGTTGTCA ATTCGGCCGT GACGGATACG TTCGCCGCCG GCGACGTGCA CAACCTCGAC GGTGACCTGC CGCAGCGGTA TGCGTTCAAC GCGTCGTGGC TGGCGCACCG CAAGATCTAC AGCAAGATCC GCCAGTTCGA CACCAACGGC GGCGCATCGC TGTGGGGTCA GCTCGCCGAA GGGCGCAAGT CCGAACTCCT CGGCCGGCCC GACTACGTCG CCGAGGCGAT GGATAGCTCG ATCACCAACG GGCAGGACAA CCACGTCCTG GCGTTCGGCG ACTTCCAGAA CTTCGTCATT GCGGACCGGT TGGGCACCAC CTTGTCCTAC ATCCCGAACC TGATGGGGCC GAACGGGCGC CCGGTCGGCA AGGCGGGATG GCATGCCTGG ATCCGTGTCG GTTCCGACGT CGTCAACCCG GGCGCGTTCC GGCTGCTGAA CGTCACGTAG
|
Protein sequence | MTTRIVDDPW DTTQIRGSGD DYVTQMRSHA CAAIEKMPFA DDKVREVATR FVERDGEEHA PVANLVLATT SPDYSRAFTK MIRSRGNPTV LSGREVQAYQ RAMSLTDNQG GFLVPMQLDP TIILTANGSF NQVRQISRVV QATGKSWTGV TSAGVSGSWD GEAVEVSDDS PELQQPEIPV HKLQIWVEFS HELQHDAAGL ADDIAKMIAF EKDVKESIAF ATGSGVGQPR GVITALMGSD SVVNSAVTDT FAAGDVHNLD GDLPQRYAFN ASWLAHRKIY SKIRQFDTNG GASLWGQLAE GRKSELLGRP DYVAEAMDSS ITNGQDNHVL AFGDFQNFVI ADRLGTTLSY IPNLMGPNGR PVGKAGWHAW IRVGSDVVNP GAFRLLNVT
|
| |