Gene Mvan_4361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4361 
Symbol 
ID4649381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4676582 
End bp4677751 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content65% 
IMG OID639807833 
Productphage major capsid protein, HK97 
Protein accessionYP_955144 
Protein GI120405315 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.455164 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.407566 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACTC GCATTGTTGA CGACCCCTGG GACACGACGC AGATCCGGGG CTCCGGCGAC 
GATTACGTGA CCCAAATGCG CTCCCATGCT TGCGCCGCGA TCGAGAAGAT GCCGTTCGCT
GACGACAAGG TTCGCGAGGT GGCCACCCGG TTCGTCGAAC GTGACGGCGA AGAGCACGCT
CCCGTTGCCA ATCTCGTTCT GGCCACGACG TCGCCCGACT ACAGCAGGGC GTTCACGAAG
ATGATCCGGT CTCGCGGTAA CCCGACCGTG CTGTCCGGTC GGGAAGTTCA GGCCTACCAG
CGGGCCATGT CCCTGACCGA CAATCAGGGT GGCTTCCTGG TGCCTATGCA GCTCGATCCG
ACGATCATCT TGACCGCCAA CGGGTCGTTC AATCAGGTGC GCCAGATCTC ACGCGTGGTG
CAGGCCACCG GCAAGTCGTG GACCGGTGTC ACCTCGGCCG GCGTGTCCGG GTCGTGGGAC
GGGGAAGCCG TTGAGGTCTC CGACGACTCG CCAGAGCTGC AGCAGCCGGA GATCCCGGTG
CACAAGCTGC AGATCTGGGT CGAGTTCTCC CACGAGCTCC AGCACGACGC GGCGGGTCTG
GCTGATGACA TCGCCAAGAT GATCGCCTTC GAGAAGGACG TGAAGGAGTC GATCGCGTTC
GCGACGGGTT CGGGCGTCGG CCAGCCCAGG GGCGTCATCA CCGCTCTGAT GGGCAGCGAC
TCCGTTGTCA ATTCGGCCGT GACGGATACG TTCGCCGCCG GCGACGTGCA CAACCTCGAC
GGTGACCTGC CGCAGCGGTA TGCGTTCAAC GCGTCGTGGC TGGCGCACCG CAAGATCTAC
AGCAAGATCC GCCAGTTCGA CACCAACGGC GGCGCATCGC TGTGGGGTCA GCTCGCCGAA
GGGCGCAAGT CCGAACTCCT CGGCCGGCCC GACTACGTCG CCGAGGCGAT GGATAGCTCG
ATCACCAACG GGCAGGACAA CCACGTCCTG GCGTTCGGCG ACTTCCAGAA CTTCGTCATT
GCGGACCGGT TGGGCACCAC CTTGTCCTAC ATCCCGAACC TGATGGGGCC GAACGGGCGC
CCGGTCGGCA AGGCGGGATG GCATGCCTGG ATCCGTGTCG GTTCCGACGT CGTCAACCCG
GGCGCGTTCC GGCTGCTGAA CGTCACGTAG
 
Protein sequence
MTTRIVDDPW DTTQIRGSGD DYVTQMRSHA CAAIEKMPFA DDKVREVATR FVERDGEEHA 
PVANLVLATT SPDYSRAFTK MIRSRGNPTV LSGREVQAYQ RAMSLTDNQG GFLVPMQLDP
TIILTANGSF NQVRQISRVV QATGKSWTGV TSAGVSGSWD GEAVEVSDDS PELQQPEIPV
HKLQIWVEFS HELQHDAAGL ADDIAKMIAF EKDVKESIAF ATGSGVGQPR GVITALMGSD
SVVNSAVTDT FAAGDVHNLD GDLPQRYAFN ASWLAHRKIY SKIRQFDTNG GASLWGQLAE
GRKSELLGRP DYVAEAMDSS ITNGQDNHVL AFGDFQNFVI ADRLGTTLSY IPNLMGPNGR
PVGKAGWHAW IRVGSDVVNP GAFRLLNVT