Gene Mchl_1705 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_1705 
Symbol 
ID7116850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp1754142 
End bp1755407 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content70% 
IMG OID643524469 
Productphage major capsid protein, HK97 family 
Protein accessionYP_002420496 
Protein GI218529680 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.154021 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.125329 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGA TGCGTTTCGA GACCAAGGCC CCCACCGGCC TACCCGAGAA CAAGGCGGCG 
ACCTTCGGCA CCGAAGCGGT GCTCGACGAG TTCGCCCGCG CCTTCGAGGC GTTCAAGGAG
GCCAACGACG TCCGCCTCTC CGAGATCGAG ACCCGGCTCA CCGCGGATGT GGTTACGGAG
GAGAAGCTCA TCCGCATCGA CGCCGCCCTC GATCAGGCGA AGAACCGCCT CGACCGGATC
AGCCTCGACC GTGCCCGGCC GCCGCTCGGC GGGACGGAGC CGGCCCGCGA CGCCTCCGCC
ACCGAGCACA AGGCGGCCTT CGACCTTTAT GTTCGGGCCG GCGAGAGCGC CGGCCTCAAG
CGGCTGGAAG AAAAGGCACT TTCCGCCGGC TCCGGGCCGG ATGGCGGCTA CCTCGTGCCG
CCGACGATCG AGCGCGAGGT GCTGCGTCGG CTCGCCGAAA TCTCGCCGAT CCGCGCGATC
GCCACGGTGC GGACCGTCTC CGGCGGCCAG TACAAGCGAG CCGTCTCGGT CAACGGTCCC
GCCGCCGGCT GGGTCGCCGA GACCGCGCCC CGGCCGCAGA CCGACACGCC AAACCTGTCC
GAGCTGAGCT TTCCGGCGAT GGAGCTCTAC GCCATGCCGG CCGCGACCCA GACGCTGCTC
GACGACGCGG TGCTCGATAT CGATGCGTGG CTCGCCGAGG AGGTCGAGAC GGCCTTCGCC
GAGCAGGAGA GCGTCGCCTT CGTCACCGGC AATGGCGTCG GTCGGCCGAA GGGCTTTCTC
AGCTACGACA CCGTCGCCAA CGCGAACTGG GCTTCGGGCA GGCTCGGCTT CATCGCGACG
GGGGCGGCCG GCGCCTTCCC CGCGAGCAAC CCGAGCGACG TGCTGTTCGA TCTCATCTAC
GCACTGCGCG CCGGCTATCG CCAGGGTGCG AGCTTCGTGA TGAACCGGCG GGTGCAGAGC
GCGATCCGCA AGTTCAAGGA CGCGGACGGC AACTACCTCT GGCAGCCGCC GCTTGCCGCC
GACCGGGCCG CGACGCTGAT GGGCTTTCCG CTGGTCGAGG CCGAGGCGAT GCCCGACGTC
GCCGCCAGCA GCCACGCCAT CGCCTTCGGC GACTTCAAGC GCGGCTACCT CGTCGTAGAC
CGCGTCGGCC TACGGACCCT GCGCGATCCC TACTCCGCCA AGCCCTACGT GCTGTTCTAC
ACCACCAAGC GCGTCGGCGG CGGGGTGCAG GACTTTGCCG CGATCAAGCT GCTCCGGTTC
GCCTGA
 
Protein sequence
MTEMRFETKA PTGLPENKAA TFGTEAVLDE FARAFEAFKE ANDVRLSEIE TRLTADVVTE 
EKLIRIDAAL DQAKNRLDRI SLDRARPPLG GTEPARDASA TEHKAAFDLY VRAGESAGLK
RLEEKALSAG SGPDGGYLVP PTIEREVLRR LAEISPIRAI ATVRTVSGGQ YKRAVSVNGP
AAGWVAETAP RPQTDTPNLS ELSFPAMELY AMPAATQTLL DDAVLDIDAW LAEEVETAFA
EQESVAFVTG NGVGRPKGFL SYDTVANANW ASGRLGFIAT GAAGAFPASN PSDVLFDLIY
ALRAGYRQGA SFVMNRRVQS AIRKFKDADG NYLWQPPLAA DRAATLMGFP LVEAEAMPDV
AASSHAIAFG DFKRGYLVVD RVGLRTLRDP YSAKPYVLFY TTKRVGGGVQ DFAAIKLLRF
A