Gene Mpop_2686 details


Gene Information

Locus tag: Mpop_2686
Symbol:
ID: 6311223
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium populi BJ001
Kingdom: Bacteria
Replicon accession: NC_010725
Strand:
Start bp: 2909697
End bp: 2911097
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 66%
IMG OID: 642651408
Product: phage major capsid protein, HK97 family
Protein accession: YP_001925376
Protein GI: 188581931
COG category: [R] General function prediction only
COG ID: [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein
TIGRFAM ID:
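
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the reported gene length is consistent with it) and that the final codon is a stop codon that contributes no residue. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2909697
end_bp = 2911097
gene_length_bp = 1401
protein_length_aa = 466

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the terminal stop codon encodes no residue.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # 1401 0 466
```

Under these assumptions the span matches the reported gene length, the length is a whole number of codons, and the codon count minus the stop codon matches the reported protein length.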


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCGCT CCGAGCGCCT GAAAGCGAAG TACCTCGAAC TCGTCGCCAA GTCGAAGGGG 
ATGATCGAGG CCGCGGAGAA GCGCGAGAAC AAGGACTTCA CCGACCAGGA ATCGAAGGAC
TTTGACGCGC TGACGGAGGA GATGGACGGC ACCTTCAAGG CCTATGAGCA GGCGCTCAAG
GCGGAGAAGG CTCAGGCCGG CGAGGAGTCC GTGCCGACGA ACGAGACCAA CGGCAACGCC
AATCAGGCAG CCGGCGCGGG CGAGCTCGCT GAGGGCGAGC GCGAGAAGCT TTACGCTCGC
ACCGAGACCA AGATGGAGAC CGGCCAGAAG ATCTCGCTCA TCGCCGCCTC GGTGGTGAAG
GCGAAGCTGT CCGGCGGCGA GAAGAACGCG TTCCAGGTGC TCAGCGACGA GGGCTTCCCT
CAGTTTGCTC TCGACATCCA GCGCGGCAGC CGGAACAAGT CCTCGAACAC CCTGACCCCG
GCCGCGGGCG GTGTGCTGCT GCCGACGCCG CTGGCGGCGG AGGTCGTGCC GTTCCTGCGT
CCCGAGACCA CGTTCCTTCA GCTCAACCCC GTGCGCGTGC CGCTCACCGC CGGCCAGTAC
AATCAGCCGG TCGGTGCAAC CGGCGCCGTG GCGCAGTACG TCGGCGAGGG CCAGAAGAAG
CCCGTCACCG ACGTGACCTT CGACAAGCTC GGTCTGAAGG CGAAGAAGCT GGCCGCGATC
ATCCTCCTGA CGAAGGAGGC CAAGAAGTGG ACCATCATCG ACATCCAGGC CTACATCGAG
CGCGAGTTGC GCAACGCAGG CGGCCAGACC CTCGACCTCA ACGGCTGGCT CGGTACCGGT
GCGAACGCCG ACACCCCGAC CGGCATCCTG AACGTATCGG GCGTCGGCGT CGTCACGCAC
ACCTTCGCCG ATCCGAAGGC GCCGACCCTG AAGGAACTCG ATGCAGCAGC GTCGAAGCTG
ATCCTCTACA TGACCCTCCG GTTCATCCCG GAGACGTCCC GTTGGGCGTG GGTCATGAAC
CCGCGGACCC TGCGCTATCT CGCGGACATG CGCGTCGGCG CCGGCACCGA TGGCGAATAC
GCCTTCCCTG AACTGCAGGG CGAGAACCCG CGGTGGAAGG GCAAGCGCGT GCTGGTCTCG
ACGCAGATCC CGGCGAACCT CGGCACCGGT CTCGACGAGT CGATCCTCGC CCTGGTCAAC
GCCGACGACG TGATCTTCGG CGAGGAAGAG GATGTCAGCC TCGACTTCTC CATGGAGGCG
ACGATCGACG TCGGCGGCAC CCTCGTCCAC CTGTTCCAGC AGAACATGTG GGGCGTGCTC
ATGGAGATGG CCCACGACTT CGGCCTCCGC CGCAAGGCCT CGGTGGTCCG CCTCAACGGC
GTGCGCTGGG GCGCCCCGTA G
 
Protein sequence
MKRSERLKAK YLELVAKSKG MIEAAEKREN KDFTDQESKD FDALTEEMDG TFKAYEQALK 
AEKAQAGEES VPTNETNGNA NQAAGAGELA EGEREKLYAR TETKMETGQK ISLIAASVVK
AKLSGGEKNA FQVLSDEGFP QFALDIQRGS RNKSSNTLTP AAGGVLLPTP LAAEVVPFLR
PETTFLQLNP VRVPLTAGQY NQPVGATGAV AQYVGEGQKK PVTDVTFDKL GLKAKKLAAI
ILLTKEAKKW TIIDIQAYIE RELRNAGGQT LDLNGWLGTG ANADTPTGIL NVSGVGVVTH
TFADPKAPTL KELDAAASKL ILYMTLRFIP ETSRWAWVMN PRTLRYLADM RVGAGTDGEY
AFPELQGENP RWKGKRVLVS TQIPANLGTG LDESILALVN ADDVIFGEEE DVSLDFSMEA
TIDVGGTLVH LFQQNMWGVL MEMAHDFGLR RKASVVRLNG VRWGAP
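
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is an illustrative example, not the tool that produced this record. It assumes the standard codon-to-amino-acid assignments, which NCBI translation table 11 shares with the standard code (table 11 differs in the start codons it permits). The helper names `clean`, `translate`, and `gc_percent` are hypothetical. The input is the first 60 bases of the gene sequence above.

```python
from itertools import product

# Compact codon table in NCBI order: bases T, C, A, G, first position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base grouping and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Return the percentage of bases that are G or C."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown above.
first_line = "ATGAAGCGCT CCGAGCGCCT GAAAGCGAAG TACCTCGAAC TCGTCGCCAA GTCGAAGGGG"
print(translate(first_line))  # MKRSERLKAK YLELVAKSKG without the spacing
```

The translated prefix agrees with the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would be the natural way to compare against the reported protein length and GC content.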