Gene Mrad2831_4965 details

Gene Information

Locus tag: Mrad2831_4965
Symbol:
ID: 6141033
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium radiotolerans JCM 2831
Kingdom: Bacteria
Replicon accession: NC_010505
Strand:
Start bp: 5289487
End bp: 5290779
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 74%
IMG OID: 641630674
Product: HK97 family phage major capsid protein
Protein accession: YP_001757607
Protein GI: 170751347
COG category: [R] General function prediction only
COG ID: [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
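
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Inclusive length of a feature, assuming 1-based start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(5289487, 5290779)
print(length, protein_length_aa(length))  # prints: 1293 430
```

Both values agree with the Gene Length and Protein Length fields of the record.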


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000154569
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGACGCCC AGACCCCGAT CGCCGAACCC GCCCTCGCGC CGCCGCGCGG CCTGCCGCCG 
CTGGAGAACA AGGCCGCCCA CCCCGAGGGC GGCCCGGTCC GCTCCGCCCT CGACGAGCTC
GCCGCCGCCT TCGCGGCCTT CAAGGAGACG AACGACGCGC GGATCGACCG GATCGAGGGC
CGCCTCGGCG TCGACGTGCT CACCGAGGAG AAGCTCGCCC GGATCGACGC GGCCCTCGAC
GCCGCCCGCA CCCGCCTCGA CCGCATCGCC CTGGAGCGGG CCCGGCCGCC CCTCGAGCAG
CCGGATGCGC GGGGGCCGGG GACCGCGCAC GAGCACAAGG CCGCCTTCGA CCTCTACGTC
CGGGCGGGCG AGAGCGCCGG GCTCAAGCGC CTCGAGGCCA AGGCGCTGTC GGCCGGCTCC
GGGCCGGACG GCGGCTACCT CGTCCCCGAC ACGATCGAGC GGACCGTGCT GACGCGCCTC
GGCCAGGTCT CGCCGATCCG GTCGATCGCC AGCGTTCAGG CGATCTCGGG CGCCCAGTAC
AAGCGCGCCG TCTCGGTCGG CGCGCCGGTC ACCGGCTGGG CCGCCGAGAC CGCGCCGCGG
CCCGAGACCG CCGCGCCGGC CCTGTCGGAG ATCGCGTTCC CCGCCATGGA GCTCTACGCG
ATGCCGGCCG CCACCCAGAC GCTCCTCGAC GATGCCGTGG TGGATCTCGA CGCGTGGCTC
TCGGCCGAGG TGGAGACCGC CTTCGCCGAG CAGGAGGGCG TCGCCTTCGT GTCCGGCAAC
GGCGCGAGCC GCCCGCGGGG CTTCCTGAGC TACGACACGG TCGCCAACGC CGCCTGGGTG
CCGGGCAAGA TCGGCACGGT CGCCACCGGG GCGGCCGGGG CGTTCCCGTC GGCCAGCCCG
GGGGACGTGC TGTTCGACCT GATCTACGGG CTGCGCGCGG CCTACCGGCA GAATGCCGGC
TTCGTCATGA ACCGGCGCAC CCAGAGCGCG ATCCGCAAGT TCAAGGACTC GGAGGGCAAC
TATCTCTGGC AGCCGCCGCT CGCCGCCGGC CGGGCCGCGA CGCTGGTCGG CTTCCCGGTC
ACCGAGGCCG AGGCGATGCC GGATCTCGCC AAGGACAGCC TGTCGGTGGC CTTCGGCGAT
TTCCGCCGGG GCTACCTCGT GGTCGACCGG ACCGGGATGC GGGTGCTGCG CGACCCGTAC
TCGGCCAAGC CCTACGTGCT GTTCTACACC ACCAGGCGCG TCGGCGGCGG GGTGCAGGAC
TTCGACGCGC TCAAGCTCCT GAAGTTCTCC TGA
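
The GC content reported for this gene can be recomputed from a sequence printed in the space-separated 10-base layout used here. This is a minimal sketch: `gc_percent` is a hypothetical helper name, and the rounding the database applies to its reported percentage is not known. The example call uses only the first two blocks of the gene sequence.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence above (13 of 20 bases are G or C).
print(gc_percent("ATGGACGCCC AGACCCCGAT"))  # prints: 65.0
```

Passing the full gene sequence, with its line breaks, to the same function gives the value to compare with the GC content field.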
 
Protein sequence
MDAQTPIAEP ALAPPRGLPP LENKAAHPEG GPVRSALDEL AAAFAAFKET NDARIDRIEG 
RLGVDVLTEE KLARIDAALD AARTRLDRIA LERARPPLEQ PDARGPGTAH EHKAAFDLYV
RAGESAGLKR LEAKALSAGS GPDGGYLVPD TIERTVLTRL GQVSPIRSIA SVQAISGAQY
KRAVSVGAPV TGWAAETAPR PETAAPALSE IAFPAMELYA MPAATQTLLD DAVVDLDAWL
SAEVETAFAE QEGVAFVSGN GASRPRGFLS YDTVANAAWV PGKIGTVATG AAGAFPSASP
GDVLFDLIYG LRAAYRQNAG FVMNRRTQSA IRKFKDSEGN YLWQPPLAAG RAATLVGFPV
TEAEAMPDLA KDSLSVAFGD FRRGYLVVDR TGMRVLRDPY SAKPYVLFYT TRRVGGGVQD
FDALKLLKFS