Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mrad2831_4965 |
Symbol | |
ID | 6141033 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium radiotolerans JCM 2831 |
Kingdom | Bacteria |
Replicon accession | NC_010505 |
Strand | + |
Start bp | 5289487 |
End bp | 5290779 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641630674 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_001757607 |
Protein GI | 170751347 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0000154569 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACGCCC AGACCCCGAT CGCCGAACCC GCCCTCGCGC CGCCGCGCGG CCTGCCGCCG CTGGAGAACA AGGCCGCCCA CCCCGAGGGC GGCCCGGTCC GCTCCGCCCT CGACGAGCTC GCCGCCGCCT TCGCGGCCTT CAAGGAGACG AACGACGCGC GGATCGACCG GATCGAGGGC CGCCTCGGCG TCGACGTGCT CACCGAGGAG AAGCTCGCCC GGATCGACGC GGCCCTCGAC GCCGCCCGCA CCCGCCTCGA CCGCATCGCC CTGGAGCGGG CCCGGCCGCC CCTCGAGCAG CCGGATGCGC GGGGGCCGGG GACCGCGCAC GAGCACAAGG CCGCCTTCGA CCTCTACGTC CGGGCGGGCG AGAGCGCCGG GCTCAAGCGC CTCGAGGCCA AGGCGCTGTC GGCCGGCTCC GGGCCGGACG GCGGCTACCT CGTCCCCGAC ACGATCGAGC GGACCGTGCT GACGCGCCTC GGCCAGGTCT CGCCGATCCG GTCGATCGCC AGCGTTCAGG CGATCTCGGG CGCCCAGTAC AAGCGCGCCG TCTCGGTCGG CGCGCCGGTC ACCGGCTGGG CCGCCGAGAC CGCGCCGCGG CCCGAGACCG CCGCGCCGGC CCTGTCGGAG ATCGCGTTCC CCGCCATGGA GCTCTACGCG ATGCCGGCCG CCACCCAGAC GCTCCTCGAC GATGCCGTGG TGGATCTCGA CGCGTGGCTC TCGGCCGAGG TGGAGACCGC CTTCGCCGAG CAGGAGGGCG TCGCCTTCGT GTCCGGCAAC GGCGCGAGCC GCCCGCGGGG CTTCCTGAGC TACGACACGG TCGCCAACGC CGCCTGGGTG CCGGGCAAGA TCGGCACGGT CGCCACCGGG GCGGCCGGGG CGTTCCCGTC GGCCAGCCCG GGGGACGTGC TGTTCGACCT GATCTACGGG CTGCGCGCGG CCTACCGGCA GAATGCCGGC TTCGTCATGA ACCGGCGCAC CCAGAGCGCG ATCCGCAAGT TCAAGGACTC GGAGGGCAAC TATCTCTGGC AGCCGCCGCT CGCCGCCGGC CGGGCCGCGA CGCTGGTCGG CTTCCCGGTC ACCGAGGCCG AGGCGATGCC GGATCTCGCC AAGGACAGCC TGTCGGTGGC CTTCGGCGAT TTCCGCCGGG GCTACCTCGT GGTCGACCGG ACCGGGATGC GGGTGCTGCG CGACCCGTAC TCGGCCAAGC CCTACGTGCT GTTCTACACC ACCAGGCGCG TCGGCGGCGG GGTGCAGGAC TTCGACGCGC TCAAGCTCCT GAAGTTCTCC TGA
|
Protein sequence | MDAQTPIAEP ALAPPRGLPP LENKAAHPEG GPVRSALDEL AAAFAAFKET NDARIDRIEG RLGVDVLTEE KLARIDAALD AARTRLDRIA LERARPPLEQ PDARGPGTAH EHKAAFDLYV RAGESAGLKR LEAKALSAGS GPDGGYLVPD TIERTVLTRL GQVSPIRSIA SVQAISGAQY KRAVSVGAPV TGWAAETAPR PETAAPALSE IAFPAMELYA MPAATQTLLD DAVVDLDAWL SAEVETAFAE QEGVAFVSGN GASRPRGFLS YDTVANAAWV PGKIGTVATG AAGAFPSASP GDVLFDLIYG LRAAYRQNAG FVMNRRTQSA IRKFKDSEGN YLWQPPLAAG RAATLVGFPV TEAEAMPDLA KDSLSVAFGD FRRGYLVVDR TGMRVLRDPY SAKPYVLFYT TRRVGGGVQD FDALKLLKFS
|
| |