Gene Information |
Locus tag | Mchl_1784 |
Symbol | |
ID | 7114579 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | - |
Start bp | 1836984 |
End bp | 1837976 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643524548 |
Product | Rubrerythrin |
Protein accession | YP_002420575 |
Protein GI | 218529759 |
COG category | [S] Function unknown |
COG ID | [COG1633] Uncharacterized conserved protein |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0899507 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.128908 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAGCC GTCTCTCGAT GCTGACGCTC GGCGGCCGGA AACGGTTCGA CGATCTCAGC GAGCAAGAGA TCCTGGCCCT CGCGATCGGG TCCGAGGAGG AGGACGGCCA GATCTACCGG GCCTATGCCG GGCGCCTGCG CGGCGATTAT CCCCATTCGG CCGCCCTGTT CGACGCGATG GCGGAGGCCG AGGACGAGCA TCGCCGCCGC CTGATCGCGC GCTACAAAGA GCGCTTCGGC GACTTCATCA TCCCGATCCG GCGCGAGCAC ATCGCCGGCT ATTACAGCCG CAAGCCGGTC TGGCTGATGC GCAATCTCGG GCTCGACCGG GTCCGCGAGG AGGCCGCCGC GATGGAGCGG CAGGCGCGCG ACTTCTACCT CGCTGCCGCC CGGCGCTCGA CCGACGCCGA CACGCGCCGC CTGCTCGGCG ACCTCGCCGC GGCGGAAAGC GCGCACGAGC GGACGGCGGA GGCGCTGGCG GACGAGCATC TCGGCGGCTC CGTCCGCGAC GAGGAGGACG CGGCCGCCCA TCGCCAGTTT ATCCTGACCT GGGTCCAGCC GGGGCTTGCC GGCCTCATGG ACGGGTCGGT CTCGACGCTC GCGCCGATCT TCGCCACGGC GTTTGCCACA CAGAACCCGT GGACGACCTT TCTCGTCGGC CTCTCGGCCT CGATCGGCGC GGGCATCTCG ATGGGCTTCA CCGAGGCCGC GCACGACGAC GGCAAGATCT CCGGGCGCGG CTCACCGCTG AAACGCGGCC TCGCCTCCGG CGTGATGACC GCGCTCGGCG GCCTCGGCCA CGCGCTGCCC TACCTGATCC CGAACTTCTG GCTGGCGACG AGCATCGCCT TCGCCGTGGT CTTCTTCGAG CTCTGGGCCA TCGTCTGGAT TCAGAACCGC TACATGGAGA CGCCCTTCCT GCGGGCCGCC TTCCAGATCG TGCTCGGCGG CTCCCTCGTG CTCGCGACGG GCATCCTCAT CGGCGGCGCC TGA
|
Protein sequence | MMSRLSMLTL GGRKRFDDLS EQEILALAIG SEEEDGQIYR AYAGRLRGDY PHSAALFDAM AEAEDEHRRR LIARYKERFG DFIIPIRREH IAGYYSRKPV WLMRNLGLDR VREEAAAMER QARDFYLAAA RRSTDADTRR LLGDLAAAES AHERTAEALA DEHLGGSVRD EEDAAAHRQF ILTWVQPGLA GLMDGSVSTL APIFATAFAT QNPWTTFLVG LSASIGAGIS MGFTEAAHDD GKISGRGSPL KRGLASGVMT ALGGLGHALP YLIPNFWLAT SIAFAVVFFE LWAIVWIQNR YMETPFLRAA FQIVLGGSLV LATGILIGGA
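The length and composition fields in this record can be cross-checked against the coordinates and the sequence itself. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The helper names (`clean_sequence`, `gc_fraction`, `span_length`, `expected_protein_length`) are hypothetical and chosen only for illustration.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace separating the 10-character groups."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the last codon is a stop codon."""
    return gene_length_bp // 3 - 1


# Coordinates taken from the record above.
gene_length = span_length(1836984, 1837976)
print(gene_length)                           # 993
print(expected_protein_length(gene_length))  # 330

# Only the first two groups of the gene sequence are used here for brevity;
# paste the full "Gene sequence" field to check the reported GC content.
prefix = clean_sequence("ATGATGAGCC GTCTCTCGAT")
print(len(prefix), gc_fraction(prefix))
```

Under these assumptions the coordinates reproduce the listed Gene Length of 993 bp and Protein Length of 330 aa.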