Gene Information |
Locus tag | Mext_1505 |
Symbol | |
ID | 5831200 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 1682287 |
End bp | 1683279 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641367305 |
Product | rubrerythrin |
Protein accession | YP_001638977 |
Protein GI | 163850934 |
COG category | [S] Function unknown |
COG ID | [COG1633] Uncharacterized conserved protein |
TIGRFAM ID | |
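The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1682287, 1683279)
print(length, protein_length(length))  # 993 330
```

Both values agree with the Gene Length and Protein Length rows of this record.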
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.276735 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAGCC GTCTCTCGAT GCTGACGCTC GGCGGCCGGA AACGGTTCGA CGATCTCAGC GAGCAGGAGA TCCTGGCCCT CGCGATCGGG TCCGAGGAGG AGGACGGCCA GATCTACCGG GCCTATGCCG GGCGCCTGCG CGGCGAATAC CCCCATTCGG CCGCCTTGTT CGACGCGATG GCGGAGGCCG AGGACGAGCA TCGCCGCCGC CTGATCGCGC GCTACAAGGA GCGCTTCGGC GACTTCATCA TCCCGATCCG GCGCGAGCAC ATTGCCGGCT ATTACAGCCG CAAGCCGGTC TGGCTGATGC GCAATCTCGG GCTCGACCGG GTCCGCGAGG AGGCCGCCGC GATGGAGCGG CAGGCGCGCG ACTTCTACCT CGCCGCCGCC CGGCGCTCGA CCGACGCCGA CACGCGCCGC CTGCTCGGCG ACCTCGCCGC GGCGGAGAGC GCGCACGAGC GGACGGCCGA GGCGCTGGCG GACGAGCATC TCGGCGGCTC CGTCCGCGAC GAGGAGGACG CGGCCGCCCA TCGCCAGTTT ATCCTGACCT GGGTCCAGCC GGGGCTTGCC GGCCTCATGG ACGGGTCGGT CTCGACGCTC GCGCCGATCT TCGCCACGGC GTTTGCCACG CAGAACCCGT GGACGACCTT CCTCGTCGGC CTCTCGGCCT CGATCGGCGC GGGCATCTCG ATGGGCTTCA CCGAGGCCGC GCACGACGAC GGCAAGATCT CCGGGCGCGG CTCACCGCTG AAGCGCGGCC TCGCCTCCGG CGTGATGACC GCGCTCGGCG GCCTCGGCCA CGCGCTGCCC TACCTGATCC CGAACTTCTG GCTGGCGACG AGCATCGCCT TCGCCGTGGT CTTCTTCGAG CTCTGGGCCA TCGTCTGGAT TCAGAACCGC TACATGGAGA CGCCCTTCCT GCGGGCCGCC TTCCAGATCG TGCTCGGCGG CTCCCTCGTG CTCGCGACGG GCATCCTCAT CGGCGGCGCC TGA
|
Protein sequence | MMSRLSMLTL GGRKRFDDLS EQEILALAIG SEEEDGQIYR AYAGRLRGEY PHSAALFDAM AEAEDEHRRR LIARYKERFG DFIIPIRREH IAGYYSRKPV WLMRNLGLDR VREEAAAMER QARDFYLAAA RRSTDADTRR LLGDLAAAES AHERTAEALA DEHLGGSVRD EEDAAAHRQF ILTWVQPGLA GLMDGSVSTL APIFATAFAT QNPWTTFLVG LSASIGAGIS MGFTEAAHDD GKISGRGSPL KRGLASGVMT ALGGLGHALP YLIPNFWLAT SIAFAVVFFE LWAIVWIQNR YMETPFLRAA FQIVLGGSLV LATGILIGGA
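As a rough illustration of how the protein sequence follows from the gene sequence, the sketch below translates a nucleotide string codon by codon. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGATGAGCC GTCTCTCGAT GCTGACGCTC"))  # MMSRLSMLTL
```

The printed residues match the first ten residues of the protein sequence above; the final four codons of the gene (GGC GGC GCC TGA) likewise give the terminal GGA followed by the stop.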