Gene Information |
Locus tag | Mnod_5623 |
Symbol | |
ID | 7305716 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium nodulans ORS 2060 |
Kingdom | Bacteria |
Replicon accession | NC_011894 |
Strand | - |
Start bp | 5751417 |
End bp | 5752628 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643603252 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002500767 |
Protein GI | 220925465 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
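The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 5751417
end_bp = 5752628
gene_length_bp = 1212
protein_length_aa = 403

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is one continuous reading frame ending in one stop codon,
# so the protein has one residue fewer than the number of codons.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span_bp, codons, expected_protein_length)  # 1212 404 403
```

Under these assumptions the listed Gene Length (1212 bp) and Protein Length (403 aa) agree with the coordinates.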
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGTGC TCGAACTCGC CGCCGCTCTG ATCCTCATCC TTCTCAACGG CGTGTTCTCG CTGTCCGAGC TCGCCGTGGT CTCGGCCCGA AAGGCGCGGC TGCGGGTCAT GGCCGAGCAG CGCCGGGCCG GCGCGCGCGC AGCCCTCGCC CTCGCGGAGG AGCCGGGCCG GTTCCTCTCC ACCGTCCAGA TCGGCATCAC GCTGATCGGC GTCTTGGCTG GCGCCTTCTC GGGCGCCGCT CTCGGCCAGC GGGCGGCCGA GCTGCTGCAG GATTTCGGCC TCGCGCAGGG CCTCGCCCAG ACCATCGGCT ACGGGCTGGT GATCGGCGCC ATCACCTACC TGTCGGTGGT GGTCGGCGAA CTCGTGCCGA AGACCCTCGC GCTGCGTGCC CCCGAGCGGA TCGCCTGCAT GGTCGCGCGC CCGACGAGCG CGGTCTCACG CGCGGCCGGC CCGGTGGTCT GGTTCCTCGA TGCCTCCACC CGCCGGATCT TCCGGCTGTT CGGCATCGAC GCGCGGGCGG ACGAGGCGGT CACCGCGGAC GAGATCCGCG CCGTCGTGGC GGAAGCCGAG ACCGCCGGCG CCATCGAGAC CGACGAGCGG CACATGATCG GCGGCGTGCT GCGCCTCGGC GACCGGACGG TACGGGGCGT GATGACGCCC CGCACGGACG TGACGTGGCT CGACCTCGGC GACACGGAGG AGGCGATCCG CGCGGCGCTC CTGGCGACGC CCCATGCCCG CCTGCCGGTG GGCGAGGGCG GCCCGGACGA GATCATCGGC GTGGTCCAGC TGCGCGACCT GCTGCCGGAC CTGCTGCGGG GGCGCCCGCT CGACATCCGG GCGCATGTCC GGCCGGCCCC CGTGGTGCCG GACCGCCTGG GCGCGCTCGA CGCCCTCGCC GTGCTGCGGA AGGCCGAGGT GCCCATCGGG CTGGTCCATG ACGAGTACGG GCATTTCGAC GGCGTGATCA CCCCGGCCGA CATTCTGGAT GCCATCGCGG GGGCCTTCCG GGCCGATCTC TCCGATTCGG ATGAGGCGGT GCGGCGCGAG GACGGCTCCT GGCTCCTGTC CGGCTGGATG CCCGTCGACG AGATGGCCGA CCAGCTGCGG GTGCCGCTGC CGGACCGGCG CGACTACGAG ACGGTGGCCG GCCTAGAGCC GCTCCCGATC AGGTTGAAGC GTAAGCATCA TCCTCGTATC CAGCAGCTGT GA
|
Protein sequence | MPVLELAAAL ILILLNGVFS LSELAVVSAR KARLRVMAEQ RRAGARAALA LAEEPGRFLS TVQIGITLIG VLAGAFSGAA LGQRAAELLQ DFGLAQGLAQ TIGYGLVIGA ITYLSVVVGE LVPKTLALRA PERIACMVAR PTSAVSRAAG PVVWFLDAST RRIFRLFGID ARADEAVTAD EIRAVVAEAE TAGAIETDER HMIGGVLRLG DRTVRGVMTP RTDVTWLDLG DTEEAIRAAL LATPHARLPV GEGGPDEIIG VVQLRDLLPD LLRGRPLDIR AHVRPAPVVP DRLGALDALA VLRKAEVPIG LVHDEYGHFD GVITPADILD AIAGAFRADL SDSDEAVRRE DGSWLLSGWM PVDEMADQLR VPLPDRRDYE TVAGLEPLPI RLKRKHHPRI QQL
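The gene sequence above is printed in space-separated blocks of ten bases. As a minimal sketch, the Python below shows one way to strip that formatting, estimate GC content, and translate a sequence for comparison with the GC content and Protein sequence fields. The codon table is the standard amino acid assignment, which translation table 11 shares for sense codons. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only the first and last blocks of the record are used here, to keep the example short.

```python
from itertools import product

# Standard amino acid assignments in NCBI base order (T, C, A, G).
# Translation table 11 uses the same assignments for sense codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the whitespace between blocks and upper-case the bases."""
    return "".join(seq.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)

def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# First three blocks and the last two blocks of the gene sequence in the record.
prefix = clean("ATGCCCGTGC TCGAACTCGC CGCCGCTCTG")
suffix = clean("CAGCAGCTGT GA")

print(translate(prefix))  # MPVLELAAAL
print(translate(suffix))  # QQL
print(round(gc_fraction(prefix), 2))
```

Passing the full gene sequence through `clean` and then `translate` or `gc_fraction` would allow the same comparison against the complete Protein sequence and the reported GC content.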