Gene Information |
Locus tag | M446_0691 |
Symbol | |
ID | 6133352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 800986 |
End bp | 803361 |
Gene Length | 2376 bp |
Protein Length | 791 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | 641641010 |
Product | hypothetical protein |
Protein accession | YP_001767685 |
Protein GI | 170739030 |
COG category | |
COG ID | |
TIGRFAM ID | |
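
The length fields above can be cross-checked from the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the gene sequence in this record does end in TGA). The function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 800986
END_BP = 803361


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(n_bases: int) -> int:
    """Length in aa, assuming a complete CDS whose last codon is a stop."""
    codons, remainder = divmod(n_bases, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode an amino acid


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, "bp;", protein_length(n), "aa")
```

Under these assumptions the coordinates reproduce the listed gene length of 2376 bp and protein length of 791 aa.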
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.140408 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.176314 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCACCG CCCCGTCCTC CCTCGCCCCG CGGGTTCCGA GCGTCCCGGT CGCCGAGCTG CGGTTCGCCG CGGCCCTCGC GGCGCTGGCC CTCTCGGCCT TCCTGCTGTT CTCGGCACCG ATCCTCCTCG CCCGCGCGGC GCTGCCCGGG CTCGGCGGCT CCCCCGCCGT CTGGTCCGTG GCCCTGGTCC TGCTGCAGGC GGCCCTGCTC GGCGGCCAGC TCTACGGGCT CGCCGCCACC CGCTGGCTCG CGGGGCGCCG CGCCCTCCTG CTGCACCTCG CCCTGATGGG GGCGGCGCTC CTCGTCCTGC CCCCGCATCG GCCGGCGGCC CTCGCGCGGC TGCCCGCCGA GGGCGAGGCG CCCTGGCTTC TCGGCCTCCT TGCCGTCGCG GCCGGACCGC CTCTCCTCGC CCTGGCGGCG AACGGCCTGC TCCTGCAGGC GTGGCTCTCC GGCGCGGATC ACCCGCGCGC CCGCGACCCC GCCCTCCTCG CCGCGGCCTC GCATGGCGGC GCCTTCGCGG CGCTCCTCGC CTATCCGGTG CTGATCGCGC CGCTCGTCTC CCTGCGCGCC CAGGAATGGG CCTGGACGGC CGGGTTCTGC CTCCTCGCCC CGCTGATCGC GGCCGCCGCC TGGAGCGCCG CCGGGACGCC GCCCGCGCGG GCGCCGGCCC CTCCCCCGGC CCCCGCCAAG CGCCCGGTCC CGCTGACCGG CGCGGCGCTG GCGGGCTTCG TCGCCCTGGC CTTCGTGCCG GCCGCCTTCC TGGTCGCCGT CACGACCCGC CTCTGCGCGG AGCTCGCGAC CGCGCCGCTG GTCTTCCTGC CGCCGCTCGC CCTGCACCTC CTGACCTTCG TGACCGCCTT CCGGGACAGC GCCGTGCGGG TCGCGCGCTG GCTCGCGCCG CTGCAGGTCG GGGGCACCGC GCTGGCGATC CTCGGCCTCG TGGTCGATCT CGGGCTCGCG GCGAACCTCG TCCTCGGCCT CGGCCTCGTC CTGGTGAATG CCAGCCTCTG CCACACGACG CTCTACCGCG CGCGGCCCGA GACCGCCCAC CTGACGATCT ACGCCGTCTG CATCGCGCTC GGCGGCCTCC TCGGCAGCGC CGCCGGCGCG CTGCTCGCGC CCGCCCTGTT CCCGGCCGGC CAGGAATACC CGCTCCTCCT CGGCGCGGCC CTGCTCGGCC GGCCCGGCCT CCTCGACGGC GCCCGGGCCG CCCGCGCCAG GGGCTTCGGG GAGGCCGCCC TCGTCTGCGC CCTGCTGGCG CTCCTCATCC TGGCGGGCTG CGCGGTGCTC GGCCTCGCCG CGCCGCGCCT GCTCATCGGC CTCGGCCTGT GCGGGCTCCT CGCCCTGTCC TGGGCGAGCC CGCGCCTCGC GGCCCTCTCC GGGATCGGGG CGCTCCTCGC GCTGGCGGGC CCGGCGCCCG CCCCGCCCGG GGCCGCGGGG GAGAGCCTGC GCTCCTTCTC CGGCATCCAC CGCATCGCCG CGAGCCCGGA CGGCGGCACC CGGCTGCTCT GGCACGAATC CCGCCCGCAG GGCGCGATGC GGCTGCGCCG GGAGGACGGC ACGCCCGCGA CCGGCGAGCC GGCGCCGCTC CTCGCCTACG CGCCCGACGG GCCGGTCGGG GCCGCGATCC GGGGGATCCG CGCCGCCCGG GGCGGCGATC TCGGCGACGT GGTGGTGGTG GGCCTCGGCG CCGGCAGCCT CGCCTGCGCG GCGCGTCCCC GGGAGGCCTG GACCTTCCTG GAGAGCGATC CGGTGCTCCT GCGCATCGCC CGCGACCCGG CCCGGTTCCG GTTCCTGGCC 
GCCTGCGCGC CGACCATGCC GGTCGTCGTC GGGGATCCGC GCCTGAGCCT CGCCGACCGG CCCGCGGATC TCGGCCTGAT CCTGATCGAC ACCGCCGCCT CCGGCCCGTT CCCGATCCAT CTCCTGACCC GCGAGGCCCT GCGCCTCGCC GTCTCGAAAC TCGACCGGAC CGGCGTGCTG CTGATCCACC TCGCCCACCC GCACCTCGAT CCCGGCGCGG TCCTGGCCGG GATCGGGGCC GAGTTCGGGC TGAGCGCCTG GGCGCTCGGC GCGCCCGGCG CGGCCTCGAC GCTGCTCGCG CTGGTGCGCG ATCCGGCCCA TCTCGGCCCA TGGGGCGGTC GCGCCCGGCC CGGCGGCGCC TTGCTCCGCG ACGCCCCGCT CCGCGACACC TTGCAAGGCG ACGCCCTGCT CCGCGACGCC TCGCCCCGTG GCACCTGGGC CGGCAATCCC TGGCCCGGCC CCCCGGAGGC CGGCGGCCCC GGGTGGCGGC GCGTGCCGGC CGATCCCGGC CGCCGGCCCT GGACGGACGA CCACGCCGAC CTCCTCGGGG CGCTGCGCGC CCGCCCGGGC GGGTGA
|
Protein sequence | MPTAPSSLAP RVPSVPVAEL RFAAALAALA LSAFLLFSAP ILLARAALPG LGGSPAVWSV ALVLLQAALL GGQLYGLAAT RWLAGRRALL LHLALMGAAL LVLPPHRPAA LARLPAEGEA PWLLGLLAVA AGPPLLALAA NGLLLQAWLS GADHPRARDP ALLAAASHGG AFAALLAYPV LIAPLVSLRA QEWAWTAGFC LLAPLIAAAA WSAAGTPPAR APAPPPAPAK RPVPLTGAAL AGFVALAFVP AAFLVAVTTR LCAELATAPL VFLPPLALHL LTFVTAFRDS AVRVARWLAP LQVGGTALAI LGLVVDLGLA ANLVLGLGLV LVNASLCHTT LYRARPETAH LTIYAVCIAL GGLLGSAAGA LLAPALFPAG QEYPLLLGAA LLGRPGLLDG ARAARARGFG EAALVCALLA LLILAGCAVL GLAAPRLLIG LGLCGLLALS WASPRLAALS GIGALLALAG PAPAPPGAAG ESLRSFSGIH RIAASPDGGT RLLWHESRPQ GAMRLRREDG TPATGEPAPL LAYAPDGPVG AAIRGIRAAR GGDLGDVVVV GLGAGSLACA ARPREAWTFL ESDPVLLRIA RDPARFRFLA ACAPTMPVVV GDPRLSLADR PADLGLILID TAASGPFPIH LLTREALRLA VSKLDRTGVL LIHLAHPHLD PGAVLAGIGA EFGLSAWALG APGAASTLLA LVRDPAHLGP WGGRARPGGA LLRDAPLRDT LQGDALLRDA SPRGTWAGNP WPGPPEAGGP GWRRVPADPG RRPWTDDHAD LLGALRARPG G
|
| |
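
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, where GTG can act as an alternative start codon and is then read as methionine. The sketch below illustrates this on the first ten codons of the gene. It is a hand-rolled illustration, not the database's own pipeline: the `translate` function is hypothetical, and the `START_CODONS` set is an assumed subset of the start codons allowed by table 11.

```python
from itertools import product

# Standard codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading an initial start codon as M."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    print(translate("GTGCCCACCG CCCCGTCCTC CCTCGCCCCG"))
```

For this prefix the result matches the first ten residues of the listed protein sequence, MPTAPSSLAP.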