Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mchl_4586 |
Symbol | |
ID | 7117982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | + |
Start bp | 4860542 |
End bp | 4861903 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643527285 |
Product | pentapeptide repeat protein |
Protein accession | YP_002423289 |
Protein GI | 218532473 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.20747 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.0812819 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGACAGG GCGATGCGGA GGGCGATGAT CCGCGCGTCG CAGCGCTGCT CGCGCGTCTC TCGGACGACG AGGTGCCGTT CTCGATCGCC GGGCGCGACG ACCTGGCCGG ACTCACCCTG TCGCGCGCCG CGCTGAACGG TCACATCGAT CCGACGTCGC CGCCGCGCTG GTGGATGGAG GGCGGGGGCG GGCTCGACCT CGCCGGCGCC GACCTCGCCG ACGCCCGGCT GGAGATGACC GACTTTTCCG ACGCCAACCT GCGTCGCGCC TCGCTCGCCG GAGCCCTCGC GCGCTCGGCC GGCTTCGCGA ATGCCTGCCT GGAGGAAGCG GACTTTGCCG GCGCCGACCT CAGCGGCGCG CGCTTTACCG GAATTGCCGG CGGGCAGGCC TCCTTCCGCG AGGCGATGCT GGAGGATGCC GACTTCTCCG ACGCCACCAT GCGCTTTGCC CGGCTCGACA AGGCTTTGCT CGACGGCGCC CGCTTCGAGG GCGCCGACCT CTGGGGCACC GACTTCACCG GGGCGGATGC CGACGATTCC GTGTTCCGAA AGGCCCGGCT CGACGAGGCC AACCTCTCCG ACTGCAACCT GACCGGCGCG GACTTCGAGG GGGCGAGCCT GAAGAAGGCG CGGCTCGTCG GCTCGCGGCT GCGCGGCGCC AACTTCTCCG GGGCCCACCT CGACGGGGCG GACCTGTCGG GGGCCGACTT CTCCCGCACC AGCCTCGTGC GGCTCGACCT CACGACGTGC AAGCTGCACC GCGCGCGCTT TGCCGGCGCG TGGCTGGAAG GCGTGCGGCT CTCCGTCGAG CAGATCGGCG GGATGGTCGG CGAGGAGGCG GCGGGCGAGT ACGAGGCGGC GCAGGCGAGC TATCTCGCGC TCGAGCGCAA CCTTCAGAGC ATCGGCAGCC CCGAAGGCGC GAGCTGGGCC TACAAGCGCG GGCGCCGCAT GGGCCGCCGC CATGCCGGCG TGCGGGCCCG CGAGGCCTTT TTCGCCCGCG ATGTGCGGGG AACGCTGAGC TCCGGTTACC GCTGGATCGC CGACCGCTTC GTCGAGTGGC TGTGCGACTA CGGCGAGAGC CTGTCGCGGA TCGCTCGCGC CTTCCTCGTC GGGATCTTCC TGTTCGCCGG GGCCTACGGC GCGACGGGCG GGCTCTTCCA CGAGGGCGAG AACGCGCCGA CCTACAACCC GCTCGATCTC GTGAGCTACA GCGCGCTCAA CATGATGACC GCCAACCCGC CCGAGATCGG GGTGAAGCCG CTGGGCCGTG TCACCAACCT GCTGGTCGGG TTGCAGGGGG CGGCGGGGAT CGTGCTGATG GGGCTGTTCG GCTTCGTCCT CGGCAACCGC CTGCGCCGCT GA
|
Protein sequence | MRQGDAEGDD PRVAALLARL SDDEVPFSIA GRDDLAGLTL SRAALNGHID PTSPPRWWME GGGGLDLAGA DLADARLEMT DFSDANLRRA SLAGALARSA GFANACLEEA DFAGADLSGA RFTGIAGGQA SFREAMLEDA DFSDATMRFA RLDKALLDGA RFEGADLWGT DFTGADADDS VFRKARLDEA NLSDCNLTGA DFEGASLKKA RLVGSRLRGA NFSGAHLDGA DLSGADFSRT SLVRLDLTTC KLHRARFAGA WLEGVRLSVE QIGGMVGEEA AGEYEAAQAS YLALERNLQS IGSPEGASWA YKRGRRMGRR HAGVRAREAF FARDVRGTLS SGYRWIADRF VEWLCDYGES LSRIARAFLV GIFLFAGAYG ATGGLFHEGE NAPTYNPLDL VSYSALNMMT ANPPEIGVKP LGRVTNLLVG LQGAAGIVLM GLFGFVLGNR LRR
|
| |