Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_1513 |
Symbol | |
ID | 6131850 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 1682342 |
End bp | 1684012 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641641782 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001768451 |
Protein GI | 170739796 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGCAA TGCAGCTCCT CGGTGTCAAT GTCACCAGCG GCGACACCGG CCGTCTGCCC GGAGTGGCAG GCGTGGACTA CGTTTACCCG ACTCGGTGGG ATATCGACTA TATATCCTCA AAAGGAATGA ATGTCATACG ACTGCCTGTT TTCTGGGAAC GCCTTCAACA CGTCCCGTTC GGCGCCCTGG ACGAGGCGGA GATGGCACAT ATCGATGACC TCGTCAGCTA TGCGACATCC AAAGGGATTT CCGTCGTCTT CGACCTTCAC AATTTCGGGT TCGGCTATGG CTACCCCGTG GGCGGGCCCA TCACAACGGA CAGCACCTTG GCTGATTTTT GGGGCAGGAT CGCGAAGCGT TACGTGTCAA ACTCAGGGAT TATATTTGGA CTTATGAATG AGCCGCAAGC TCAGCCAGCA TCAGACTGGA TCCGGTCCGT GAATTCAGCG ATTCAGGCAA TCCGGAGCGC TGGTGCGACG CAGGAGATCC TGGTTCCGGG CGCCTACGGT GACAGCGCAC TGTCCTGGAG TTCTACGGAC AACGCCACCG TAGTTGGGAC AGGAGTGAAT GACCCATTAC ACAATTTTGC TTTCGAAGTC CACATGTATT TCGACACGAA CAGCTGGGGC ACGGAACCTG GCGCGATCTC TGCGACGATT GGGTCCGAGC GCCTCGCGGC CGTCACCGCG TGGGCGGAGG CGAACAACGC GCAGCTGTTC CTTGGCGAGT TCGGTGTTGG AACAGATCAG ACGAGTCTGG CGGCGCTTGA CAATGCGCTT TCCTACATGG AGCAGCACGC ACATGTTTGG CAGGGCGGTA CGTACTGGGT CGCCGGTCCG CAGCTGCCCC ACCCATTTTA CTCCGTCGAG CCGCCGAACC CAGCCGCACA TATGTTGGCC GAGAATGGCA GCTTCACGTT CATGGTAAAT GACTCGTCAG CGCTGGGCCA GCGCAATATA CTCACACTGG ATATCTCTGA AGATGAATAT CAGGGCGATG CAGTGTTCAG CGTGTCAGTC GACGGTGTGC AGATGGGTGG CATTCTAACC GCACACGCGT CTCACTCGAG CTTGCAGAGC GAGACATTCT CGTTTGTTGG CGATTGGGGT ATCGGGCAGC ACGCGGTGAC CGTCAATTTT CTGAACGATC TTTATGGTGG TGCGTCGACC GTAGATCGCA ATCTCTACAT TAATTACGCC AGTTACGATA GCGTTCCCGT GACCGGATAC GATCAGCCGC AAATTGATAT CCTCCAAGAA CACCGCGAGG AAGAACACCG CGAGGAGGTT CCATTCCCGA CCGTCTCAGC AGCGTCTTAC ACCGACATCA GGCCTGGTGA TGACACCTTG CTTCTTGAGA TTTCCGAAGA TGCCTGGCTG GGCAGTGCTC AGTACACGAT CGCCGTTGAT GGCCAGCAGA TCGGTGGCAA AATGACAGCT ACAGCTTCGC ACGGCACCGG GCAGTCTGAT GTACTGCTAA TAAATGGTGA CTGGAGCGCC GGCGTCCATA AAGTTAGCAT TGACTTTCTG AATGACAATT ACGGCGGCAC GCCGGCAGCC GACCGAAATC TGTATCTCAA CGGAGCGACC TACAACGGGA CTGGAATTCC AGGTAGCCGA CTCACGCTGT TTTCGAGCGG GTCGCAAGAA TTTACTTTCA TGGAGTTGTG A
|
Protein sequence | MAAMQLLGVN VTSGDTGRLP GVAGVDYVYP TRWDIDYISS KGMNVIRLPV FWERLQHVPF GALDEAEMAH IDDLVSYATS KGISVVFDLH NFGFGYGYPV GGPITTDSTL ADFWGRIAKR YVSNSGIIFG LMNEPQAQPA SDWIRSVNSA IQAIRSAGAT QEILVPGAYG DSALSWSSTD NATVVGTGVN DPLHNFAFEV HMYFDTNSWG TEPGAISATI GSERLAAVTA WAEANNAQLF LGEFGVGTDQ TSLAALDNAL SYMEQHAHVW QGGTYWVAGP QLPHPFYSVE PPNPAAHMLA ENGSFTFMVN DSSALGQRNI LTLDISEDEY QGDAVFSVSV DGVQMGGILT AHASHSSLQS ETFSFVGDWG IGQHAVTVNF LNDLYGGAST VDRNLYINYA SYDSVPVTGY DQPQIDILQE HREEEHREEV PFPTVSAASY TDIRPGDDTL LLEISEDAWL GSAQYTIAVD GQQIGGKMTA TASHGTGQSD VLLINGDWSA GVHKVSIDFL NDNYGGTPAA DRNLYLNGAT YNGTGIPGSR LTLFSSGSQE FTFMEL
|
| |