Gene M446_1513 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_1513 
Symbol 
ID6131850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp1682342 
End bp1684012 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content56% 
IMG OID641641782 
Productglycoside hydrolase family protein 
Protein accessionYP_001768451 
Protein GI170739796 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCAA TGCAGCTCCT CGGTGTCAAT GTCACCAGCG GCGACACCGG CCGTCTGCCC 
GGAGTGGCAG GCGTGGACTA CGTTTACCCG ACTCGGTGGG ATATCGACTA TATATCCTCA
AAAGGAATGA ATGTCATACG ACTGCCTGTT TTCTGGGAAC GCCTTCAACA CGTCCCGTTC
GGCGCCCTGG ACGAGGCGGA GATGGCACAT ATCGATGACC TCGTCAGCTA TGCGACATCC
AAAGGGATTT CCGTCGTCTT CGACCTTCAC AATTTCGGGT TCGGCTATGG CTACCCCGTG
GGCGGGCCCA TCACAACGGA CAGCACCTTG GCTGATTTTT GGGGCAGGAT CGCGAAGCGT
TACGTGTCAA ACTCAGGGAT TATATTTGGA CTTATGAATG AGCCGCAAGC TCAGCCAGCA
TCAGACTGGA TCCGGTCCGT GAATTCAGCG ATTCAGGCAA TCCGGAGCGC TGGTGCGACG
CAGGAGATCC TGGTTCCGGG CGCCTACGGT GACAGCGCAC TGTCCTGGAG TTCTACGGAC
AACGCCACCG TAGTTGGGAC AGGAGTGAAT GACCCATTAC ACAATTTTGC TTTCGAAGTC
CACATGTATT TCGACACGAA CAGCTGGGGC ACGGAACCTG GCGCGATCTC TGCGACGATT
GGGTCCGAGC GCCTCGCGGC CGTCACCGCG TGGGCGGAGG CGAACAACGC GCAGCTGTTC
CTTGGCGAGT TCGGTGTTGG AACAGATCAG ACGAGTCTGG CGGCGCTTGA CAATGCGCTT
TCCTACATGG AGCAGCACGC ACATGTTTGG CAGGGCGGTA CGTACTGGGT CGCCGGTCCG
CAGCTGCCCC ACCCATTTTA CTCCGTCGAG CCGCCGAACC CAGCCGCACA TATGTTGGCC
GAGAATGGCA GCTTCACGTT CATGGTAAAT GACTCGTCAG CGCTGGGCCA GCGCAATATA
CTCACACTGG ATATCTCTGA AGATGAATAT CAGGGCGATG CAGTGTTCAG CGTGTCAGTC
GACGGTGTGC AGATGGGTGG CATTCTAACC GCACACGCGT CTCACTCGAG CTTGCAGAGC
GAGACATTCT CGTTTGTTGG CGATTGGGGT ATCGGGCAGC ACGCGGTGAC CGTCAATTTT
CTGAACGATC TTTATGGTGG TGCGTCGACC GTAGATCGCA ATCTCTACAT TAATTACGCC
AGTTACGATA GCGTTCCCGT GACCGGATAC GATCAGCCGC AAATTGATAT CCTCCAAGAA
CACCGCGAGG AAGAACACCG CGAGGAGGTT CCATTCCCGA CCGTCTCAGC AGCGTCTTAC
ACCGACATCA GGCCTGGTGA TGACACCTTG CTTCTTGAGA TTTCCGAAGA TGCCTGGCTG
GGCAGTGCTC AGTACACGAT CGCCGTTGAT GGCCAGCAGA TCGGTGGCAA AATGACAGCT
ACAGCTTCGC ACGGCACCGG GCAGTCTGAT GTACTGCTAA TAAATGGTGA CTGGAGCGCC
GGCGTCCATA AAGTTAGCAT TGACTTTCTG AATGACAATT ACGGCGGCAC GCCGGCAGCC
GACCGAAATC TGTATCTCAA CGGAGCGACC TACAACGGGA CTGGAATTCC AGGTAGCCGA
CTCACGCTGT TTTCGAGCGG GTCGCAAGAA TTTACTTTCA TGGAGTTGTG A
 
Protein sequence
MAAMQLLGVN VTSGDTGRLP GVAGVDYVYP TRWDIDYISS KGMNVIRLPV FWERLQHVPF 
GALDEAEMAH IDDLVSYATS KGISVVFDLH NFGFGYGYPV GGPITTDSTL ADFWGRIAKR
YVSNSGIIFG LMNEPQAQPA SDWIRSVNSA IQAIRSAGAT QEILVPGAYG DSALSWSSTD
NATVVGTGVN DPLHNFAFEV HMYFDTNSWG TEPGAISATI GSERLAAVTA WAEANNAQLF
LGEFGVGTDQ TSLAALDNAL SYMEQHAHVW QGGTYWVAGP QLPHPFYSVE PPNPAAHMLA
ENGSFTFMVN DSSALGQRNI LTLDISEDEY QGDAVFSVSV DGVQMGGILT AHASHSSLQS
ETFSFVGDWG IGQHAVTVNF LNDLYGGAST VDRNLYINYA SYDSVPVTGY DQPQIDILQE
HREEEHREEV PFPTVSAASY TDIRPGDDTL LLEISEDAWL GSAQYTIAVD GQQIGGKMTA
TASHGTGQSD VLLINGDWSA GVHKVSIDFL NDNYGGTPAA DRNLYLNGAT YNGTGIPGSR
LTLFSSGSQE FTFMEL