Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_3897 |
Symbol | |
ID | 7092594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 4273948 |
End bp | 4275237 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643467182 |
Product | Glycosyltransferase 28 domain protein |
Protein accession | YP_002364140 |
Protein GI | 217979993 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCATATA ATTTTCTTTT GGCCTGTTGG GGCGTCGCGG GCAATCTCGG CCCTATGCTG ACCGCCGGCC GTCAATTGCG CCGTAGCGGT CACACGGTCC GCCTTTTAGC CGATTCGGCC CTGCGCGAAG AGATTGAGGC GGCCGGGTTT GGCTTCACGG CGTGGCGACG GGCGCCGAAC TATTCGGACT TCGAACCCTT GTTGGTTGCG CTTGACCCTA CGGATTTGGG CAGCTTTAGC GAACATATCC TGTTTGGCCC CGCCGCTGCT TGCGCGGCCG ACACGCGGGA AGAACTCAAC GCCGCGCCAA CCGACGCCCT TCTTGCTCAC GACATGCTGC TCGGCTCGGC AATCGCCGCG GAAGCCGCGG GCGTCCCCTG CGCTGTGCTT TCACCACATA TCAGCGTGAG GCCCTTGCCG GGCGTTCCGC ATGTCGGCAG CGGCTTGACG CCGCCGCGCA GCTTCGAAGA GCGCGCGGAC GTCGAAGCCG CGAACAGACG CTTCGGGGAC GCTCTGAATG AGCGGCTTTA TCTCCTGAAC GAAGCGCGCG AAGGGCAGGG CCTCGCTCCG TTGAACCACG TGTTCGATCA ATATGACCGG CCCGACCGGT TCCTGCTGGC GATAAGTTCA GCATTCGATT TCCCGGCTGA CGACCTCCCC GATAACGTCC GATACATAGG GCCGTTGCTC GACCCGCCCG GCTGGTCGAA GCCCTGGAGG GCGCCCTGGC CGGCACAATC AGATCGGCCT CGCGCCCTGG TGTCGTTTAG CACCACCTTC CAGGACCAGG CTGACGCGCT TCAGCGTGTC GTGAACGCGC TGGGCAGGGT CGAAATCGAC GCCGTCGTAA CGACAGGTCC CGCATTGGTC GGCAGCGCCT TGCACGCGCC GAAGAATGTG ACGCTGCTCC ATAGCGCTCC ACACGATGCG GTGATGAAGG AAGTGTCTCT GGTGGTGACG CATGGCGGGC ACGGGACGGT GAGCCGGGCG CTGCTTCACC GCCTGCCGCT GCTGATCATG CCGATGGGCC GCGACCAGGA CGACAACGCA TTGCGGGCGG AAGCGCGCGG CGTCGGCCTG ACTTTGCCGC CGACCGCCTC CGAAGCGGAG ATCGCGCGCG CCCTAAATCG CCTGCTCACC GAGCCCCATT TCCGAATCGC GGCGCACCGG CTCGGCGCAG CGATCGCCGC CGAACTCGAT TCAGCCGGGC TCGTCGGGGA GATGGAGGAG ATTGTCGCGT TCCGGCGCGC GGACCATCGC CCGGCGCGCA AGCGCCTGCT TCGAAACTGA
|
Protein sequence | MPYNFLLACW GVAGNLGPML TAGRQLRRSG HTVRLLADSA LREEIEAAGF GFTAWRRAPN YSDFEPLLVA LDPTDLGSFS EHILFGPAAA CAADTREELN AAPTDALLAH DMLLGSAIAA EAAGVPCAVL SPHISVRPLP GVPHVGSGLT PPRSFEERAD VEAANRRFGD ALNERLYLLN EAREGQGLAP LNHVFDQYDR PDRFLLAISS AFDFPADDLP DNVRYIGPLL DPPGWSKPWR APWPAQSDRP RALVSFSTTF QDQADALQRV VNALGRVEID AVVTTGPALV GSALHAPKNV TLLHSAPHDA VMKEVSLVVT HGGHGTVSRA LLHRLPLLIM PMGRDQDDNA LRAEARGVGL TLPPTASEAE IARALNRLLT EPHFRIAAHR LGAAIAAELD SAGLVGEMEE IVAFRRADHR PARKRLLRN
|
| |