Gene Msil_3897 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3897 
Symbol 
ID7092594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp4273948 
End bp4275237 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content65% 
IMG OID643467182 
ProductGlycosyltransferase 28 domain protein 
Protein accessionYP_002364140 
Protein GI217979993 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATATA ATTTTCTTTT GGCCTGTTGG GGCGTCGCGG GCAATCTCGG CCCTATGCTG 
ACCGCCGGCC GTCAATTGCG CCGTAGCGGT CACACGGTCC GCCTTTTAGC CGATTCGGCC
CTGCGCGAAG AGATTGAGGC GGCCGGGTTT GGCTTCACGG CGTGGCGACG GGCGCCGAAC
TATTCGGACT TCGAACCCTT GTTGGTTGCG CTTGACCCTA CGGATTTGGG CAGCTTTAGC
GAACATATCC TGTTTGGCCC CGCCGCTGCT TGCGCGGCCG ACACGCGGGA AGAACTCAAC
GCCGCGCCAA CCGACGCCCT TCTTGCTCAC GACATGCTGC TCGGCTCGGC AATCGCCGCG
GAAGCCGCGG GCGTCCCCTG CGCTGTGCTT TCACCACATA TCAGCGTGAG GCCCTTGCCG
GGCGTTCCGC ATGTCGGCAG CGGCTTGACG CCGCCGCGCA GCTTCGAAGA GCGCGCGGAC
GTCGAAGCCG CGAACAGACG CTTCGGGGAC GCTCTGAATG AGCGGCTTTA TCTCCTGAAC
GAAGCGCGCG AAGGGCAGGG CCTCGCTCCG TTGAACCACG TGTTCGATCA ATATGACCGG
CCCGACCGGT TCCTGCTGGC GATAAGTTCA GCATTCGATT TCCCGGCTGA CGACCTCCCC
GATAACGTCC GATACATAGG GCCGTTGCTC GACCCGCCCG GCTGGTCGAA GCCCTGGAGG
GCGCCCTGGC CGGCACAATC AGATCGGCCT CGCGCCCTGG TGTCGTTTAG CACCACCTTC
CAGGACCAGG CTGACGCGCT TCAGCGTGTC GTGAACGCGC TGGGCAGGGT CGAAATCGAC
GCCGTCGTAA CGACAGGTCC CGCATTGGTC GGCAGCGCCT TGCACGCGCC GAAGAATGTG
ACGCTGCTCC ATAGCGCTCC ACACGATGCG GTGATGAAGG AAGTGTCTCT GGTGGTGACG
CATGGCGGGC ACGGGACGGT GAGCCGGGCG CTGCTTCACC GCCTGCCGCT GCTGATCATG
CCGATGGGCC GCGACCAGGA CGACAACGCA TTGCGGGCGG AAGCGCGCGG CGTCGGCCTG
ACTTTGCCGC CGACCGCCTC CGAAGCGGAG ATCGCGCGCG CCCTAAATCG CCTGCTCACC
GAGCCCCATT TCCGAATCGC GGCGCACCGG CTCGGCGCAG CGATCGCCGC CGAACTCGAT
TCAGCCGGGC TCGTCGGGGA GATGGAGGAG ATTGTCGCGT TCCGGCGCGC GGACCATCGC
CCGGCGCGCA AGCGCCTGCT TCGAAACTGA
 
Protein sequence
MPYNFLLACW GVAGNLGPML TAGRQLRRSG HTVRLLADSA LREEIEAAGF GFTAWRRAPN 
YSDFEPLLVA LDPTDLGSFS EHILFGPAAA CAADTREELN AAPTDALLAH DMLLGSAIAA
EAAGVPCAVL SPHISVRPLP GVPHVGSGLT PPRSFEERAD VEAANRRFGD ALNERLYLLN
EAREGQGLAP LNHVFDQYDR PDRFLLAISS AFDFPADDLP DNVRYIGPLL DPPGWSKPWR
APWPAQSDRP RALVSFSTTF QDQADALQRV VNALGRVEID AVVTTGPALV GSALHAPKNV
TLLHSAPHDA VMKEVSLVVT HGGHGTVSRA LLHRLPLLIM PMGRDQDDNA LRAEARGVGL
TLPPTASEAE IARALNRLLT EPHFRIAAHR LGAAIAAELD SAGLVGEMEE IVAFRRADHR
PARKRLLRN