Gene M446_1081 details

Gene Information

Locus tag: M446_1081
Symbol:
ID: 6131536
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 1204944
End bp: 1205918
Gene Length: 975 bp
Protein Length: 324 aa
Translation table: 11
GC content: 75%
IMG OID: 641641372
Product: glycosyl transferase family protein
Protein accession: YP_001768044
Protein GI: 170739389
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
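
The coordinate and length fields above are mutually consistent, which the short sketch below checks. It assumes that the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 1204944
end_bp = 1205918
gene_length_bp = 975
protein_length_aa = 324

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not
# contribute a residue to the protein.
codons = span_bp // 3
coding_codons = codons - 1

print(span_bp)        # 975
print(coding_codons)  # 324
```

Under these assumptions the span matches the reported gene length, and the number of coding codons matches the reported protein length.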


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0364187
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.921384
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGGAG ACCGGACCTG TCAGGTGAGC GTGATCATGC CGGCGTTCAA CGCGGCGGCG 
ACGATCGAGC GCGCGCTCGC CAGCGTCCTC GCCCAGAGCC ATGCCGCCCT CGAGGTGATC
GTGGTCGATG ACGGCTCGAC CGACGCGACG CGGACCCTCG TGGGCTCGGC CGCCGCGCGG
GATCGCCGCG TGCGGCTGCT CGCGAGCGAT CGGAACGGCG GGCCTGCGGC GGCCCGCAAC
ACGGCCCTGT CGGCCGCGAC CGGCGAGTGG CTCGCCCTCC TGGACGCCGA CGACGCGTGG
CGCCCGGAGC GGCTGGAGCG GCTGCTCGCC GTCGCCGGCG ACCGGGCCGA GGTGGTGTTC
GACAACCTCC TCGGCCTCGA TCCCCACAGC GGCGCCGAGG TCGGCCCGCT CTTCCCGCGG
CTGCCCGACG CGATCGGCGT CTCGGAGATG GCCGCGGCGG CGGTTCCGGG CAGCCGCTTC
GATTACGGCT ACCTCAAGCC GATCTTCCGG CGCGCCCTGG TCGCGCGGGC CGGGCTGCGC
TTCGACGAGG GGCTCAGGAC GAGCGAGGAT CTCCTGTTCT TCCTCGCGCT GCTCGTCGAG
GGCGGGTCCG CGCGGACGAC CTCGGAGGCG TTCTACGTCT ACACGCTGCC GGTGAGTTCG
GGCGGACGCA TCTCGCCCTA CTCGCATAGC CAGCCCCGCG ACCTCGCCGT CGCCGAGGCG
CTGGCGGCGC TGCGGCGGGC CTGCGCGGCT CGGCTGTCGC CCGCGGAGAG CGCGCAGCTC
GAGAGCAGGA TCGCGCATCT GCGGCGCGTC GCCCCCGTGA GCGAGTTCCA CTTCGCGCGG
CGCACCGGCG ACCTGCGCAG GATCATCGGC CTGCTCGCCC GCTCGTCGGC CGTGCGGCGG
GAACTCGCGC GCGGGATCCT GCGCCGCGTG CGGGGGCGCC CCGCGGCGGC CGAGCGGGCC
GGGCCGGATC GGTGA
 
Protein sequence
MMGDRTCQVS VIMPAFNAAA TIERALASVL AQSHAALEVI VVDDGSTDAT RTLVGSAAAR 
DRRVRLLASD RNGGPAAARN TALSAATGEW LALLDADDAW RPERLERLLA VAGDRAEVVF
DNLLGLDPHS GAEVGPLFPR LPDAIGVSEM AAAAVPGSRF DYGYLKPIFR RALVARAGLR
FDEGLRTSED LLFFLALLVE GGSARTTSEA FYVYTLPVSS GGRISPYSHS QPRDLAVAEA
LAALRRACAA RLSPAESAQL ESRIAHLRRV APVSEFHFAR RTGDLRRIIG LLARSSAVRR
ELARGILRRV RGRPAAAERA GPDR
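
As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below translates the first and last blocks of the gene sequence codon by codon. It is a minimal example, not the pipeline that produced this record: the names `CODON_TABLE` and `translate` are hypothetical, and the table encodes the standard amino-acid assignments, which translation table 11 shares. Table 11 also allows alternative start codons to be read as methionine; that is not handled here, which does not matter for this gene because it starts with ATG.

```python
# Standard amino-acid assignments (shared by translation table 11),
# listed in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    dna = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks and the final line of the gene sequence above.
first_blocks = "ATGATGGGAG ACCGGACCTG TCAGGTGAGC"
last_line = "GGGCCGGATC GGTGA"

print(translate(first_blocks))  # MMGDRTCQVS
print(translate(last_line))     # GPDR
```

The two outputs match the first ten and last four residues of the protein sequence. The final line can be translated on its own because the 960 bases before it are a whole number of codons.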