Gene M446_4034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_4034 
Symbol 
ID6132842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp4500409 
End bp4501533 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content72% 
IMG OID641644191 
Productglycosyl transferase group 1 
Protein accessionYP_001770831 
Protein GI170742176 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCCATG ATCTGTCGCA GCGCTTGGCG GCCCGCGGCC ACGACGTCAC CGTCCTGACC 
AGTTCTCCCG CGTCCGAAAC GCGGATCGAA GCCGATGGCC CGGTCCGGCG CGTCCTGCTT
CGCCGCAGAA CCGGGCCGGC CGCGCTGACG GGCCGCTGGT TCAACAGCCA GCATCTGTTC
GGCTGGGACT TGGCCAGGTG GCTCCGTGCG GAGCCGTTCG ACGCCGTGCA CTGCTTGAAC
TACCACGATG CGGTCGGCGC TCTGATCGCG CGCCGGGCGG GTGCACGGTT CCGCCTCGTC
TTTCAGTGTA CGGGCATTCC GGTGCGGCGC TACTTCCGGC GCATTCCCGC CGACGGCCTG
ATGTTCCGCA TGGTGCTGCG GCAGGCGGAT GCCGTGGCGG TCCTGTCCCG CTTCGCTCAG
GACGCGCTCG CGCGGGATTA CGGGGTCGCC GGGACGTTGC TTGCGTCCCC CACCGAGACC
GCTCCCTTCG AGGCGCTGCC GGACGACGCT CCGCGCGAAC CCTACATCCT GTTCAGCGGG
GATGCCGACG AGCCGCGCAA AGGCGCACTC CTCCTCGCCC AGGCGTTCCC GGCCGTGGCC
GAGCGGCTGC CGGCTCTCCG GCTCGTCTAC ACGGGACGAT CGAGCCCGGC CACCCGCGCG
GCTTTGTCCG CTGCCGTTCC GGGCAACCTC CGCGATCGAG TCGAATTTCT CGGTCTCGGC
CGCGTCGAGG ACCTGCCGCA CCTCTACGCA CGCGCGACGG TCTGCGTGAA CCCGGCCGTC
TGGGAGGCGC TGGGCAATGT CCTGATCGAA GCCCTGGCGG CCGGAACCCC GGTGGTCGGC
GCGCGGCACG CCGGCATCCC GGACATCGTC GCGGACGAGA CGGTGGGGGC TCTGTTCGAT
CCGGGCTCGA CGCGGCTGGC CGCCACGAAC GCGGCCGGAC TGAGCGAGGC CATCCTGAGG
GCTGCGGCCC TGGCCGCGCG GCCCGAGACC CGCGCGCGGT GCCGCGCGCG GGCGCAGGCC
TTCTCCTGGA ACGCCCTGAT CCCCCGCTAC GAGGGCCTGC TCGGCGGCGA CGCCCCGCCG
CGCGAGATCG GCCCTCCCCT GCCCGCCGCC ATCCCGTTGC GATGA
 
Protein sequence
MLHDLSQRLA ARGHDVTVLT SSPASETRIE ADGPVRRVLL RRRTGPAALT GRWFNSQHLF 
GWDLARWLRA EPFDAVHCLN YHDAVGALIA RRAGARFRLV FQCTGIPVRR YFRRIPADGL
MFRMVLRQAD AVAVLSRFAQ DALARDYGVA GTLLASPTET APFEALPDDA PREPYILFSG
DADEPRKGAL LLAQAFPAVA ERLPALRLVY TGRSSPATRA ALSAAVPGNL RDRVEFLGLG
RVEDLPHLYA RATVCVNPAV WEALGNVLIE ALAAGTPVVG ARHAGIPDIV ADETVGALFD
PGSTRLAATN AAGLSEAILR AAALAARPET RARCRARAQA FSWNALIPRY EGLLGGDAPP
REIGPPLPAA IPLR