Gene Mext_3564 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3564 
Symbol 
ID5832935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp3943143 
End bp3944312 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content71% 
IMG OID641369358 
Productglycosyl transferase family protein 
Protein accessionYP_001641015 
Protein GI163852972 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCCG TCCGCGCCCC TGAGCCCGCC TCCAGCCTCA GGTCAGTCGC CCCGGGACCG 
CGCTCGGCGC TGATCGACCA TCCGTCGCCA TCCCCACCGG TGTGCCCGCG CGTAAGCATC
ATCGTGCCCA TCCGCGACGA GGCCGCCGGC CTTGAAGCCT TGGTCGCGGA CCTTCTGTCC
CAGGACTATT CTGGGCTAGC CGAGATCCTG TTCGTCGATG GGGGCTCCCT CGACGGCACA
CGCGAGAGGC TCGGCGCGCT GGCGGTCCGG GATGCCCGCG TCCGGGTCAT CCTCAACGAG
CGGCGGGGCA CGGCGGCCGG GATCAACCTC GCCATGGGCG CAGCCACGGG AGAGGTGGTG
ATGCGTGTCG ACGCGCACGC GCGCTACCGC GCCGACGTCG TCCGGGTCTG CGTGGAGGCG
CTCCTGCGTA CCGGCGCGGG GGGCGTCGGC TCCATCGCCC GCCCGCGCGC CTCCGCGCAG
ACGCTGGTTG CGCGGGCCAT CGTGGCAGCG CATCTCAGCC CGCTCGGGAT CGGGGTTGCC
AAGTTTCGCC GAGCGGGGGC GGAGGGTTGG GCCGCAACCG CGTGGAACGG CTGTTACTGG
CGCCACGTCG TCGACCAAGT CGGGCCCATG CGCGAAGATC TGCCCCGCAA CGAGGACAAC
GACTTTAACG CGCGGGTACG CGCCCTGGGC TACGGTGTGT GGGTGACCTC GGCGGCGCAC
GCCTATTACC GCCCCCGCGA GACGCTTGGT GACCTGTGGC GCCAGTACCG GGGCAACGGG
CAAGGGATCG CGATGACACT GTTCGAGAAT CCGGCGGCCT TAGGCCCACA CCATTTCGCG
CCGCTGATTT TGGTGAGCAC CCTCGCGACG CTCGCCGCCC TCGCCCCCAC CTGGTCGGCG
GCGGCCTGGG CCCTGGCTTC GCTTCTCGCT GTCTACGGCG CCGCGCTTCT CGTAGCGACG
TGGCTCGCCG CCTGGCGTTG TGACGGGGTC GAGGAGCGCG AGAGCGCGTC GTGGCTCACT
CTCCCCGCGC TGCCGGCGGT GCTCGCCACC CTGCACGTGG CCTACGGGTT CGGCACGCTG
GAGGGGCTCC TCGGCCTGGG ACGTGCTCGA GCGCGGCGAC TGTTGCCCGG GCGAGGTGCC
GTGCGCACCC GCGTGGAGGA GAGCCGATGA
 
Protein sequence
MKSVRAPEPA SSLRSVAPGP RSALIDHPSP SPPVCPRVSI IVPIRDEAAG LEALVADLLS 
QDYSGLAEIL FVDGGSLDGT RERLGALAVR DARVRVILNE RRGTAAGINL AMGAATGEVV
MRVDAHARYR ADVVRVCVEA LLRTGAGGVG SIARPRASAQ TLVARAIVAA HLSPLGIGVA
KFRRAGAEGW AATAWNGCYW RHVVDQVGPM REDLPRNEDN DFNARVRALG YGVWVTSAAH
AYYRPRETLG DLWRQYRGNG QGIAMTLFEN PAALGPHHFA PLILVSTLAT LAALAPTWSA
AAWALASLLA VYGAALLVAT WLAAWRCDGV EERESASWLT LPALPAVLAT LHVAYGFGTL
EGLLGLGRAR ARRLLPGRGA VRTRVEESR