Gene Mext_4212 details


Gene Information

Locus tag: Mext_4212
Symbol:
ID: 5831925
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 4686889
End bp: 4688049
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 76%
IMG OID: 641370003
Product: glycosyl transferase family protein
Protein accession: YP_001641652
Protein GI: 163853609
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
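
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4686889, 4688049)
print(length, protein_length_aa(length))  # prints "1161 386"
```

Both results agree with the Gene Length and Protein Length fields in the record.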


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.787376
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.316316
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCGCCCG TTTCCCCCAC GCCGTCGCGC CTCGACACCC TCATCGCCAT CCCCGTGCGC 
AACGAGGCCG AGCGGATCGC CCGCTGCCTG ACGGCGATCG ACCGGCAGAC CGGCCTCGCG
CCGGGGCGGC TCGGGCTCGT GCTGTTCCTC AACAACTGCA CCGACGACAC GGCGGAGATC
GTCGCCCGTC TCGTGCCGGC GCTCTCGATT CCCGTCCGGG TGATCGAGCG CGTTCATGCC
GGGGCGCATG CGGGCTGGGC GCGCCGCGCG GCGATGGACG CGGCGGTCGC GTGGCTCGAA
GCGGAGGGGA CGACCTCCGC GACGGCGACG CTCCTGACCA CCGATGCCGA CAGCATCGTG
CCGCCGGATT GGGTCGCGGC CAACCTCGCC GCCCTGGAGG CGGGCGCCGA CGCGGTCGCC
GGCCGGGTCG AGTTGATCCC GGAGGAGGCG GCCCTGCTGC CGCCCTCGCT GCCCGCCCGC
GGCCGGCTGG AGGACACCTA CGACGCGCTC ATCACCGAGA TGGAGGCGCG CATCGATCCC
GATCCGCACG ATCCCTGGCC CTGCCACCGC ACCACGATCG GCGCCTCGCT CGCCGTGCGG
CTTCCCGCCT ACCGCGACGT CGGCGGCATG CCGGAGATTC CGCTCGGCGA GGACGGCGCC
TTCGTCGGCG CGCTGCTCCA GCGGGGCTTT CGCGTGCGCC ATGACCGGGC GGTGCTGGTG
CTGATCTCGG CCCGGCTCAC CGGCCGCGCG GCCGGCGGCG TTGCCGACAC GATCCGCTCC
CGCTGTGAGG AGCCCGACGC CCTGTGCGAC GCCCGCATGG AGGCGGTCCC CCGCGCGCTC
CACCGCTACG TCTGGCGGGC GCGGCTGCGT CGCCTCTACG ACGAGGGCCG GCTCACCCGC
GATCTCGCCT GGGCGCGCCG GCTCGGCATC ACCGAGGCGG AAGCCCGCCG CATCGCCGCC
CTGCCACGGG TCGGCGAGAT CGTCGCGGCG GTCGATCGCG CCAGCCCGCG CCTCGCCTAC
CGCCCGCTGA TGCCGCGACA GCTCCCCGGC CAGATCCGGC TCGCACGCCT CGTGCTGCCG
CTGCTACGCG CGGGTCTCCG CCTGCCCCGG GCAACGCCGT CCGCACGCCC GGTCGCCCCA
ACGGCCACCG CCGACGCGTA A
 
Protein sequence
MPPVSPTPSR LDTLIAIPVR NEAERIARCL TAIDRQTGLA PGRLGLVLFL NNCTDDTAEI 
VARLVPALSI PVRVIERVHA GAHAGWARRA AMDAAVAWLE AEGTTSATAT LLTTDADSIV
PPDWVAANLA ALEAGADAVA GRVELIPEEA ALLPPSLPAR GRLEDTYDAL ITEMEARIDP
DPHDPWPCHR TTIGASLAVR LPAYRDVGGM PEIPLGEDGA FVGALLQRGF RVRHDRAVLV
LISARLTGRA AGGVADTIRS RCEEPDALCD ARMEAVPRAL HRYVWRARLR RLYDEGRLTR
DLAWARRLGI TEAEARRIAA LPRVGEIVAA VDRASPRLAY RPLMPRQLPG QIRLARLVLP
LLRAGLRLPR ATPSARPVAP TATADA
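
The gene and protein sequences above can be related with a short translation routine. The sketch below is illustrative only: it assumes the sequence is pasted with its space-separated 10-base groups, uses the standard amino acid assignments (which translation table 11 shares), and treats an initial GTG, TTG, or CTG as methionine, as the record does for its GTG start. The names `clean`, `gc_fraction`, `translate`, and `ALT_STARTS` are hypothetical helpers, not part of any library.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: alternative initiation codons read as Met at position 1.
ALT_STARTS = {"GTG", "TTG", "CTG"}


def clean(seq: str) -> str:
    """Remove whitespace between the 10-base groups and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in ALT_STARTS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence from the record above.
print(translate("GTGCCGCCCG TTTCCCCCAC GCCGTCGCGC"))  # prints "MPPVSPTPSR"
```

Applying `translate` to the last line of the gene sequence (`ACGGCCACCG CCGACGCGTA A`) gives `TATADA`, which matches the end of the protein sequence. `gc_fraction` can be run on the full gene sequence to compare against the GC content field.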