Gene Mext_0845 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_0845 
Symbol 
ID5833335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp920550 
End bp922727 
Gene Length2178 bp 
Protein Length725 aa 
Translation table11 
GC content72% 
IMG OID641366627 
Productglycosyl transferase family protein 
Protein accessionYP_001638321 
Protein GI163850278 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00935412 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.295818 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGCC GCGAGGCGCT CGGCCTCTAT CCCCTGCGCG ACCTGCGCCG TTGTGCGGAC 
GACGCCTGGA GCGCGACCGG CCCTGATCCG ATCTTCGCGC TCGGCCCAGC GGACGCGGTG
TCGGCCCTGT CGGGCCAGCG CATCCGCATC CGCTGCCGGC CGGATTCGGT GCGGCCGGTG
CTCGCGGTCG AGGCGCGGGG CGAGGCCGAG CCGCGCCGCT ACCGGCTCAC CGTGCGGGCG
GACGGCACCG ACGACCTGAT CCGCCTGCCG CGCGGCACAC GCGCGCTGCG CCTGGAAGCC
GCCGAGGCAG GCCCGCGCTT CCGCCTCGAT GGGGTGCGGA TCGCGCCGGC GGGCCGGATC
GAGGCGGCCA TGCTGTCCGC CCAAAAGATC ATCGAACGGT TGCCACCGCA GGAGCGTCGG
CCGCTGCGCC TGCTGCGACG CGCCGTCGGC CTCCTGCGCA GCGTGCCGCC CGGCGAAATC
TGGCGCCGCC TCACCCGCGC CGCCCAGACC CACTGTCCGC CCGCCAGCTA CGCCGCCTGG
ATTCAAGCGG TGGAAGCCAA GGCCCTGCCC TCCATCGAGA CGATGCGGGC GCAGCTCGCG
GCCCTGCCGG AGCGCCCATT GATCTCTCTG CTCATGGCGG GCGGCGACGA TCCGGTCCAG
CTCGGGGAGA CGGTCGCGAG TCTGCGGGCG CAGGTCTACC CGGATTGGGA ATTGTGCCTC
GCGGCGGCGC CATCGGCCGC GCTTGACGCG CTGGCGGCGG AGGACGACCG CATCCGTCTG
CTGCTGCCCG GACACGGCAA AGCGACCTCC CTCGATGCGG CGCTGGCGCA GGCGCGCGGT
TCCTATCTCG CCACGATGGA GCCCGGCGCC CGGCTTGCGC CGCAGGCGCT TCTCGCCCTC
GTCCGGCGGC TCGCCCGCGA GCCCGATCTC GACCTGATCT ACACGGATGA GGACCGGATC
GGGCCGGCGG GCGAGCGCTG CGATCCCTAT TTCAAGCCGG ACTGGTCGCC CGAGACACTC
GAGAGCAGCT TCTACATCGG CGGGCTTGCC CTCTACCGCA CCGCCGGGGT TCGGGCGGCC
GGCGGCTTTC CGGGCGAGAG CGAGGGCGCG GCCGACTACG ACCTCGCGCT CCGCGTGACC
GAGCGGAGCG ACCGCGTCGG CCACGTCGCG CAGGTGCTCT GCCATCGCCG CGCCGCCGCG
CCCGATCCGG AGGCGGCCGC CCGCGCGCTG GCTGGACGCG CCCGGCGCAC CGGCGGGCTC
GATGCCGTCC GCGCGCTCGG CCCCGTGCAT TTCGCCCTGC GGCGCGCGGT CGCGACGCGC
CCACTGGTCT CCCTGGTGAT CCCGACCGCC GGGCGCGACA GCCTCATCGG CGGGCGCACG
ATCGATCTGC TCGCCGCCTG CCTCGCTAGC ATCCGCGAGA CCGGCACTTA CGACAACATC
GAGATCGTGG CGGTGGATAA CGGCGACCTG CGCCCAAAGA CGCGGGCCGC CGTCGAGCGG
TTCGGCGCGC GAACCGTCAC CTGGGACAAG CCCGTCTTCA ACGTCGCCTC CAAAATGAAT
CTTGGCGCAA GGGCGGCGGG CGGCGAGGTT CTGGTCTTTC TCAACGACGA CATCAGCATC
CTCACGCCGG ATTGGATCGA GGCGATGCTG GCGCAGCTCG CCATCCCCGG CGTCGGTGCG
GTCGGCCCCA AGCTCCTGTT CGAGGATGGC AGCCTTCAGC ATGCCGGCGT GGTCTTCGGC
GAGGGCCTGC CGGACCACGT CCGCCGTGGC TTCCCCGGTG ATGACGCCGG GTATCACGGC
TCGAGTCTGG CCAACCGCAA CACTCTCGCC GTGACCGGCG CCTGCGTGAT GGTGCGGCGG
GCCGATTTCG AAGCCATTCG GGGCTTCGAT GAGGGATACG CCATCAACTA CAACGACATC
GACCTCTGCC TGCGGCTCGG CGAGCGAGGA CTGCGCACGG TCTATTGTGC CGAGGCGAGC
CTGCACCATT ACGAGAGCCG CAACCGCATC CCCACCGTCG ATCCCGCCGA GCAGGCGCGC
TTCCGCAAGC GCTGGGGCGC ACGGCTCGCC CGCGACCCGT ACTACCCGGA CCCGTTCGGC
ATCCGCCCGC CCGCCTTCAC CCTCGACGCC GAGCGGTTTC CGACAGCACG GCAGCGCATG
GTAGAGGCGT GGCGATGA
 
Protein sequence
MKRREALGLY PLRDLRRCAD DAWSATGPDP IFALGPADAV SALSGQRIRI RCRPDSVRPV 
LAVEARGEAE PRRYRLTVRA DGTDDLIRLP RGTRALRLEA AEAGPRFRLD GVRIAPAGRI
EAAMLSAQKI IERLPPQERR PLRLLRRAVG LLRSVPPGEI WRRLTRAAQT HCPPASYAAW
IQAVEAKALP SIETMRAQLA ALPERPLISL LMAGGDDPVQ LGETVASLRA QVYPDWELCL
AAAPSAALDA LAAEDDRIRL LLPGHGKATS LDAALAQARG SYLATMEPGA RLAPQALLAL
VRRLAREPDL DLIYTDEDRI GPAGERCDPY FKPDWSPETL ESSFYIGGLA LYRTAGVRAA
GGFPGESEGA ADYDLALRVT ERSDRVGHVA QVLCHRRAAA PDPEAAARAL AGRARRTGGL
DAVRALGPVH FALRRAVATR PLVSLVIPTA GRDSLIGGRT IDLLAACLAS IRETGTYDNI
EIVAVDNGDL RPKTRAAVER FGARTVTWDK PVFNVASKMN LGARAAGGEV LVFLNDDISI
LTPDWIEAML AQLAIPGVGA VGPKLLFEDG SLQHAGVVFG EGLPDHVRRG FPGDDAGYHG
SSLANRNTLA VTGACVMVRR ADFEAIRGFD EGYAINYNDI DLCLRLGERG LRTVYCAEAS
LHHYESRNRI PTVDPAEQAR FRKRWGARLA RDPYYPDPFG IRPPAFTLDA ERFPTARQRM
VEAWR