Gene Mext_0589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_0589 
Symbol 
ID5835813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp650767 
End bp652821 
Gene Length2055 bp 
Protein Length684 aa 
Translation table11 
GC content71% 
IMG OID641366372 
Productglycosyl transferase family protein 
Protein accessionYP_001638074 
Protein GI163850031 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.271762 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0559054 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAGC CTGATGCAAG GCGGGGCGAT GGATGGGACC TGTGGCGGGG ACGTGAGACG 
TCGTTCTCCC TGGTGTCTAT CGACGATCCC CTGTTCGACC GCCGCCTCCT CCTGTCGCCC
CCCGATCGGA GACCGTGGCA GGACCGTTGG ATCGTCCTGA CCTACACGCT GGCCGTGCGA
GCGTGGCCGC AGAGGCCGCT GCTGCGTCTG CTCCGCAGCG CCGGCCGACG CGACAGCTTC
GTGCTGCCGG GTGTGGCCCT CGGCGCCGCG CGCTGGCTCG GCTACCTGCC CGGCGATTGC
GAGGGCATCG AAATCGCGGC CGAGCCGGGC TTCGTGCTGG AGCGCGTTGG GCTGCGGAGC
AGCGCGAGCG TCTTTGCGGA GGCGCTGCTG AAGCGGCCGG GACGCGCCGT AGCAGCCCTG
CGCGCGGGTC TCGCGCGCGA CGAGCGGCGC TGGCGCGACA GCCTTCGCGG CGCCTGCGCG
GTGTCGCCCC TTGCCCGCTA TCCGGCCTGG AAGGCGTCGC GGCTGTTTCG CGCCGCATCG
ACCGAGCCAA GATCCGGCGC GCAGATCCGT CTGGTTCTTC CCGCTCCCTT CGCGCAGGCC
GATGCGGTTG CGCGCAGCGT CGCGAGCCTG CGCGCCCAGA CCCATCAGGA CTGGTCTCTG
CTCATCGCCT GGACGGACGG CGCCCCGCCG ACGAATCCGG GCATCGACCG GCGCGTGCTC
AACATCTCAT GGAATCCGGC GGCGACGCTG CGCGAGCTTG CCGGAGGGGC CGACCTGTTC
GGCCTGCTCC GCCCCGGCGA CGTCCTGGCG CCCGAGGCGC TGCACCTGCT TGCCAAGAGC
CGGGAGGCCG AGGCGTCCGA GATGGTCTAT GCCGACGAGG AGACCGGCGG CCGGACGCTG
AGGCCGCGCC TCAAGCCGGA TTGGAGCCCC GATCTGGCGC TTGCCATGGG CTATGTCGGC
GCGCCCGCGC TGATCGCCGG GGACTTCCTC GCCAGACTGC CCGCCGAGCC GGTGGACACG
CCCGACGCTT TGGCTGTCAC GCTCGATCTC GCGGCCAGTT CCGCGACCCG TGTCGCACAC
ATCCCCCGCA TCCTGTGCCG TCGCGAGCCC GTCACGGCCG ATCCGGCCGC CCGCGCGCCG
CACCTCGACC AGCACCTGCG CAGCACAGGA TCGTCGGCGC GTGCGGCGCT TCGGGACGGC
CGCCTTGATC TTCAATGGCC GCTGCCGGAC CCGGCACCGC TCGTCAGCAT CATCATCCCC
TCCCGCGACC GGTTCGACCT GATCGCGCGG GTCACCGAGG ATGTCCTCGA AAAGACGCCC
TATCCCGCCC TCGAACTGGT GATCGTGGAC AATGGCTCGA AGGAGCCGGC GGTGCTGGAT
CTGTATGAGC GGTTGCGCCT CGATCCACGG GTCCGGATCG AGCCCTATCC GCACCCCTTC
AATTTTTCGG CACTGGTCAA TGCCGGCGCG CGGAAGGCGC GCGGCGGCGT CCTCGTCCTG
CTCAACAACG ACGTGGCGGT ACTGCGGCCC GACTGGCTCG ACGTTCTCGT CGCTCAGGCG
GTCCGGCCGG AGGTCGGCGC GGTCGGCGCG AAACTCCTCT ACGAGGATGG GCGCCTTCAG
CACGCGGGTG TCGTGGTCGG ACTCGGCGGC GAGGCCGGCC ATATCCTGCG CCGCCGCCCC
GCCGACACGC CCGGCCATCT CGATCGCCTG AGCGTGGCGC ATGAGGTCTC GGGCGTCACG
GCGGCCTGCC TCGCCGTCAC GCGCGACAAG TACCAGGCCG TGGGCGGTTT CGACGAAGAG
ACCTTTGCCG TCGATTTCAA CGATATCGAC TTCTGCCTGC GTCTCGGCGT GCGGGGCTGG
AAGACGGTGT GGACACCGCA TGCGGTGCTG TCTCACCTCG AATCGGTGAG CCGCGGCCGG
CCGGTCGGTG AGGCCCGCGC GCGCTTCGAG CGCGAGGCCG CCGCCTTCAC CGAACGCTGG
CGCGACGTGA TCCGGCACGA TCCGTTCTAC CATCCGGCCC TCTCGCTCAC GACCTTCGGC
GAGGAGCTGG AATGA
 
Protein sequence
MAKPDARRGD GWDLWRGRET SFSLVSIDDP LFDRRLLLSP PDRRPWQDRW IVLTYTLAVR 
AWPQRPLLRL LRSAGRRDSF VLPGVALGAA RWLGYLPGDC EGIEIAAEPG FVLERVGLRS
SASVFAEALL KRPGRAVAAL RAGLARDERR WRDSLRGACA VSPLARYPAW KASRLFRAAS
TEPRSGAQIR LVLPAPFAQA DAVARSVASL RAQTHQDWSL LIAWTDGAPP TNPGIDRRVL
NISWNPAATL RELAGGADLF GLLRPGDVLA PEALHLLAKS REAEASEMVY ADEETGGRTL
RPRLKPDWSP DLALAMGYVG APALIAGDFL ARLPAEPVDT PDALAVTLDL AASSATRVAH
IPRILCRREP VTADPAARAP HLDQHLRSTG SSARAALRDG RLDLQWPLPD PAPLVSIIIP
SRDRFDLIAR VTEDVLEKTP YPALELVIVD NGSKEPAVLD LYERLRLDPR VRIEPYPHPF
NFSALVNAGA RKARGGVLVL LNNDVAVLRP DWLDVLVAQA VRPEVGAVGA KLLYEDGRLQ
HAGVVVGLGG EAGHILRRRP ADTPGHLDRL SVAHEVSGVT AACLAVTRDK YQAVGGFDEE
TFAVDFNDID FCLRLGVRGW KTVWTPHAVL SHLESVSRGR PVGEARARFE REAAAFTERW
RDVIRHDPFY HPALSLTTFG EELE