Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_4212 |
Symbol | |
ID | 5831925 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 4686889 |
End bp | 4688049 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641370003 |
Product | glycosyl transferase family protein |
Protein accession | YP_001641652 |
Protein GI | 163853609 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.787376 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.316316 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGCCCG TTTCCCCCAC GCCGTCGCGC CTCGACACCC TCATCGCCAT CCCCGTGCGC AACGAGGCCG AGCGGATCGC CCGCTGCCTG ACGGCGATCG ACCGGCAGAC CGGCCTCGCG CCGGGGCGGC TCGGGCTCGT GCTGTTCCTC AACAACTGCA CCGACGACAC GGCGGAGATC GTCGCCCGTC TCGTGCCGGC GCTCTCGATT CCCGTCCGGG TGATCGAGCG CGTTCATGCC GGGGCGCATG CGGGCTGGGC GCGCCGCGCG GCGATGGACG CGGCGGTCGC GTGGCTCGAA GCGGAGGGGA CGACCTCCGC GACGGCGACG CTCCTGACCA CCGATGCCGA CAGCATCGTG CCGCCGGATT GGGTCGCGGC CAACCTCGCC GCCCTGGAGG CGGGCGCCGA CGCGGTCGCC GGCCGGGTCG AGTTGATCCC GGAGGAGGCG GCCCTGCTGC CGCCCTCGCT GCCCGCCCGC GGCCGGCTGG AGGACACCTA CGACGCGCTC ATCACCGAGA TGGAGGCGCG CATCGATCCC GATCCGCACG ATCCCTGGCC CTGCCACCGC ACCACGATCG GCGCCTCGCT CGCCGTGCGG CTTCCCGCCT ACCGCGACGT CGGCGGCATG CCGGAGATTC CGCTCGGCGA GGACGGCGCC TTCGTCGGCG CGCTGCTCCA GCGGGGCTTT CGCGTGCGCC ATGACCGGGC GGTGCTGGTG CTGATCTCGG CCCGGCTCAC CGGCCGCGCG GCCGGCGGCG TTGCCGACAC GATCCGCTCC CGCTGTGAGG AGCCCGACGC CCTGTGCGAC GCCCGCATGG AGGCGGTCCC CCGCGCGCTC CACCGCTACG TCTGGCGGGC GCGGCTGCGT CGCCTCTACG ACGAGGGCCG GCTCACCCGC GATCTCGCCT GGGCGCGCCG GCTCGGCATC ACCGAGGCGG AAGCCCGCCG CATCGCCGCC CTGCCACGGG TCGGCGAGAT CGTCGCGGCG GTCGATCGCG CCAGCCCGCG CCTCGCCTAC CGCCCGCTGA TGCCGCGACA GCTCCCCGGC CAGATCCGGC TCGCACGCCT CGTGCTGCCG CTGCTACGCG CGGGTCTCCG CCTGCCCCGG GCAACGCCGT CCGCACGCCC GGTCGCCCCA ACGGCCACCG CCGACGCGTA A
|
Protein sequence | MPPVSPTPSR LDTLIAIPVR NEAERIARCL TAIDRQTGLA PGRLGLVLFL NNCTDDTAEI VARLVPALSI PVRVIERVHA GAHAGWARRA AMDAAVAWLE AEGTTSATAT LLTTDADSIV PPDWVAANLA ALEAGADAVA GRVELIPEEA ALLPPSLPAR GRLEDTYDAL ITEMEARIDP DPHDPWPCHR TTIGASLAVR LPAYRDVGGM PEIPLGEDGA FVGALLQRGF RVRHDRAVLV LISARLTGRA AGGVADTIRS RCEEPDALCD ARMEAVPRAL HRYVWRARLR RLYDEGRLTR DLAWARRLGI TEAEARRIAA LPRVGEIVAA VDRASPRLAY RPLMPRQLPG QIRLARLVLP LLRAGLRLPR ATPSARPVAP TATADA
|
| |