Gene Information |
Locus tag | Mext_0845 |
Symbol | |
ID | 5833335 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 920550 |
End bp | 922727 |
Gene Length | 2178 bp |
Protein Length | 725 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641366627 |
Product | glycosyl transferase family protein |
Protein accession | YP_001638321 |
Protein GI | 163850278 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
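The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both are assumptions about this record, not stated in it. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(920550, 922727)
protein = protein_length_aa(length)
print(length, protein)  # prints "2178 725"
```

Under those assumptions the result agrees with the reported Gene Length (2178 bp) and Protein Length (725 aa).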
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00935412 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.295818 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGCC GCGAGGCGCT CGGCCTCTAT CCCCTGCGCG ACCTGCGCCG TTGTGCGGAC GACGCCTGGA GCGCGACCGG CCCTGATCCG ATCTTCGCGC TCGGCCCAGC GGACGCGGTG TCGGCCCTGT CGGGCCAGCG CATCCGCATC CGCTGCCGGC CGGATTCGGT GCGGCCGGTG CTCGCGGTCG AGGCGCGGGG CGAGGCCGAG CCGCGCCGCT ACCGGCTCAC CGTGCGGGCG GACGGCACCG ACGACCTGAT CCGCCTGCCG CGCGGCACAC GCGCGCTGCG CCTGGAAGCC GCCGAGGCAG GCCCGCGCTT CCGCCTCGAT GGGGTGCGGA TCGCGCCGGC GGGCCGGATC GAGGCGGCCA TGCTGTCCGC CCAAAAGATC ATCGAACGGT TGCCACCGCA GGAGCGTCGG CCGCTGCGCC TGCTGCGACG CGCCGTCGGC CTCCTGCGCA GCGTGCCGCC CGGCGAAATC TGGCGCCGCC TCACCCGCGC CGCCCAGACC CACTGTCCGC CCGCCAGCTA CGCCGCCTGG ATTCAAGCGG TGGAAGCCAA GGCCCTGCCC TCCATCGAGA CGATGCGGGC GCAGCTCGCG GCCCTGCCGG AGCGCCCATT GATCTCTCTG CTCATGGCGG GCGGCGACGA TCCGGTCCAG CTCGGGGAGA CGGTCGCGAG TCTGCGGGCG CAGGTCTACC CGGATTGGGA ATTGTGCCTC GCGGCGGCGC CATCGGCCGC GCTTGACGCG CTGGCGGCGG AGGACGACCG CATCCGTCTG CTGCTGCCCG GACACGGCAA AGCGACCTCC CTCGATGCGG CGCTGGCGCA GGCGCGCGGT TCCTATCTCG CCACGATGGA GCCCGGCGCC CGGCTTGCGC CGCAGGCGCT TCTCGCCCTC GTCCGGCGGC TCGCCCGCGA GCCCGATCTC GACCTGATCT ACACGGATGA GGACCGGATC GGGCCGGCGG GCGAGCGCTG CGATCCCTAT TTCAAGCCGG ACTGGTCGCC CGAGACACTC GAGAGCAGCT TCTACATCGG CGGGCTTGCC CTCTACCGCA CCGCCGGGGT TCGGGCGGCC GGCGGCTTTC CGGGCGAGAG CGAGGGCGCG GCCGACTACG ACCTCGCGCT CCGCGTGACC GAGCGGAGCG ACCGCGTCGG CCACGTCGCG CAGGTGCTCT GCCATCGCCG CGCCGCCGCG CCCGATCCGG AGGCGGCCGC CCGCGCGCTG GCTGGACGCG CCCGGCGCAC CGGCGGGCTC GATGCCGTCC GCGCGCTCGG CCCCGTGCAT TTCGCCCTGC GGCGCGCGGT CGCGACGCGC CCACTGGTCT CCCTGGTGAT CCCGACCGCC GGGCGCGACA GCCTCATCGG CGGGCGCACG ATCGATCTGC TCGCCGCCTG CCTCGCTAGC ATCCGCGAGA CCGGCACTTA CGACAACATC GAGATCGTGG CGGTGGATAA CGGCGACCTG CGCCCAAAGA CGCGGGCCGC CGTCGAGCGG TTCGGCGCGC GAACCGTCAC CTGGGACAAG CCCGTCTTCA ACGTCGCCTC CAAAATGAAT CTTGGCGCAA GGGCGGCGGG CGGCGAGGTT CTGGTCTTTC TCAACGACGA CATCAGCATC CTCACGCCGG ATTGGATCGA GGCGATGCTG GCGCAGCTCG CCATCCCCGG CGTCGGTGCG GTCGGCCCCA AGCTCCTGTT CGAGGATGGC AGCCTTCAGC ATGCCGGCGT GGTCTTCGGC GAGGGCCTGC CGGACCACGT CCGCCGTGGC TTCCCCGGTG ATGACGCCGG GTATCACGGC 
TCGAGTCTGG CCAACCGCAA CACTCTCGCC GTGACCGGCG CCTGCGTGAT GGTGCGGCGG GCCGATTTCG AAGCCATTCG GGGCTTCGAT GAGGGATACG CCATCAACTA CAACGACATC GACCTCTGCC TGCGGCTCGG CGAGCGAGGA CTGCGCACGG TCTATTGTGC CGAGGCGAGC CTGCACCATT ACGAGAGCCG CAACCGCATC CCCACCGTCG ATCCCGCCGA GCAGGCGCGC TTCCGCAAGC GCTGGGGCGC ACGGCTCGCC CGCGACCCGT ACTACCCGGA CCCGTTCGGC ATCCGCCCGC CCGCCTTCAC CCTCGACGCC GAGCGGTTTC CGACAGCACG GCAGCGCATG GTAGAGGCGT GGCGATGA
|
Protein sequence | MKRREALGLY PLRDLRRCAD DAWSATGPDP IFALGPADAV SALSGQRIRI RCRPDSVRPV LAVEARGEAE PRRYRLTVRA DGTDDLIRLP RGTRALRLEA AEAGPRFRLD GVRIAPAGRI EAAMLSAQKI IERLPPQERR PLRLLRRAVG LLRSVPPGEI WRRLTRAAQT HCPPASYAAW IQAVEAKALP SIETMRAQLA ALPERPLISL LMAGGDDPVQ LGETVASLRA QVYPDWELCL AAAPSAALDA LAAEDDRIRL LLPGHGKATS LDAALAQARG SYLATMEPGA RLAPQALLAL VRRLAREPDL DLIYTDEDRI GPAGERCDPY FKPDWSPETL ESSFYIGGLA LYRTAGVRAA GGFPGESEGA ADYDLALRVT ERSDRVGHVA QVLCHRRAAA PDPEAAARAL AGRARRTGGL DAVRALGPVH FALRRAVATR PLVSLVIPTA GRDSLIGGRT IDLLAACLAS IRETGTYDNI EIVAVDNGDL RPKTRAAVER FGARTVTWDK PVFNVASKMN LGARAAGGEV LVFLNDDISI LTPDWIEAML AQLAIPGVGA VGPKLLFEDG SLQHAGVVFG EGLPDHVRRG FPGDDAGYHG SSLANRNTLA VTGACVMVRR ADFEAIRGFD EGYAINYNDI DLCLRLGERG LRTVYCAEAS LHHYESRNRI PTVDPAEQAR FRKRWGARLA RDPYYPDPFG IRPPAFTLDA ERFPTARQRM VEAWR
|
| |