Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_2423 |
Symbol | |
ID | 5833019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2729223 |
End bp | 2730836 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641368224 |
Product | glycosyltransferase |
Protein accession | YP_001639889 |
Protein GI | 163851846 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.848309 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCTGC GCGGTGTTCT CGACCATGTT TCCCACGGGC GGATCACCGG CTGGGCATGG GACAGCGACC AGCCCGACAG TCCCGTCAGT GTCGTGCTCT CGGCCAATGA TCAGGTTCTC GGACGCGTCC TAGCCAACCG CTACCGCCCC GACCTCGAGA AGGCCGGCAT TGGCAAAGGC AAGCACGGCT TTGAGCTTTG GCTGCCCACA CCGCTGTCCG TTTTGCAGCG TCATGCCATC AGTGCGCGCA TCGAGGGACC GGGAACCCAT ATCAAGCGCT CTCCACGCAT ACTGGAGGCC TCGACTGCCT TCGACCGCTC AACCCGAGAA ACAATCGACC AACTCCTATC AGCCGTCGCG GACGAAGCGG ATTTCGTCCG CCGCCTCGCT TTCCTAACCG ATCAGGCGGA TCGCCTGCGC CAGCGCTATG CTAATTACAG CAGTGGCGCT GAGGAGCGGC AGAGACGGCA GCGTCTCGGG TGGCAGGAGC AGAGCGCCAA ACCCGAGGGT ATGCCGCCGG CTCCTCCGAA GCCGCGCGCA CTCGTCATCG ATGAAGGCAT GCCGTTGGGT GGGCACGATG CTGGTTCCAA CGCCGTCCTC TCACATATGC GCAGCCTGCA GCGGCTCGGC TACGCGGTGA GCTTCGCACC ATCCAACCTG CGCGGTGACA GCAGGCTGCT CGAAGAGGAA GGCATCACCT GCTGCGTCCA TCCTTGGTAC GCCACGATCG AGGAGGTGCT GAGGCGACAG CGCAACGGTT TCGAGGTCGT CTACATGCAT CGAGGGGCGA CAGCTTCCCG TTACGTGGCG CTCGCGCGGG ATCATCAGAA GCGCGCGAGA CTCTTGTATA GCGTCGCCGA CCTTCACCAT CTGCGGCTCT CTCGACAAGC CGATATCGAG GGGCGGCCGG AACTCCTTTC ATACAGCGAA CACGTTCGGA TCAAGGAATT GTCCGCGGCT TGGCAGGCTG ACGCGATCAT CACGCATTCC AGCGTCGAGG CCGAGATCCT GCGCAGACAT ATGCCGGCGG AGAAAGTCCA TGTCGTTCCT TGGAGCCTGA TACCCAATCC CACGACCGTT CCTTTTTCAG AGCGACGCGG ACTGGCCTTC ATCGGCGGCT ATGGCCACAG GCCGAATGTG GATGCAGCGC TCTGGCTGGT CGATACTGTC ATGCCTGAGA TCGATGCCCT TGGCGGCACG CTGCCCTGCC TGCTGGTGGG CAGCAACATG CCCAATCAGC TGCGCAGCCT TCAGCGACAC GATGTCGAGC CTGTAGGCCA CGTCGAACAT CTTTCTGCGG TTTTCGATCG TGTCCGCCTC ACGGTGGCGC CGCTTTCCTT CGGCGCGGGC GTGAAGGGCA AGGTGCTGGA CAGCTTGGCG GCCGGCGTCC CTTGCGTCTG CACCCCCGCG GCTGCGGAGG GCATGGATTT GCCCCAGGCA TTGCTTGATC TCGTTGCGGC AACGCCGGTC GACCTCGCTC GCTCAATCCG TGCGCTTCAC AACGATGAAG CTCTGAACCG CACCTGCAGC GAGGCTGGTC TCGCCTACAT CGCAGACCGC ACGAGCGAGA CGCGCGTTGA TGCCCTTCTG TCGGCAGCTA TCGGGCGCCG TTAA
|
Protein sequence | MPLRGVLDHV SHGRITGWAW DSDQPDSPVS VVLSANDQVL GRVLANRYRP DLEKAGIGKG KHGFELWLPT PLSVLQRHAI SARIEGPGTH IKRSPRILEA STAFDRSTRE TIDQLLSAVA DEADFVRRLA FLTDQADRLR QRYANYSSGA EERQRRQRLG WQEQSAKPEG MPPAPPKPRA LVIDEGMPLG GHDAGSNAVL SHMRSLQRLG YAVSFAPSNL RGDSRLLEEE GITCCVHPWY ATIEEVLRRQ RNGFEVVYMH RGATASRYVA LARDHQKRAR LLYSVADLHH LRLSRQADIE GRPELLSYSE HVRIKELSAA WQADAIITHS SVEAEILRRH MPAEKVHVVP WSLIPNPTTV PFSERRGLAF IGGYGHRPNV DAALWLVDTV MPEIDALGGT LPCLLVGSNM PNQLRSLQRH DVEPVGHVEH LSAVFDRVRL TVAPLSFGAG VKGKVLDSLA AGVPCVCTPA AAEGMDLPQA LLDLVAATPV DLARSIRALH NDEALNRTCS EAGLAYIADR TSETRVDALL SAAIGRR
|
| |