Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_2033 |
Symbol | |
ID | 5834916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2270040 |
End bp | 2271071 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641367831 |
Product | glycosyl transferase family protein |
Protein accession | YP_001639500 |
Protein GI | 163851457 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0472] UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.741083 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCTCG CCCCGCTCCT CGTCGTCCCG CTGTCGGCGC TGCTCTCCGC CGGCCTGATC GTGCTGCTGA AGCCGCTGCT GCAGCGCTAC GCCCTGGCGC GGCCGAACGC CCGCTCCAGC CACCGGGTGC CGACTCCCCA GGGCGGCGGG ATCGCGATCA TCGCCGCCGC GCTGATCACG GCGGTCGTCC TGGCCGTGCC GACCGGGATC GCGCTCGGGG ACGTCGAGCT GCCGGTGCTG GCGCTCTCGG TGATCGCTCT TGCCGTGGTC GGGGCCGTCG ACGACATCCG GCCGTTGCCG GCCTCGCTCC GACTCGTGAT CCAGGCCGGC GCGGTCGCCG CCGTCGTGCT CACCGCCGAG GGCCGTCTGC TGCCGGATAT GCCGCTCTGG CTGGAGCGGG CCTTCGCGAT CCTCGCCGGG CTCTGGTTCG TGAACCTCGT CAACTTCATG GACGGGCTCG ACTGGATGAC GCTGGCAGAG TTCGTGCCGC CCACCGCCTT CCTGTTCGCG CTGGGCCTCG CGGGCCGCTA CGCGCCCGAG CCGACGCTGG TCGCCGGCGC CCTGCTCGGT GGGCTTCTCG GCTTCGCGCC GTTCAACCGG CCGGTGGCAC GGCTGTTCAT GGGCGATGTC GGCTCGCTGC CGATCGGGCT GATCGTGGCG TGGCTGCTGT TCCGCCTCGC GGGGCAGGGG GGATTGCAAG AGGGCTTGGA AGGAGGTTTG GCCGCCGCGC TGATCCTGCC GCTCTATCCC ATCGCCGATG CGACGCTCAC CCTGCTGTGG CGCCTGCGGC GGGGGGAGGC CGTCTGGCAG GCCCATCGCA GCCACTATTA TCAGGTCGCC ACCGTCAATG GGTTCTCCGT GCGGGAGACC GTCGGCTCGG TCTTTGCCCT GCAGGTCGCG CTGGCCGCGC TGGCGGGCGT GACCTTGCTT TGGCCGTCTC GGGCCGTCAC GCTTGTCTGC CTCGTCCTCG CGGCGGGGCT GGTCGGGTGG CTGCTGCGCC GGTTCGCGCC GCCGCGGGCC GCCAGCGCGT GA
|
Protein sequence | MTLAPLLVVP LSALLSAGLI VLLKPLLQRY ALARPNARSS HRVPTPQGGG IAIIAAALIT AVVLAVPTGI ALGDVELPVL ALSVIALAVV GAVDDIRPLP ASLRLVIQAG AVAAVVLTAE GRLLPDMPLW LERAFAILAG LWFVNLVNFM DGLDWMTLAE FVPPTAFLFA LGLAGRYAPE PTLVAGALLG GLLGFAPFNR PVARLFMGDV GSLPIGLIVA WLLFRLAGQG GLQEGLEGGL AAALILPLYP IADATLTLLW RLRRGEAVWQ AHRSHYYQVA TVNGFSVRET VGSVFALQVA LAALAGVTLL WPSRAVTLVC LVLAAGLVGW LLRRFAPPRA ASA
|
| |