Gene Information |
Locus tag | Mext_3454 |
Symbol | |
ID | 5832089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 3832393 |
End bp | 3833589 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641369253 |
Product | putative glucosyltransferase |
Protein accession | YP_001640911 |
Protein GI | 163852868 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | [TIGR03472] hopanoid biosynthesis associated glycosyl transferase protein HpnI |
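
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: the helper names `gene_length` and `protein_length` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(3832393, 3833589)
print(length, protein_length(length))  # prints "1197 398"
```

Under those assumptions the result agrees with the Gene Length (1197 bp) and Protein Length (398 aa) fields.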
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.250583 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.104768 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCTGA ACGAGCTGCC GGCCTGGATC GCATCTGTGC TACTGCTGAT GGCGCTCGCC GGCTGCGTCT ATGCCCTGCT CGCGGCCTGG CTCGTCAACC GTTTCGCCGC GCGGCCGTCG CCCGCGCTCG CGGCCGATGC GCCGCGCCCC GGCGTGACGA TCCTCAAGCC CCTCTGCGGC CTGGAGCCGG ACCTCTTCGA GAACCTCGGA AGCTTCTGCC GCCAGGATTA TGCCGGCCCG GTGCAGATCG TGTTCGGCGT CCAGAACGCG GCCGACCCGG CGATTGCCGT GGTGCAGCGC CTGCGCGAAG CCCATCCCGC CCTGCGCCTC GACCTCGTGG TGGATCCGAG CCAGCACGGC TCGAACCGCA AGGTCTCCAA CCTCATCAAC ATGTCGGAGA AGATCGCCCA CGCCGTCGTG GTGCTGGCCG ACAGCGACAT GTCGGTGAAG CCCGATTATC TCGAGCGCGT CGCCGCCGCC CTGTCGCAGC CCGGCATTTC CGGCGTGACC TGCCTCTATC ACGGCGTGCC GGGCGACCGG GGCCTGTGCG CCCAACTCGC GGCGCTCGCC ATCGACGTGC AGTTCGTGCC CAACGTCATC CTCGGCACCA CCTTCGATCT CGCCCGGCCC TGCTTCGGCT CGACCATCGC GATGACGGCC GAATCGCTGG CCCGCATCGG CGGCTTCCGC GCGTTCAAGG ATGATCTGGC CGACGATTAC GCGATCGGCG AAGCGCTGCG CGCCGAGGGC GGCACGGTGG CGATTCCCGC CCTCACCATC GGGCATGCCT GCGTCGATAC CGAGCTGTCG GGCCTGTGGC GGCACGAGCT GCGCTGGAAC CGCACCATCC GCAACGTCGA TCCGAAGGGC TATGCCGGAT CGGTCGTGAC CCACGCCTTT CCGCTGGCGC TGCTCGCCGC ACTGATGCCC GGCGCCGGCT CCGGCGCGCT CGCGGTCGCC GCCCTGGCCC TTACCTGCCG CATCCTGCTG TGCCTGCGCA TCGAGCGGGC CTTCGGGCTC TCCCCCCACG CCTACTGGCT GTTGCCGATA CGTGACATGC TGTCCTTCAT CAACTTCACC TGGAGCTTCG TCTCGGGTGC GGTGACATGG AAAGGTCACG ATTACCGTGT GGTTGCGGAC GGTACGCTGA TTCCGGAGCA CGGCCTCGGT CGCGAGTCGC GCGCGACTTC GGTCTAA
|
Protein sequence | MDLNELPAWI ASVLLLMALA GCVYALLAAW LVNRFAARPS PALAADAPRP GVTILKPLCG LEPDLFENLG SFCRQDYAGP VQIVFGVQNA ADPAIAVVQR LREAHPALRL DLVVDPSQHG SNRKVSNLIN MSEKIAHAVV VLADSDMSVK PDYLERVAAA LSQPGISGVT CLYHGVPGDR GLCAQLAALA IDVQFVPNVI LGTTFDLARP CFGSTIAMTA ESLARIGGFR AFKDDLADDY AIGEALRAEG GTVAIPALTI GHACVDTELS GLWRHELRWN RTIRNVDPKG YAGSVVTHAF PLALLAALMP GAGSGALAVA ALALTCRILL CLRIERAFGL SPHAYWLLPI RDMLSFINFT WSFVSGAVTW KGHDYRVVAD GTLIPEHGLG RESRATSV
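
The sequences above are printed in space-separated blocks of ten characters, so the spaces have to be removed before any length or composition check. A minimal sketch, assuming the sequence has been copied into a Python string; `ungroup` and `gc_fraction` are hypothetical helper names, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def ungroup(seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (whitespace ignored)."""
    bases = ungroup(seq)
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two ten-base blocks of the gene sequence, used only as a short example.
head = "ATGGATCTGA ACGAGCTGCC"
print(len(ungroup(head)), gc_fraction(head))
```

Applied to the full gene sequence, the same two helpers would let the Gene Length and GC content fields be compared against the sequence itself.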