Gene Information |
Locus tag | Mext_4173 |
Symbol | |
ID | 5832441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 4643609 |
End bp | 4645417 |
Gene Length | 1809 bp |
Protein Length | 602 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641369963 |
Product | glycosyl transferase family protein |
Protein accession | YP_001641613 |
Protein GI | 163853570 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
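The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced here, not part of any database tool.

```python
# Values copied from the Gene Information fields above.
START_BP = 4643609
END_BP = 4645417
GENE_LENGTH_BP = 1809
PROTEIN_LENGTH_AA = 602


def span_length(start: int, end: int) -> int:
    """Length of the interval [start, end].

    Assumption: coordinates are 1-based and inclusive at both ends.
    """
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one stop codon.

    Assumption: an unspliced CDS whose last codon is a stop codon.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

A failed assertion would suggest either a different coordinate convention or a feature that is spliced, partial, or otherwise not a plain CDS.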
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCGCC TTGGCGTCGC GCCGGCCGCG CTGTCACGGG CGGAGGGTGG CTGGCCGCTC GACGGCGCGC TGCGCCTTGG CGACCGGGTG CTGTCCTTCG GGGCCGCGAG CCATCTCCGC GCCTGCCTCC TGCTCCTGCT GATCGGTCTC GCCAGCTTCC TGCCGGGCCT TGCCTCGCTC CAGCCGATGG ACCGGGACGA GCCGCGCTTT GCCCAAGCCT CCAAGCAGAT GCTGGAGACG GGCGACCTCG TCGATATCCG CTTCCAGGCC GAGGCTCGCC ACAAGAAGCC GGTCGGGATC TACTGGGCCC AGGCCGCCGT CGTCGCGGCC GGCGAGGCGC TCGGTGTGCC GCAGGCGCGC ACGCAGATCG GGCTGTACCG GATTCCCTCG CTCCTCGGCG CGCTGGCGGC GATCCTGCTG ACCTACTGGG CGGGCCTCGC CCTGCTCGAC CGGCGCCGGG CGCTGCTGGC CGCCGCCCTG TTTTCCGCCT GCATCATGCT CTCGGCGGAA GCGCGCCTTG CCAAGACCGA CGCGCTGCTC ACCGCCTGCT CGGTCGCCGC CTTCGGCGCG CTTGCCCGCG CCTGGCTCGG GCGCGCCCGG TTGGAGCGGC GCCGGGGCCC GGCCTCGCTC GGAACGGCCT TGGTCTTCTG GCTCGGGCTC GCGCTCGGCA TCCTCGTGAA GGGGCCGATG GTGCCGCTCT TCGCAGGGCT CGCCGTCTTC GTGCTGTGTC TGCGCGAGGG CTCGGCCCGC TGGCTGCTCG ACCTGCGCCC GCGCTTGGGC CTCCTCATCA CGCTCGCCGT CGTGGCGCCC TGGTTCCTGG CGATCGCCTG GAAGAGCGGT GGCGCCTTCT TCGGCGAGGC GGTGGGGCGC GACATGCTCG GCAAGGTCGG CACCGGCGCC GAGAAGCATT GGGGCCCGCC CGGCGCCTAC GCGCTGGCCT TCTTCGCCAC CTTCTGGCCG GGCGCCGCCT TCGCCGCCCT CAGCCTTCCC TTCGCCTGGG CGCGGCGGGG CGAGGAGGCG GTGGCGCTGT TGCTCGCCTG GATCGTGCCG ATGTGGCTGA TCTTCGAGGC GGTGCCGACC AAGCTGCCGC ATTACGTCCT CCCCCTGATG CCGGCGGTGG CGATCCTGAC CGTGCTGGCG CTGTCGCGTG GCGCGCTCGA TCCGCGACGT CCGGGCGCGC GCTGGGTGGC GGGGCTCGTG GGGTTGATTC CGGTCGGGCT GACGCTGGGC CTCAGCCTCG CCGCGTGGCG TCTCGACCAT GTGCTGCCCC TCGCCGCCCT GCCGCTTCTG CTCGCCGCCT GCCTCCTCGC CGGCCTCGCC TGGGCCGCCT TCGCCCGCGG GGCGAGAGAA GGGGCGGGGC AAGGGGCGGG GCAAGGGACA CGGCAAGAGG CAGGGGAGGG CGCTCTGGTC CTCGCCGTCG CCGCTTCGGT GGTGCTGTCG GGCGCCGTGT TCGGCCTGAC CCAGCCGGTG CTGCAAAGCC TCAAGGTCTC GCCACGGCTC GCCGCGATCC GCGATGCCCT GCCCTGCGAG GCCCCGCGTG TGGCAAGCCT CGGCCTTCGC GAGCCGAGCC TCGTCTTCAC CGTCGGCACG GATCTGGCCA TGCTGAATTC CGGCGCGGAG GCCATTGCCT TCCTACGGGA GGGCGGCTGT CGCCTCGTGC TGGTCGAGGA CCGGTTCGCC GCCGAATTCA CGGCGGCCGA AGGCGGGCAA CCGCTTACCC CCATCGGTCG GGTCACCGGC TTCAACATCA ACGGCGGCAA GCCGGTCGGG GTCTCCGCCT ACGCCGCGCT GCCGGGTTCC 
ACGCCATGA
|
Protein sequence | MTRLGVAPAA LSRAEGGWPL DGALRLGDRV LSFGAASHLR ACLLLLLIGL ASFLPGLASL QPMDRDEPRF AQASKQMLET GDLVDIRFQA EARHKKPVGI YWAQAAVVAA GEALGVPQAR TQIGLYRIPS LLGALAAILL TYWAGLALLD RRRALLAAAL FSACIMLSAE ARLAKTDALL TACSVAAFGA LARAWLGRAR LERRRGPASL GTALVFWLGL ALGILVKGPM VPLFAGLAVF VLCLREGSAR WLLDLRPRLG LLITLAVVAP WFLAIAWKSG GAFFGEAVGR DMLGKVGTGA EKHWGPPGAY ALAFFATFWP GAAFAALSLP FAWARRGEEA VALLLAWIVP MWLIFEAVPT KLPHYVLPLM PAVAILTVLA LSRGALDPRR PGARWVAGLV GLIPVGLTLG LSLAAWRLDH VLPLAALPLL LAACLLAGLA WAAFARGARE GAGQGAGQGT RQEAGEGALV LAVAASVVLS GAVFGLTQPV LQSLKVSPRL AAIRDALPCE APRVASLGLR EPSLVFTVGT DLAMLNSGAE AIAFLREGGC RLVLVEDRFA AEFTAAEGGQ PLTPIGRVTG FNINGGKPVG VSAYAALPGS TP
|
| |
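The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The sketch below shows one possible way to strip the block formatting and compute a GC percentage comparable to the GC content field. The helpers `clean_sequence` and `gc_percent` are hypothetical names introduced here, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence, used only as a short example.
example = clean_sequence("ATGACGCGCC TTGGCGTCGC")
print(len(example), gc_percent(example))
```

Applying the same two calls to the full gene sequence should give a value that can be compared with the rounded GC content reported in the record.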