Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mchl_4542 |
Symbol | |
ID | 7118507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | + |
Start bp | 4811776 |
End bp | 4813584 |
Gene Length | 1809 bp |
Protein Length | 602 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643527241 |
Product | glycosyl transferase family 39 |
Protein accession | YP_002423246 |
Protein GI | 218532430 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0207348 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTGCC TTGGCATCGC GCCGCCCGCG CTGTCACGGG CGGAGGATGG CTGGCCGCTC GTCGGCGCGC TGCGCCTTGG CGATCGGGTG CTGTCCTTCG GGGCCGCGAG CCATCTCCGC GCCTGCCTCC TGCTCCTGCT GATCGGTCTC GCCAGCTTCC TGCCGGGCCT TGCCTCGCTC CAGCCGATGG ACCGGGACGA GCCGCGCTTT GCCCAAGCCT CCAAGCAGAT GCTGGAGACG GGCGACCTCG TCGATATCCG CTTCCAGGCC GAGGCCCGCC ACAAGAAGCC GGTCGGGATC TACTGGGCCC AGGCCGCCGC CGTCGCGGCC GGCGAGGCGC TCGGTGTGCC GCAGGCGCGC ACGCAGATCG GGCTCTACCG GATCCCCTCG CTTCTCGGCG CGCTGGCGGC GATCCTGCTG ACCTACTGGG CGGGCCTCGC CCTGCTCGAC CGTCGCCGGG CGCTGCTGGC CGCCGCCCTG TTTTCCGCCT GCATCATGCT CTCGGCGGAA GCGCGCCTTG CCAAGACCGA CGCGCTGCTC ACCGCCTGCT CGGTCGCCGC GTTTGGCGCG CTCGCCCGCG CCTGGCTCGG GCGCGCCCGG TTGGAGCGGC GCCGCGGCCC GGCCTCGCTC GGAACGGCCT TGGTCTTCTG GCTCGGGATC GCCCTCGGCA TCCTCGTGAA GGGGCCGATG GTGCCGCTCT TCGCCGGGCT CGCCGCCTTC GTGCTGTGCC TGCGCGAGGG CTCGGCCCGC TGGCTGCTCG ACCTGCGCCC GCGCTTAGGC CTCCTCATCA CGCTCGCCGT CGTGGCGCCC TGGTTCCTGG CGATCGCCTG GAAGAGCGGC GGCGCCTTCT TCGGCGAGGC GGTGGGGCGC GACATGCTCG GCAAGGTCGG CACCGGCGCC GAGAAGCATT GGGGCCCGCC CGGCGCCTAC GCGCTGGCCT TCTTCGCCAC CTTCTGGCCG GGCGCCGCCT TCGCCGCCCT CAGCCTTCCC TTCGCCTGGG CGCGGCGGGG CGAGGAGGCG GTGGCGCTGC TGCTCGCCTG GATCGTGCCG ATGTGGCTGA TCTTCGAGGC GGTGCCGACC AAGCTGCCGC ATTACGTCCT CCCCCTGATG CCGGCGGTGG CGATTCTCAC CGTGCTGGCG CTGTCGCGCG GCGCGCTCGA TCCGCGCCGT CCGGGCGCGC GCTGGGTGGC GGGGCTCGTC GTGCTGATTC CGGTCGGGCT GACGCTGGGC CTCAGCCTTG CTGCGTGGCG TCTCGACCAT GTGCTGCCCC TGGCCGCCCT GCCGCTGCTG CTCGCCGCTT GCATTCTCGC CGGCCTCGCT TGGGCCGCCT TCGCCCGCGG GGCGAGAGAA GGGGCGGAGC AAAGGGCGGG GCAAGAGACA CGGCCAGAGG CAGGGGAGGG CGCTCTGGTG CTCGCCGTCG CCGCCTCGGT GGTACTGTCG GGCGCCGTGT TCGGCCTGAC CCAGCCGGTG CTGCAAAGCC TCAAGGTCTC GCCGCGGCTC GCTGCGATCC GCGATGCCCT GCCCTGCGAG GCCCCGCGTG TGGCAAGCCT CGGCCTTCGC GAGCCGAGCC TCGTCTTCAC CGTCGGCACG GATCTGGCCA TGCTGAATTC GGGTGCGGAG GCCATCGCCT TCCTACGGGA GGGCGGCTGC CGCCTCGTGC TGGTCGAGGA CCGGTTCGCC GCCGAATTCA CGGCGGCCGA AGGCGGGCAA CCGCTTAGCC CCGTCGGCCG GGTCACCGGC TTCAACATCA ACGGCGGCAA GCCGGTCGGG GTCTCCGCCT ACGCCGCGCT GCCGGGTTCC ACGCCATGA
|
Protein sequence | MTCLGIAPPA LSRAEDGWPL VGALRLGDRV LSFGAASHLR ACLLLLLIGL ASFLPGLASL QPMDRDEPRF AQASKQMLET GDLVDIRFQA EARHKKPVGI YWAQAAAVAA GEALGVPQAR TQIGLYRIPS LLGALAAILL TYWAGLALLD RRRALLAAAL FSACIMLSAE ARLAKTDALL TACSVAAFGA LARAWLGRAR LERRRGPASL GTALVFWLGI ALGILVKGPM VPLFAGLAAF VLCLREGSAR WLLDLRPRLG LLITLAVVAP WFLAIAWKSG GAFFGEAVGR DMLGKVGTGA EKHWGPPGAY ALAFFATFWP GAAFAALSLP FAWARRGEEA VALLLAWIVP MWLIFEAVPT KLPHYVLPLM PAVAILTVLA LSRGALDPRR PGARWVAGLV VLIPVGLTLG LSLAAWRLDH VLPLAALPLL LAACILAGLA WAAFARGARE GAEQRAGQET RPEAGEGALV LAVAASVVLS GAVFGLTQPV LQSLKVSPRL AAIRDALPCE APRVASLGLR EPSLVFTVGT DLAMLNSGAE AIAFLREGGC RLVLVEDRFA AEFTAAEGGQ PLSPVGRVTG FNINGGKPVG VSAYAALPGS TP
|
| |