Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_1457 |
Symbol | |
ID | 6133047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 1600562 |
End bp | 1602292 |
Gene Length | 1731 bp |
Protein Length | 576 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641641732 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_001768401 |
Protein GI | 170739746 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCTAC AAGCCAGCCG CCTGACCACG CAGCTGCCGA TATTGCGAGA TCGCGGAGTT AATCTCCGCG AAATACTGAG CGTGTTGAAA CGCCGTCGGG CTATTTTCCT GGTTGTTACC TCGATCGTTC TATTTTCCGT GATAGCCTAC CTGTTCATCG CACGTCCTAC GTTTACCGCA ACAGCGCAGA TCCTTCTCGA CCAGCAGCAG AGAATCGCCG ACGATGTTCC CAGGGAGCAG CTACCGTCAG AAACTGTAAG CGCGATCGTC GAGAGCCAAG TTCAGACAGT GGCGTCCCAC GAAATTGTGA GGCGTCTTAT CGCAAGCGAG CATCTAACCA GCGATCCGGA ATTTGCTTCG CGGGGACTTG TATTGCAAAT ATTGCATTCC GCTTTGGGCA TGATCGGCAC AGCGGCGTAC GAGGAAGGAG ATCCAGAAGC GCGCGTACAC CAAAATGTTC GCAACGCGAT CTCAGCACGA AGGCTTGATA AAACTTTTGT TCTCGAGATC AGCTTTGAAT CGCATGATCG GCAAAAATCT GCGCGGCTCG CGAACGCCAC CGCGCGCGCC TTTATCGCCG ACCAGGTTGA GGCGAAAGTG GCCGCGAATC GCCGTCTTGC GGCGTCGTAT GAGGCGCGTC TACCGGAGTT GCGAGGGGAA TTGCAACGAG CCGAACAGGC GATTGAGACT TACAAGTTTC AGCACAGTAC GGTCGTTCCC TCCGGCGGCC CGGCCACGCT CGGCGGAAAT GAGGCCCTTG TCGGGCTGAG GGAACTCGAG CGGGAAGCAG AGACGAGTCG CGCCCTGTAT GTCTCGCAAC TGGCACGTTC GAGGAAGGCG TTCGAACAAG CGAATTTCTA CGTTGCTGAC GCACGCTTCA TTTCCCCCGC AATCCCGCCA GCGCGCCGCA GCTGGCCGCC GACGGGAGTG TTGCTCGTTG CTGGTCTCTT TGGTGGCATG AGCGTGGCCG CGGGCGTCGC GTTGCTGCGC GATCATTTGG ACACTCGCCT CTTTACGAAG GAACAAACCG AGTCGGAAAC GGAGTTTCCG GTCCTCGCCG ACATACCAGA AGCTCGGCCA AATTCACCCG ACATCGCGCG TTGCAATCAG GGTGCGTTCC TGCGTATCTT GGACTCTGTA CGCGAGCATT CAGAGAGAAA AAGCACCAGA ATTATACTTC TTACTTCGTC CGAACTGGGG GAGGGTAAAA GTACAATAGC TATAAACCTG GCGATGATTG CTGATAAACT CGGAGATAGC GTCCTGCTCG TTGACTCGCC GTTCGCCACG ACGGCGACGT CGGTGGCGGG GGAGCACATC TGGTTCGTCG ACTCGCCCTT CATCGTGCGA GCGGCGCTCT TGCCGTCGAG TGCCGGGATC AAGGCCACGA GTCAGAACGG AGAGGCGGCG CATCGTCAGG CGGCGCATCA CCCACTCGTG CTACAAGGCG ACTCCGCACG AAGAACCAGC CTACGAGACC AAATCGAATT TTTGCTGAAT TCGTCGACGC GAAAATTCGA TCTCATCATC TTGGAGCGAA GCGCGGCCAA CGACGACTGC GTTCTCCGTG ACATGAGCCA CATCGCACAC TCGATCATCA TTGTGGCGAA AGCCGGTCGA ACTCGGGTCG GCGACATTGC TTCGATCTCC GAAACACTTG GATTGGGACG CAAGCGAATC GCTGGTGTGG TACTTAACCG CACTCGGCGG AAGTCGAGGT TGCTGCCGTG A
|
Protein sequence | MSLQASRLTT QLPILRDRGV NLREILSVLK RRRAIFLVVT SIVLFSVIAY LFIARPTFTA TAQILLDQQQ RIADDVPREQ LPSETVSAIV ESQVQTVASH EIVRRLIASE HLTSDPEFAS RGLVLQILHS ALGMIGTAAY EEGDPEARVH QNVRNAISAR RLDKTFVLEI SFESHDRQKS ARLANATARA FIADQVEAKV AANRRLAASY EARLPELRGE LQRAEQAIET YKFQHSTVVP SGGPATLGGN EALVGLRELE REAETSRALY VSQLARSRKA FEQANFYVAD ARFISPAIPP ARRSWPPTGV LLVAGLFGGM SVAAGVALLR DHLDTRLFTK EQTESETEFP VLADIPEARP NSPDIARCNQ GAFLRILDSV REHSERKSTR IILLTSSELG EGKSTIAINL AMIADKLGDS VLLVDSPFAT TATSVAGEHI WFVDSPFIVR AALLPSSAGI KATSQNGEAA HRQAAHHPLV LQGDSARRTS LRDQIEFLLN SSTRKFDLII LERSAANDDC VLRDMSHIAH SIIIVAKAGR TRVGDIASIS ETLGLGRKRI AGVVLNRTRR KSRLLP
|
| |