Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3556 |
Symbol | |
ID | 5831102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 3934208 |
End bp | 3936475 |
Gene Length | 2268 bp |
Protein Length | 755 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641369350 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_001641007 |
Protein GI | 163852964 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATGAGG CGAGTGACAT CGCGCGCAGC GCGCGCCAGC GGGAGCAGCC GACGCTGCGG GAACCTGGCG CGACCAGGGA GACCATCGCG GCGCTGAGGG AGGTCGTAGG CCCAGGGGAG GCGCGCCACG AGCCGCCGTT CCCTGCGCAC GAGCAGTGCA CAGGGTTGCT CCCTGCTCCG GCGGCCGGCC TCACGCCCGC GGCGTCGGCG GCTTACGCCG ATAGCGGGCT CCGGATGCGC GACCTCCTCT CGACCCTCAG GCGGCGGGCG CGCCTCGTCG GGGCCGTGGC GGTCGCGGGG GCATTGCTGA CGGGCACGCT GTCCGCCCTC CTGCCGCCCT CCTACGTGAC GACCGCGCAT GTCATGGTCG AGCCGGCCGC CTCCGAGGTC GCTGCAGGCC TGCTGGCGCC CGTCGTAGAC ACTCACATCA TCCTGCTAAC CTCCGAGGCC CGCCTCAGGC GCGTGTTCGA TGACTTGGCC GCCGAGCCTC GATACATCGA GGCCTCGGCG GCCGTCGGGG CCCATCCCTC ATTCGCCGAA CGCGCCAAGT TCCTGCTGCG CGACGCGGCG CGCGAACTGG GCTTGGCGTC CCGCCCGGCA GAGCCCGAGT CCGAGACGCC GTCGACCGAC CTCGGGCCTG GCCAGGCCCG GGCGGCGATG CTGGTGCGCC AGGAGCGCCA GTCCAGCATC ATCAGCGTCA GCTTTCAAGA CCGCAGCGCG CAGAGGGCGG CGCTTGTCGC CAACCGTATG GTGACCCGAC ACGTGCAAGA ACTGAGCGAG CGCCGGACGC AGGAGGCCGC CGGACGCAAA GACTTCCAGG AGCAGCGGGT GGAGGAGGCT CGGGCGGAGG TTGAAGAGGC CGAGCGGCTG GTGCGCGCCT TTCAGGTCGA GAACGGGGCG TCCGTCACCG ACCGGGAGGG CGAGTCCGTC GCGGAGATGA CCCAACAGCT GGTCCTGTTG CGCTCGGAGA TCGCCTCACA CGAGCGGGAT GCCGGGGCAC CGGCCGCCGC CGACGAGCTT CGGGTCATCC GGCTCCAGGC CGATGCGCTC GAGGCGCGCC TGGCCGAGCT CAAGGCCGTT CAGGGCGCTA CCATCGATCG GCGCATCGAG AGGCACGCCC TGGACCTGCG CCTGGACACG GCCCGCAAGA ACCTGACCGA GCAGCTGCGC CAGCTGGAAG AGTCGGGCAA GCCCCAGCCT GCTTTCGCCT CCTCCGCCCG GGTCGTCGCC TCGGCCGGGG TTCCCACGCG CCCGAACTCC CTGCACCCCG CCGTGGTGGC CGTGCCCGCC CTAGGGGCCT TCGGCATCCT GGGCGCGATG GTTGCGCTGC TGGTGGAGCG GCTCGCCACG GGCTACCGGA GCGAGCGCGA GGTGGAAGAG GAACTGGGCG TTCCCTGCCT CGGCCTCGTC CCGCGCGCGC CGGTCGTGAG GGCGTTCGGC ACCGCGGCCG ACGATGCGCG CTCACCCTGG AGCCGCGCGG TGGGCTCGCT CGCGATGACA TTGCTGCACC GTTGCGGGCG GCCCGGCTCC CCCTGCGTGG TGCTGGTGAC GGCCTGCGCT CGCGAGGAGG ACAAGGCGGG GTTGGCGACT GCCCTCGCCG TACGCGCCCG TCAGGATGGA CTCCGCGTCC TGCTGGTGCG CTGGGACGAC GACGCCGCGC TTGGACAGGG TCTCCGCTCC TACCCGCTTT CCGACGGTGG GGCCGCCTTG AAATCACTTT CCGCCGCCTT CGCGCGCGAT CCGGCACTCG GCGTGGACCG CCCCGTCAGC GAGGGCGTCG GGCGCGATCT CGTGCTCGGC CTGGGAGGTG ACCGCTCCGT CCGAGTCAGG CATATCCTGG CTTACGAGTA CGACCTGGTG GTGGTGGACG CCCCGCCAGT GCTGGCCTCC TCGCAGGCGC GTTTGGTCGC GGATGAAGCC GACGCCGCGC TGCTCGCGCT CGCCTGGGGG CGCACGGACC GCAAGGTCGC GGACCAGGCG CTGCGCCTCC TGCGCCGGCC CATGCCGCTC GCGGGCGACG AGCCAGCTCC GGGCGAGCGC GCCATCCTCG CCGTGCTCAC AGACGTGAAC CTGAAGGCCC ATGCCCGCTA TCGCCTGGGT GACGTCGGCG AGCACCTGTT CGAGGCCCAG CGTCGGCGGG ACCGATCGCG CCGGACCGCT CCACGGCCCT CCGCCCAGAG CCAGACGCAG GCCGATTTCG CGTCCGACAA GCGCCCCGAT GCCGAGCCCG CGGCATCCCC GTTCCCGCGC AGGCGAGCAG GGGCGTGA
|
Protein sequence | MNEASDIARS ARQREQPTLR EPGATRETIA ALREVVGPGE ARHEPPFPAH EQCTGLLPAP AAGLTPAASA AYADSGLRMR DLLSTLRRRA RLVGAVAVAG ALLTGTLSAL LPPSYVTTAH VMVEPAASEV AAGLLAPVVD THIILLTSEA RLRRVFDDLA AEPRYIEASA AVGAHPSFAE RAKFLLRDAA RELGLASRPA EPESETPSTD LGPGQARAAM LVRQERQSSI ISVSFQDRSA QRAALVANRM VTRHVQELSE RRTQEAAGRK DFQEQRVEEA RAEVEEAERL VRAFQVENGA SVTDREGESV AEMTQQLVLL RSEIASHERD AGAPAAADEL RVIRLQADAL EARLAELKAV QGATIDRRIE RHALDLRLDT ARKNLTEQLR QLEESGKPQP AFASSARVVA SAGVPTRPNS LHPAVVAVPA LGAFGILGAM VALLVERLAT GYRSEREVEE ELGVPCLGLV PRAPVVRAFG TAADDARSPW SRAVGSLAMT LLHRCGRPGS PCVVLVTACA REEDKAGLAT ALAVRARQDG LRVLLVRWDD DAALGQGLRS YPLSDGGAAL KSLSAAFARD PALGVDRPVS EGVGRDLVLG LGGDRSVRVR HILAYEYDLV VVDAPPVLAS SQARLVADEA DAALLALAWG RTDRKVADQA LRLLRRPMPL AGDEPAPGER AILAVLTDVN LKAHARYRLG DVGEHLFEAQ RRRDRSRRTA PRPSAQSQTQ ADFASDKRPD AEPAASPFPR RRAGA
|
| |