Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_1859 |
Symbol | |
ID | 5831627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2079066 |
End bp | 2081480 |
Gene Length | 2415 bp |
Protein Length | 804 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641367658 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_001639329 |
Protein GI | 163851286 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.708627 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCGCC TCAGGCTGTC GACCGATCCG TCTCTTCCGC CCTCCGGCGG GCCCGGTGGG CCCGGCGATG GATTGGCGGT GGCGCAGATT GGCGGCGTCC TGCGCCGCTC CTGGGCCTGG ATCGCCGTGC CGACGCTCGT GGCCGCACTC GGGGCGGGCG TCTTCGTGCA GGTGGTGACC CCGCGCTACA CCGGCGAAGC CAAGGTTCTG CTGGAAAGCC GCGATCCCGC CTTCGCTCGC ACCGCCGCCG AACGGACGGA CCAGTCCCAG CCGATCGACG AGCAGGCCGT CGCGAGCCAA GTACAGGTGG CGATGTCGCG CGACATCGCC CGCGAAGCGA TTCGCAGCCT GAAGCTGGTC GGAAACCCCG AATTCGATCC GGAGGCCGAG GGCAGCTCGG CGATCCGGCG CACGCTGATG ATGCTCGGGC TCGTCTCGGC GCCGATGGAT CGTCCGTCCG AGGATCGCAT CCTCGAAAGC TACCTCGACC ACCTGCTGGT CTATCCGGTC GGCAAGTCGC GCATCCTCGC CGTCGAGTTC CGCTCGCGCG ATCCCGAACT CGCCGCGCGC GGCGCCAACA CGGTGGCGGA CCTCTATCTC GCCTCGCTGG AGGCGGCCTC CGTCGATACC GCCCGTTACG CCTCGACCTG GCTCGGCAAC AACATCGCGA ACCTGCGTGC CCGCGTGGCC GAAGCCGAGG CGAAGGTCGA AGCGTTCCGC GCCAAGCACG GCCTGATCGG CACCGGCAGC AGCGCGGCGG CTCAACCGCT CTCGTCCCAG CAGCTCTCCG AATTGTCGAG CCAGCTCTCG CAGGCCCGCA CGATCCAGGC GGACCTGACT GCCCGCGCCA AGCTCCTCAA GGACATGATC AAGGAGGGCC GTGCCTTCGA GATCCCCGAT GTCGCCAACA ACGAGCTGAT TCGGCGCACC GTCGAGAGCC GCATGGCGAT GCGTGCGCAG CTCGCCCTCG AGTCGCGCAC GCTGCTGCCG GCCCACCCGC GCATCAAGGA GCTGACGGCC CAGGTCCAGG ATCTCGAGAA TCAGATCAAG GCAGCCGCCG AGCGGGTGGT GCGCACCCTC GAGAACGACG CCAAGATCGC GGGCGCTCGG GTCGAGAGCC TGCGCGCGGC GGTCGAGGGA CAGCAGGATG TGGTCGCCAA GGGCAACACC AGCGAGGTGG AGCTGCGCGC CCTGGAGCGC GAGGCGAAAT CCCAGCGCGA ACAGCTCGAA TCCTATCTCG CGCGCTACCG CGAGGCCGCC GCGCGTGACG CCGAGAATGC CAGCCCGGCC AATGCCCGCG TGGTGTCCCG TGCGATCGTG CCCGATCTGC CCTCCTTCCC CAAGAAGCTG CCGATCGTCG CCTTCGCCAC GATGCTGGCC TTCTTGCTCG CGAGCGCCGG GGTCATCGGC CGCCATCTCC TCGTCACGCC GGCCGGGCCG GGCGGAGACC GCACCGGGGA TGAGGGAGAG CCCCTCGTCC AAAGCACCCG GACCGATCGC CCGCGCGACT TCTATCCCGA GCCGGAGCCG CCCCCGTCGC GCCGCCCTGT CTACGAGCCG GTCTATGGCG GCGCCTATCC GGCCTCGGCG GCGGAACGCT TCGCGCCGGC TCTCGCCTTT GCCCATTCCC TGAGGGCGAC GGCGACGGTG CATCACCCGG TCTTCGCCAG CACGGCGCCG ATCGGGGCCG AGGCGGCCGC GCAAGCGGGT GGGGCGAAAA CGGACAAGGA AGGGCTCGGC GTCCCCGCGA AATCCTCCGC CTCCTCTGCC GATCTCGACG GGCTGATCGC CCGACTGGCG AACGGAACGG GCAAGTCGCA GGGCGAAGGG CAGGGCAACG GGCCGTTGGC CGGGCCGTCC AAGGGGGGCT GTGTGCTCGT GGTCGAAACC CCGCGCGCCG ATGGCACGCC CGGTCTTGCC TCCAATCTCG CCCGGGTGCT CGGTCCGCGC TACCAGACGC TGCTGGTCGA TGTGAACGGT GTGGTCTCCG GGCCGTCCGA GCCGGGACTG ACCGATCTAG TGGCCGGTGC CGCCGATTTC CTCGACGTGA TCCAGCCGCT GCGGGGTTCG CGTCTCCACG CGGTGAAGCG CGGCGCCGCG CCTCTCGATG TGCTGGTGGA GGAGCCGCAA GGCCTCGCCA TCGGGCTCAA CGCCCTGTCG CAGAGTTACG ATTGGGTGCT GTGCCGCCTC GATGCGCGCA ATACCGAAGA CGGTGCCGAA CTCATTCCCG CGGTCGGACC CTGCATGGAT TCGATCGTGA TCGCCTCGGA TGCGGCGTCC GACGACCCGG CTCTGGTCTC GCTCTATCGC CTCGCCAAGG AAACCGGTGT GGCCCGGGTG GTGGTTGCCC GCCACGGCGA GGACGCGGAC CTCACGCCGA GCCTGGAAGG GACGCCTCTG CGGCTCTCGG CCTGA
|
Protein sequence | MPRLRLSTDP SLPPSGGPGG PGDGLAVAQI GGVLRRSWAW IAVPTLVAAL GAGVFVQVVT PRYTGEAKVL LESRDPAFAR TAAERTDQSQ PIDEQAVASQ VQVAMSRDIA REAIRSLKLV GNPEFDPEAE GSSAIRRTLM MLGLVSAPMD RPSEDRILES YLDHLLVYPV GKSRILAVEF RSRDPELAAR GANTVADLYL ASLEAASVDT ARYASTWLGN NIANLRARVA EAEAKVEAFR AKHGLIGTGS SAAAQPLSSQ QLSELSSQLS QARTIQADLT ARAKLLKDMI KEGRAFEIPD VANNELIRRT VESRMAMRAQ LALESRTLLP AHPRIKELTA QVQDLENQIK AAAERVVRTL ENDAKIAGAR VESLRAAVEG QQDVVAKGNT SEVELRALER EAKSQREQLE SYLARYREAA ARDAENASPA NARVVSRAIV PDLPSFPKKL PIVAFATMLA FLLASAGVIG RHLLVTPAGP GGDRTGDEGE PLVQSTRTDR PRDFYPEPEP PPSRRPVYEP VYGGAYPASA AERFAPALAF AHSLRATATV HHPVFASTAP IGAEAAAQAG GAKTDKEGLG VPAKSSASSA DLDGLIARLA NGTGKSQGEG QGNGPLAGPS KGGCVLVVET PRADGTPGLA SNLARVLGPR YQTLLVDVNG VVSGPSEPGL TDLVAGAADF LDVIQPLRGS RLHAVKRGAA PLDVLVEEPQ GLAIGLNALS QSYDWVLCRL DARNTEDGAE LIPAVGPCMD SIVIASDAAS DDPALVSLYR LAKETGVARV VVARHGEDAD LTPSLEGTPL RLSA
|
| |