Gene Information |
Locus tag | Mext_2300 |
Symbol | |
ID | 5835649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2548985 |
End bp | 2550271 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641368099 |
Product | O-antigen polymerase |
Protein accession | YP_001639766 |
Protein GI | 163851723 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3307] Lipid A core - O-antigen ligase and related enzymes |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00840255 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.00785546 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATGCAGA AACAAACGCC TAGAGCGTCG TCCTGGATCA GGTATCCAGG ACGACGCTCT AGCGCGGCGG CGCGGCTCTA CGCTGCCGGC GCCGTCGCGC TGGCGCTGGT GGCGCCGATG ATGGCCCTCG CCAACCGGTC GAGTCCGCTT CTCGTCGGCG TGGCCGCGCT GCTGTTCCTG GCGGGCACCG TCGCCGAGCG CGGCGGGCGC GCCGCCTCCG ACCTGATCAC CCCCCTGCGC GCCCCGCTCG GCCTCGCCGC GCTGGCCTTC CTCGCCTGGT GTCTCGTCTC GCTGGCCTGG AGCCCGTTTC CCGCGCTCTG GGGCCGTGTG CTGTCCGAAT TCCTGCCGAC GCTCGCCGCC GCGGCGATCC TCGCCCGGCT CGCGCCGGCC CGGCTGCCGC CCTGGGCGCT GCCCCTCGGC GCCGGCCTGC TCGCCGCAGC CTGCCTTTTC ATCGCGGCAA GCCTCGCCCT CGGGCTGGCG CCGCAGGCCT GGCTCGGGCA GCGCGTGGCC CTGTTCATGT TCAACCGCCC GCTGCTGACG GTGCTGCTGC TGGCCGGGCC CATCGCCGCC TTCCTCGCCC TGCGCGGCCA CCGCCTCGCC GCCGTGATCC TGCTCGGGGT GACGGCGCTG GCGATCCTGC GCTCGATCAG CGGCGCGGCC ATGCTCGGGC TGCTCGCGGG CGCTGTGATG TTCGCGGTCG GGCGCTTGGC GCCCCGATCC GTGGCCCTGG CGCTCGCGGC GCTGACCCTC GGGCTCGCCT TCGCCCTTGC CCCGGTCGAG GGCGACATCC TCCACCGGCT GATGCCGGAG GCGGCGCATG AGCGGCTGAC GCAGTCTTCG TCGCGGGCGC GGGTCGCCAT CGCCCAGAGC TTCGCCGCCG CGGTGGCGCA GGCGCCCTGG ATCGGCTCCG GCTACGGCAT GGGCCTGCGC TTCGCCGAGG TACCGGCGTC GCAAGCTCTC GAGCCGGAGA TGCGGGCGAT GCTGGCCGTC GGCCATCCGC ATAACAGCTT CCTCCAGATC TGGGCCGAAC TCGGCTTCGT CGGCGCGGCG CTCGCGGCTT TGGTCGCCTT CCTGGCCCTG CGGGCGGCGG CCGCCTTGCC GCGGCTCCTG TTCGCCACGG CGCTCGGCTT GCTGGGCGCG GCGGTGGCGG TGATGTTCGT CGAGCACGGC GCGTGGCAGG GATGGTGGAC GGCGGGCCTC GGGGCCGCCA TCACATGGCT GCGGGCGGCG GCTTGCGCCA AGCCCCCATA CGAGCCCGCA CACGAGAGTG AAGACGCGCG CGCATGA
Protein sequence | MMQKQTPRAS SWIRYPGRRS SAAARLYAAG AVALALVAPM MALANRSSPL LVGVAALLFL AGTVAERGGR AASDLITPLR APLGLAALAF LAWCLVSLAW SPFPALWGRV LSEFLPTLAA AAILARLAPA RLPPWALPLG AGLLAAACLF IAASLALGLA PQAWLGQRVA LFMFNRPLLT VLLLAGPIAA FLALRGHRLA AVILLGVTAL AILRSISGAA MLGLLAGAVM FAVGRLAPRS VALALAALTL GLAFALAPVE GDILHRLMPE AAHERLTQSS SRARVAIAQS FAAAVAQAPW IGSGYGMGLR FAEVPASQAL EPEMRAMLAV GHPHNSFLQI WAELGFVGAA LAALVAFLAL RAAAALPRLL FATALGLLGA AVAVMFVEHG AWQGWWTAGL GAAITWLRAA ACAKPPYEPA HESEDARA
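
The length fields in this record can be cross-checked against its coordinates. The sketch below is a minimal illustration, not part of the database. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Both assumptions agree with the values listed here. The function names (gene_length_bp, protein_length_aa, strip_blocks) are hypothetical helpers introduced for this example.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def strip_blocks(seq: str) -> str:
    """Remove the spaces that split a displayed sequence into 10-character blocks."""
    return "".join(seq.split())


# Values taken from the Gene Information table of this record.
length = gene_length_bp(2548985, 2550271)
print(length)                     # 1287
print(protein_length_aa(length))  # 428
```

The sequences in this record are displayed in blocks of 10 characters; strip_blocks joins them back into one contiguous string before any length check or further processing.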