Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_2538 |
Symbol | |
ID | 5833222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2850255 |
End bp | 2851562 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641368339 |
Product | capsule polysaccharide biosynthesis protein |
Protein accession | YP_001640003 |
Protein GI | 163851960 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3562] Capsule polysaccharide export protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.157988 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCAGC CTCGTGCCCA TGCCATAGGC CCCCGCGCCG CCACCTTCCT GTTCCTGCAG GGGATCGCCT CCCCGTTCTT CTCGGTCCTC GGCAGCGCCC TGCGCGAGCG CGGCCACGGG GTTCGCCGCA TCAATTTCTC GGCCGGCGAC TGGTTGTTCT GGCCCTTTCC CGCCGACCAC TACCGCGGCA AGCGCGACGG CTGGGACGCC TATCTCGAAG CCTATATCCG CACGCATGGC GTCACCGACA TCGTGCTGTT CGGCGATTGC CGGCCCTACC ATCAGGCGGC GGTGCTCGTC GCCAAGCCGA TGGGCGTGCG CATCCACGTG TTCGAGGAGG GCTACATCCG GCCCTATTGG ATCACGTGCG AGGCGGGCGG GGTGAACGGC AACTCGACCC TGCCGAAGCG GGCGGAGGAG ATCCGGGAAC TGGCGCGCAA GCTGCCGCAG CCCGGACGGG CCATGCCGCT CACCGGAGAT ATGGGCCGGC GCAGCCTGTG GGATATCAGC TTCAACATCG CCAATATCGG CTTTCCGTAC CTTTATCCGG GTTTCCGCAC CCACCGGCCG AACCACATCG CCGCCGAATA TGCCGGCTGG ATCCGGAAGT TCGTGCGCCG CCGCCGCACC CGCCGCGAGG CGGCGCGGGT GAACGAGATC TACCACGCGA TCAACGCCGA CTACTTCCTG CTGCCGCTCC AGCTCGACAG CGACTACCAG ATCCGCGTCC ACTCGCCGTT CCTCGGCGTC GAGGGCTTCA TGGACCGGGT GATCGCCTCC TTCGCCAAGC ATTCGCAGGC GCCGACGCGG CTGCTGGTGA AGCTGCACCC GCTCGACAGC GGCATCATGA ACTGGCGCAA GCGCGCCCGC CAATCGGCCA AGCGCCACGG CTGCAACGAC CGGCTCGACT TCATCGACGG CGGCGACCTG CCCAAGCTCA TCGACGGCAG CCGCGGCGTC GTGCTGGTGA ACTCCACCGT CGGGATGCTC GCCCTTGAGC GCGGGCGGCC GACGCTGGCC TTCGGCTCTG CGGTCTACAA CATGCCGGGC CTGACCCATC AGGGCGACAT CGACACGTTC TGGGGGGCCC CGCAGGCACC CGACGCGGCA CTGATGCAGG ATTTCTTCCG GGTCGTCATG CACCGCACCC AGATCAACGG TGGCTACTTC TCCCGCTCGG CGATCGAGCG GGCGGTGGCC GGCGCCGTGC CGCGTCTCGA GGCGGCACTT CCGCCTGCCG CGCTGGCTGC CGCCCGCGAC ACGCTGGAGC AGGCCGGCCG CGACGGAAAC CTCTCCCCCG CCTATTGA
|
Protein sequence | MTQPRAHAIG PRAATFLFLQ GIASPFFSVL GSALRERGHG VRRINFSAGD WLFWPFPADH YRGKRDGWDA YLEAYIRTHG VTDIVLFGDC RPYHQAAVLV AKPMGVRIHV FEEGYIRPYW ITCEAGGVNG NSTLPKRAEE IRELARKLPQ PGRAMPLTGD MGRRSLWDIS FNIANIGFPY LYPGFRTHRP NHIAAEYAGW IRKFVRRRRT RREAARVNEI YHAINADYFL LPLQLDSDYQ IRVHSPFLGV EGFMDRVIAS FAKHSQAPTR LLVKLHPLDS GIMNWRKRAR QSAKRHGCND RLDFIDGGDL PKLIDGSRGV VLVNSTVGML ALERGRPTLA FGSAVYNMPG LTHQGDIDTF WGAPQAPDAA LMQDFFRVVM HRTQINGGYF SRSAIERAVA GAVPRLEAAL PPAALAAARD TLEQAGRDGN LSPAY
|
| |