Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_4621 |
Symbol | |
ID | 5833691 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 5166206 |
End bp | 5167081 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641370415 |
Product | WecB/TagA/CpsF family glycosyl transferase |
Protein accession | YP_001642060 |
Protein GI | 163854017 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1922] Teichoic acid biosynthesis proteins |
TIGRFAM ID | [TIGR00696] bacterial polymer biosynthesis proteins, WecB/TagA/CpsF family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.00251676 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTGAGA TCGAGGGAAG CCTGCCCATG GATGCCGTGG CCATCCCGGC CTCCGGGCCC TACGCGCTCC GGATCAACCG GAACGATCCC TACCTCGTCC AGGCGGACGG GCCGGCGGCG CATCGTCTCC CGGCGACCCG CATCCTCGGC GTGCCGGTGA GCGTCATCGA CATGCCGTTC GCTGTCCGCA CCATCATCGG CTGGGCGCGG GCGCGGCGCA CCAGCATGAT CTGCGTGCGC GACGTGCACG GCATCATGTG CGCCCAGGAC CAGCCCGACC TGAGGGCCGC GCACGAGCGG GCCAGCATGA TCACCCCGGA CGGCGCGCCG CTCACGATCA TCAGCCGGTT CTTCCGCGCG CACGGCACCG GCCGCGTGCC GGGCCCCTCG CTGATGGAGG AGATGTTCGC GGCCACGCAG AACACCGGCA TCCGCCACTA TCTCTACGGC GGCAAGGAGG GCGTGGCCGA GCAGGTTGCG GCCAATTTCG CCCGCAAATA TCCCGGCACC GAGGTCTGCG GCCTCGCTTG CCCGCCCTTC GGCGCCGTCG CGCCCGATCT CGACGCGCGG CTCACCGATG CGATCAAGGC GGCCGAGCCG CACATCGTCT GGGTCGGCAT GTCGACGCCC AAACAGGACA TCTGGATGAA CGATCACCTC GATCGTTTGC CGGGCATGGT GCTGATCGGT GTCGGCGCCG CCTTCGATTT CCATTCCGGG GCGGTCAAAC GCGCGCCGAA ATTCATGCAG GTGCTCGCCC TCGAATGGCT GCACCGACTC CTGAGCGAGC CCCGGCGGCT GTGGCGGCGC TATCTCGTCA TGGCGCCCGT CTTCCTGTGG AAGATCGCGC GCCAGCCCGA GCCGCGCCGG TCGTAA
|
Protein sequence | MAEIEGSLPM DAVAIPASGP YALRINRNDP YLVQADGPAA HRLPATRILG VPVSVIDMPF AVRTIIGWAR ARRTSMICVR DVHGIMCAQD QPDLRAAHER ASMITPDGAP LTIISRFFRA HGTGRVPGPS LMEEMFAATQ NTGIRHYLYG GKEGVAEQVA ANFARKYPGT EVCGLACPPF GAVAPDLDAR LTDAIKAAEP HIVWVGMSTP KQDIWMNDHL DRLPGMVLIG VGAAFDFHSG AVKRAPKFMQ VLALEWLHRL LSEPRRLWRR YLVMAPVFLW KIARQPEPRR S
|
| |