Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_0831 |
Symbol | |
ID | 5832717 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 905500 |
End bp | 907575 |
Gene Length | 2076 bp |
Protein Length | 691 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641366613 |
Product | glycosyl transferase family protein |
Protein accession | YP_001638307 |
Protein GI | 163850264 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG0438] Glycosyltransferase [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.74925 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.236177 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAAA CCCCCGCCGT CCCGCTGCTC GACACCGTCG ATCCCGGCAC CGGTCGCCGC GCGGCGCTGG CCGAGGCCGC GCTGATTGTC GAGGCGATCC AGCGCAACGG CGTGCGCCTC GCCTTCGCGC CGCAGCCCGA TCCGGATGTC TCCATCGTCA TCGTCGCCCG CGATGCCCGC CACCTGCTGG CGCTCACCCT CTATCGCCTC TGCGCGAGCC AGGGGCTGGC AGGCGTCCGC TTCGAGGTCG TGCTGTTCGA CAACGCCTCC GCGCCGGAGA CCCGCGCCCT CTACCCCCAT CTCGACGGGG TGACGCTGAT CGAGAACGCC ACCAACACCG GCTTCGGGCC CGCCTGCAAC GCGGGCGCGG CCAGGGCGCG GGGGCGCTTC ATCCTGTTCC TCAACCCGGA TGTCGATCTC CTGCCCGGCG CGCTGGCGGC GATGGTCGCG ACCTTCCGCG ATCATGAAGG CGCTGGCATT GTCGGCGCCC GCCTCGTCTT CCCCGGCGGC GTGCTGCAGG AATCCGGCGC AGGCTTTCGC GACGACGCGC AACTCACCCA TCCGCACGGG CGCGGCAACG CCGACCCCTT CGCCCCCGAG CATGCCGCGA CCCGCGATGT CGGCTACGTC TCGGGCGCGG TGCTGATGAT CGAGCGGGCC CTGTTCGAGG CGCTCGGCGG CTTCGATCCG CTCTTCGCCC CGGCCTATTT CGAGGACACC GACCTCTGCC TGCGCTGCCA TCAGGCCGGG CGTCGCGTCA TCGTGCAGCC GCGCGCCACG GCGATCCATT ACGAGAACGC CACCAGCGCC CGCCGCGAGG ACGTGGAGGC GCTGCTCGAC CGCAACCGCG CCCGCTTCCT CGACCGGCAC CGGCAGAGCC TGTTCGCGCA GGGGCCGCAG CCGCGGGGAA CCGGCCTCCT CGACCACGAT CCCTGGCGCC TGAGGGTGCT CTACGTCGAT GATCGGGTGC CGCATCTCGA TCTCGGCGCC GGCCTGCCGC GGGCCAACGC CATCCTCAAC GCCATGGCGG GTCTCGGCTA CGCGGTGACG TTCTTCCCGA ACTACGAGGC GGATGCGGAG GAGGCGCGGC GCTACCGCGA CCTCGATGAG CGCATCGAGA TTTCTTACGC CAGCGGTGAC GAGGGGTTCG CCCGCCTGAT CGCCGAGCGG CGCGACCATT ACGACGTGCT CTGGGTCAGC CGGCCGCACA ACATCCTGTT CGTCACGCAG GCGCTGCACG CCGCCGGGCT CGACCCGCGC AGCTTCGTGC GCTCGAAGGT GATCTTCGAT TCCGAGGCCC TGTTCGCGCT GCGCGACTTC GTGACGGAGG CCGCGACCGC GGGCAGCGCG GTGGCCGCCG ATCTGGCGTG GCAGGCCGAG CGCGAGACGC GCCTGTTCGG GCTCGCCGAC GCGGTGGTCT GCGTCTCGCC GGCCGAGGCG CGGGTGCTCG CCCGCTACAG CGCCTGCAAC GCCACTGTGC TCGGCCACGC CCTGACCCGG CCCGAGGCGC CGACGCCGGG TTTTGCCGGC CGCGCGGGCT TCGTCTTCGT CGGCGCGCTC GCCCGCGAAG GGCAGCCCAA CGTCGATTCC CTCGACTGGT TTTTCGGGAG CGTCTGGCCG CTCGTGCGGG CGCGGTTGCC GGCGGCGCAG CTCACCATCG TGGGTGGGAT CGCGCCGGAA ATCCGGGAGC GCTACGCGCG CGAGCCGGGC GTGCAGGTCA CCGGTCGGGT GCCGCAGACC GAGCCCTATC TCGACGCCGC CCGCGTGTTC CTGGCGCCGA CGCGCTTCGC CGCCGGCATT CCGCACAAGG TCCATGAGGC GGTGGCGCGC GGCCTGCCCT GCGTCGTCAC GCCGATCCTC GCCGATCAGG TCGGCTGGGC GGACGGCGCC GGCTTCCTCG TGCGCGACTG GCGCAACCCA AAACCTTTCG CGGAGGCGCT GGTGGCGCTC CACGAGGACG CGGCCTTGTG GGACGCGGTT CGGGAGGAGG GGAGCCGGCA CATCGCCGAG GATTGCGACA CCGAGGCCTT CGCGGCCGCG ATCCGGGCCC TGTGCGAGGC GCAGGTCGTC GCATGA
|
Protein sequence | MSETPAVPLL DTVDPGTGRR AALAEAALIV EAIQRNGVRL AFAPQPDPDV SIVIVARDAR HLLALTLYRL CASQGLAGVR FEVVLFDNAS APETRALYPH LDGVTLIENA TNTGFGPACN AGAARARGRF ILFLNPDVDL LPGALAAMVA TFRDHEGAGI VGARLVFPGG VLQESGAGFR DDAQLTHPHG RGNADPFAPE HAATRDVGYV SGAVLMIERA LFEALGGFDP LFAPAYFEDT DLCLRCHQAG RRVIVQPRAT AIHYENATSA RREDVEALLD RNRARFLDRH RQSLFAQGPQ PRGTGLLDHD PWRLRVLYVD DRVPHLDLGA GLPRANAILN AMAGLGYAVT FFPNYEADAE EARRYRDLDE RIEISYASGD EGFARLIAER RDHYDVLWVS RPHNILFVTQ ALHAAGLDPR SFVRSKVIFD SEALFALRDF VTEAATAGSA VAADLAWQAE RETRLFGLAD AVVCVSPAEA RVLARYSACN ATVLGHALTR PEAPTPGFAG RAGFVFVGAL AREGQPNVDS LDWFFGSVWP LVRARLPAAQ LTIVGGIAPE IRERYAREPG VQVTGRVPQT EPYLDAARVF LAPTRFAAGI PHKVHEAVAR GLPCVVTPIL ADQVGWADGA GFLVRDWRNP KPFAEALVAL HEDAALWDAV REEGSRHIAE DCDTEAFAAA IRALCEAQVV A
|
| |