Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3674 |
Symbol | mdoG |
ID | 5833130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 4064087 |
End bp | 4065757 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641369467 |
Product | glucan biosynthesis protein G |
Protein accession | YP_001641123 |
Protein GI | 163853080 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3131] Periplasmic glucans biosynthesis protein |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTGA GAGCCGACGA GGGAATGACG CATCCGACCG ACCTTTCGAC CCGCAATGCG GGTCCGGCGC CGATCGACCG CCGCCGGCTC CTCGGCGGGG TCGCGGCCGG CGCCGCCCTG GCCCTGCTGC CCGGCCGGGT GGAGGCGCAG GCCGCGCCGC CGGCCCTGCC CGCGGCGGGC TCGCCCTTCT CGGATGCCAC CGTGCCCGAC CTCGCCCGGG CGCTCGGCAA CCGGCCCTTC GTGGCCCAGA CCGCGAACGA CGTCCCAGAC GCGCTCAAGA ACCTGCCGCG GGAAGCCTAC GAGGCGATCC GCATCCGGCC CGAGGCGCTG ATCTGGGGCG GCGAGCCGCA CGGTTTCGCG GTCGAGCCGC TGCCCCGCGG CTTCTACTTC ACCGACCGGG TCGCCCTGTT CCTCGTCGAG GACGGCGTGG TCCGCCCCGT CGTCTACGAC CGCACGCATT ACGATGCGGG CACGGAGGCG GGGGCGGCGG CTCTGCCGGA GGCCGGTCGC GAGCCGGGCT TTTCCGGCCT GCGCATCCGC GCACGCTTCG GCGAGCGGCA CCAGGATTTC GCGGTGTTCC AGGGCGCCTG CTTCTACCGG CTGGTAGGAC AGGGCCAGGA ATTCGGCGTC GATGGGCGCG CGCTGATGCT GCGCCCGGCC GATCCGCGCG GCGAGGAATT CCCCCGCTGG CGCGCCCTGT TCGTGGAGCG CCCGAAGACG CCCGACGGCC CGCTGGTGAT CCACGCCCTG CTCGATTCCG ATTCGCTTGC CGCCGCCCTG CGCCTGGAAC TGCGCCCCGG CGAGACCTCG ACCGCGGCGA TCACCGCCAC CCTGGTCACC CGCAAGGCCG TCGATCACCT CGGGCTCAGC GGCATGCAGG CACCGTTCCT GTTCGGCCCG CACGACCGGC GCAGCGCGGA CGATGCCCGG GCCGCGGTCT ATGCCGCGGG CGGCCTGCAG ATCCGCAATG GCAGCGGGGA GGCGATCTGG CGCCCGGTGC GCAACCCGGA GACGCTCCAG ATCTCGGGCT TCCTCGACAA CCGCCCGCAG GGCTTCGGCC TGACGCAACG GGACCGGTCC TTCACAACCT TCGAGGATGA CGGCCGCCAC TGGGAGCGCT GCCCCTCGCT CTGGGTCGAG CCGGGCGAGG CCACCGGCGC CGCCGAGGCA GGCGCCGAGA GCCTGTGGGG GGAGGGCGCC GTGACGCTCC TCGAAATCCC GAGCGATTCG GAAGTCAACG AGAACGTCAT CGCCTATTGG CGGCCGAAGG CGGCCCTGCC CGCGGGCCAG GAGGTCCGTC TCGCCTACCG GCAGAACTGG GGCCGCGAGC CAGCCTCGGC ATCACCGCAG GGCAGCCCGC TCGCGCGGGT CACGAGCACC CGCAGCGGCC GCGGCACGGC CAATCCCCGG CGCCTGTTCC TCGTCGACTT TACCGGCGAC GGGCTGTTCA CCGCCGAGGG CGCCCTCGTG CCCGTCGAGA CCGTGCTGAT CGCCGGCCCC GGCCGGATCG TCGAGGGCGC GACCCGCTGG ATCGCCCATC CGGAGACCCG TACCGTCCGC GTCGCCTTCG AGCTCGATCC GGGCAGCGAG CGGGCCTGCG AGTTGCGCCT TGCCCTCAAG ACGGAAGGGC GACAGATCAC GGAAACGTGG CTGTACCGCT GGACGCCGTA A
|
Protein sequence | MTVRADEGMT HPTDLSTRNA GPAPIDRRRL LGGVAAGAAL ALLPGRVEAQ AAPPALPAAG SPFSDATVPD LARALGNRPF VAQTANDVPD ALKNLPREAY EAIRIRPEAL IWGGEPHGFA VEPLPRGFYF TDRVALFLVE DGVVRPVVYD RTHYDAGTEA GAAALPEAGR EPGFSGLRIR ARFGERHQDF AVFQGACFYR LVGQGQEFGV DGRALMLRPA DPRGEEFPRW RALFVERPKT PDGPLVIHAL LDSDSLAAAL RLELRPGETS TAAITATLVT RKAVDHLGLS GMQAPFLFGP HDRRSADDAR AAVYAAGGLQ IRNGSGEAIW RPVRNPETLQ ISGFLDNRPQ GFGLTQRDRS FTTFEDDGRH WERCPSLWVE PGEATGAAEA GAESLWGEGA VTLLEIPSDS EVNENVIAYW RPKAALPAGQ EVRLAYRQNW GREPASASPQ GSPLARVTST RSGRGTANPR RLFLVDFTGD GLFTAEGALV PVETVLIAGP GRIVEGATRW IAHPETRTVR VAFELDPGSE RACELRLALK TEGRQITETW LYRWTP
|
| |