Gene Information |
Locus tag | Mchl_3969 |
Symbol | mdoG |
ID | 7117167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | + |
Start bp | 4173719 |
End bp | 4175365 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643526697 |
Product | glucan biosynthesis protein G |
Protein accession | YP_002422707 |
Protein GI | 218531891 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3131] Periplasmic glucans biosynthesis protein |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
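The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminating stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int, includes_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    # The stop codon occupies one codon but encodes no residue.
    return codons - 1 if includes_stop_codon else codons


# Values copied from the record for Mchl_3969.
length = gene_length_bp(4173719, 4175365)
protein = expected_protein_length_aa(length)
print(length, protein)  # compare with the "Gene Length" and "Protein Length" rows
```

Under these assumptions the computed values agree with the 1647 bp and 548 aa listed in the record.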
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.309452 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCATC CGACCGACCT TTCGACCCGC GATGCGGGTC CGGCGCCGAT CGACCGCCGC CGGCTCCTCG GCGGGGTCGC GGCCGGCGCC GCCCTGGCCC TGCTGCCCGG CCGGGTGGAG GCGCAGGCCG CGCCGCCGGC CCTGCCCGCG GCGGGCTCGC CCTTCTCGGA TGCCACCGTG CCCGACCTCG CCCGGGCGCT CGGCAACAGG CCCTTCGTGG CCCAAACCGC GAACGACGTC CCCGACGCGC TCAAGAACCT GCCGCGGGAA GCCTACGAGG CGATCCGCAT CCGGCCCGAG GCGCTGATCT GGGGCGGCGA GCCGCACGGT TTCGCGGTCG AGCCGCTGCC CCGCGGCTTC TACTTCACTG ACCGGGTCGC CCTGTTCCTC GTCGAGGACG GCGTGGTCCG CCCCGTCGTC TATGACCGCA CGCATTACGA TGCGGGCACG GAGGCGGGGG CGGCGGCCCT GCCGGAATCC GGGCGCGAGC CGGGCTTTTC CGGCCTGCGC ATCCGCGCGC GCTTCGGCGA ACGGCACCAG GATTTCGCGG TGTTCCAGGG CGCCTGCTTC TACCGGCTGG TGGGGCAGGG CCAGGAATTC GGCGTCGATG GGCGCGCGCT GATGCTGCGC CCGGCCGACC CGCGCGGCGA AGAATTCCCC CGCTGGCGCG CCCTGTTCGT GGAGCGCCCG AAGACGCCCG AAGGCCCGCT GGTGATCCAC GCCCTGCTCG ATTCCGATTC GCTCGCCGCC GCCCTGCGCC TGGAACTGCG CCCCGGCGAG ACCTCGACCG CGGCGATCAC CGCCACCCTG GTCACCCGCA AGGCCGTCGA TCACCTCGGG CTCAGCGGCA TGCAGGCACC GTTCCTGTTC GGCCCACACG ACCGGCGCGG CGCGGACGAT GCCCGGGCCG CGGTCTACGC CGCGGGCGGC CTGCAGATCC GCAATGGCGG CGGGGAGGCG ATCTGGCGCC CGGTGCGCAA CCCGGAGACG CTCCAGATCT CGGGCTTCCT CGACAACCGC CCGCAGGGCT TCGGCCTGAC GCAACGGGAC CGTTCGTTCA CGACCTTCGA GGATGACGGC CGCCACTGGG AGCGCTGCCC CTCGCTCTGG GTCGAGCCGG GCGAGCCCGC CGGCGCCGCC GAGGCAGGCG CCGAGGGCCT GTGGGGGGAG GGCGCCGTGA CGCTCCTCGA GATCCCGAGC GATTCGGAAG TCAACGAGAA CGTCATCGCC TATTGGCGGC CGAAGGCGGC CCTGCCCGCG GGCCAGGAGG TCCGCATCGC CTACCGGCAG AACTGGGGCC GCGAGCCAGC CTCGGCATCG CCGCAGGGCA GCCCGCTCGC GCGGGTCACA AGCACGCGCA GCGGCCGCGG CACCGCCAAT CCCCGGCGCC TGTTCCTCGT CGACTTTACC GGCGACGGGC TGTTCACCGC TGAGGGCGCC CTCGTGCCCG TCGAGACCGT GCTGATCGCC GGCCCCGGCC GGATCGTCGA GGGCGCGACC CGCTGGATCG CCCACCCGGA GACCCGCACC GTCCGCGTCG CCTTCGAGCT CGATCCGGGC AGCGAGCGGG CCTGCGAGTT GCGCCTTGCC CTCAAGACGG AAGGGCGACA GATCACGGAA ACGTGGCTGT ACCGCTGGAC GCCGTAA
|
Protein sequence | MTHPTDLSTR DAGPAPIDRR RLLGGVAAGA ALALLPGRVE AQAAPPALPA AGSPFSDATV PDLARALGNR PFVAQTANDV PDALKNLPRE AYEAIRIRPE ALIWGGEPHG FAVEPLPRGF YFTDRVALFL VEDGVVRPVV YDRTHYDAGT EAGAAALPES GREPGFSGLR IRARFGERHQ DFAVFQGACF YRLVGQGQEF GVDGRALMLR PADPRGEEFP RWRALFVERP KTPEGPLVIH ALLDSDSLAA ALRLELRPGE TSTAAITATL VTRKAVDHLG LSGMQAPFLF GPHDRRGADD ARAAVYAAGG LQIRNGGGEA IWRPVRNPET LQISGFLDNR PQGFGLTQRD RSFTTFEDDG RHWERCPSLW VEPGEPAGAA EAGAEGLWGE GAVTLLEIPS DSEVNENVIA YWRPKAALPA GQEVRIAYRQ NWGREPASAS PQGSPLARVT STRSGRGTAN PRRLFLVDFT GDGLFTAEGA LVPVETVLIA GPGRIVEGAT RWIAHPETRT VRVAFELDPG SERACELRLA LKTEGRQITE TWLYRWTP
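The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below is illustrative only: the helper names are hypothetical, and the stop codon set is the standard TAA/TAG/TGA used by translation table 11. It joins the blocks, computes the GC fraction, and checks for a terminal stop codon. Only the first three blocks of the gene sequence are used here; pasting in the full "Gene sequence" value would allow comparison with the reported GC content.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove all whitespace between display blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def ends_with_stop_codon(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# First three display blocks of the gene sequence (not the full gene).
prefix = clean_sequence("ATGACGCATC CGACCGACCT TTCGACCCGC")
print(len(prefix), round(gc_fraction(prefix), 3))

# Last display block of the gene sequence.
print(ends_with_stop_codon(clean_sequence("GCCGTAA")))
```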