Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A2223 |
Symbol | mdoG |
ID | 6873371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 2129046 |
End bp | 2130599 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642785325 |
Product | glucan biosynthesis protein G |
Protein accession | YP_002215988 |
Protein GI | 198243682 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3131] Periplasmic glucans biosynthesis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 0.966209 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACATA AACGACAAAT GATGAAAATG CGTTGGTTGG GCGCAGCTAT TATGTTAACG CTCTACGCAT CATCGAGCTG GGCGTTCAGT ATTGATGACG TGGCAAAACA AGCTCAATCT TTAGCCGGGA AAGGCTATGA GGCGCCTAAA AGCAACTTGC CCTCCGTTTT CCGCGACATG AAATATGCGG ATTATCAGCA GATCCAGTTT AACAGCGATA AAGCCTACTG GAACAACTTA AAGACCCCTT TTAAGCTCGA ATTTTACCAT CAGGGGATGT ACTTCGATAC GCCGGTCAAG ATTAACGAAG TGACGGCGAC GACGGTCAAA AGAATCAAAT ACAGCCCGGA TTACTTCAAT TTTGGCAATG TTCAGCACGA TAAAGACACG GTAAAAGATT TAGGCTTCGC CGGGTTCAAA GTTCTGTACC CCATCAACAG TAAAGATAAG AACGACGAAA TCGTCAGTAT GCTTGGCGCC AGCTATTTCC GCGTTATCGG CGCAGGCCAG GTATATGGCT TATCTGCGCG CGGCCTGGCG ATTGATACCG CCTTACCATC TGGTGAAGAG TTTCCCCGCT TTCGCGAGTT CTGGATTGAG CGTCCAAAAC CCACCGATAA TCGTTTGACC GTCTATGCAT TACTGGATTC TCCGCGCGCG ACCGGCGCTT ACCGTTTTGT GATCATTCCT GGCCGCGATA CCGTGGTGGA CGTGCAGTCA AAAGTCTATC TGCGCGATAA AGTGGGCAAG CTGGGCGTTG CGCCATTAAC CAGTATGTTC CTGTTTGGGC CAAACCAGCC GTCGCCGACG ACCAACTATC GTCCGGAACT GCATGACTCG AACGGCTTAT CCATTCATGC GGGTAATGGC GAGTGGATTT GGCGTCCGCT GAACAATCCA AAACACCTCG CTGTGAGCAG CTATGCAATG GAAAACCCTC AGGGATTCGG CCTGTTGCAG CGTGGTCGCG AGTTCTCGCG CTTTGAAGAT TTAGACGATC GCTACGACCT GCGTCCAAGC GCCTGGATTA CCCCGAAAGG CGACTGGGGC AAAGGTAAGG TTGAACTGGT TGAAATTCCG ACCAATGATG AAACCAACGA TAACATCGTC GCTTACTGGA CTCCGGATCA ACTGCCGGAA CCGGGTAAAG AGATGAACTT CAAGTACACT CTGACCTTCA GCCGCGATGA AGATAAACTT CATGCGCCGG ATAATGCCTG GGTGCTGCAA ACACGCCGCT CAACGGGCGA CGTTAAACAG TCGAATCTGA TTCGCCAGCC CGACGGCACT ATTGCCTTTG TGGTGGATTT CGTTGGCGCC GACATGAAAA AACTGCCGCC GGATACGCCT GTCGCTGCGC AAACCAGCAT TGGCGATAAC GGTGAAATCG TTGACAGTAA TGTACGCTAT AACCCAGTCA CTAAAGGCTG GCGTTTAATG CTGCGCGTGA AAGTCAAAGA CGCGAAGAAA ACCACGGAAA TGCGTGCCGC ATTGGTGAAT GCCGATCAGA CGCTAAGTGA AACCTGGAGC TACCAGTTAC CTGCCAATGA ATAA
|
Protein sequence | MKHKRQMMKM RWLGAAIMLT LYASSSWAFS IDDVAKQAQS LAGKGYEAPK SNLPSVFRDM KYADYQQIQF NSDKAYWNNL KTPFKLEFYH QGMYFDTPVK INEVTATTVK RIKYSPDYFN FGNVQHDKDT VKDLGFAGFK VLYPINSKDK NDEIVSMLGA SYFRVIGAGQ VYGLSARGLA IDTALPSGEE FPRFREFWIE RPKPTDNRLT VYALLDSPRA TGAYRFVIIP GRDTVVDVQS KVYLRDKVGK LGVAPLTSMF LFGPNQPSPT TNYRPELHDS NGLSIHAGNG EWIWRPLNNP KHLAVSSYAM ENPQGFGLLQ RGREFSRFED LDDRYDLRPS AWITPKGDWG KGKVELVEIP TNDETNDNIV AYWTPDQLPE PGKEMNFKYT LTFSRDEDKL HAPDNAWVLQ TRRSTGDVKQ SNLIRQPDGT IAFVVDFVGA DMKKLPPDTP VAAQTSIGDN GEIVDSNVRY NPVTKGWRLM LRVKVKDAKK TTEMRAALVN ADQTLSETWS YQLPANE
|
| |