Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1750 |
Symbol | mdoD |
ID | 6143428 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1752415 |
End bp | 1754034 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616626 |
Product | glucan biosynthesis protein D |
Protein accession | YP_001743804 |
Protein GI | 170679744 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3131] Periplasmic glucans biosynthesis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 2.5781800000000004e-18 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCGCCG TGTGCGGTAC CAGCGGCATT GCTTCTCTTT TTTCTCAGGC GGCATTCGCG GCAGATTCTG ATATTGCCGA CGGGCAAACC CAGCGTTTTG ACTTCTCCAT TCTACAGTCA ATGGCGCACG ACTTAGCGCA AACAGCGTGG CGTGGTGCGC CGCGTCCGTT ACCTGACACT CTGGCGACAA TGACGCCGCA GGCTTATAAC AGTATTCAAT ACGACGCCGA AAAATCGCTC TGGCATAACG TTGAGAACCG TCAACTGGAC GCTCAGTTCT TCCATATGGG AATGGGATTC CGTCGCCGCG TTCGTATGTT TTCTGTAGAT CCCGCAACAC ATCTGGCGCG TGAAATTCAC TTTCGCCCGG AGTTGTTCAA ATACAACGAT GCGGGTGTTG ATACCAAACA ATTAGAAGGG CAAAGCGATC TCGGTTTTGC CGGTTTTCGC GTGTTTAAAG CCCCCGAACT GGCGCGCCGT GATGTCGTAT CATTCCTCGG TGCGAGTTAT TTCCGCGCCG TTGACGACAC ATATCAATAC GGTCTATCGG CTCGCGGCCT GGCGATCGAC ACTTACACCG ACAGTAAAGA AGAGTTCCCC GACTTTACCG CCTTCTGGTT TGATACGGTA AAACCGGGGG CAACCACCTT TACCGTTTAT GCGTTGCTCG ATAGCGCCAG CATTACTGGT GCCTATAAGT TCACTATCCA TTGCGAGAAA AGTCAGGTGA TTATGGATGT GGAAAATCAC CTGTATGCGC GCAAAGACAT TAAACAGCTG GGCATTGCGC CGATGACCAG TATGTTCAGC TGCGGTACTA ATGAACGTCG GATGTGCGAC ACCATTCATC CGCAAATCCA TGACTCTGAT CGTTTGTCCA TGTGGCGGGG CAACGGCGAG TGGATTTGTC GTCCGCTGAA CAATCCGCAA AAATTGCAGT TCAATGCTTA CACCGACAAC AACCCGAAAG GGTTTGGTTT ATTGCAACTG GATCGTGATT TCTCCCATTA TCAGGACATT ATGGGCTGGT ATAACAAACG CCCAAGTCTG TGGGTGGAAC CGCGTAACAA GTGGGGTAAG GGCACCATCG GCCTGATGGA AATCCCAACA ACGGGCGAAA CGCTGGATAA CATTGTCTGC TTCTGGCAGC CAGAAAAAGC TGTAAAAGCG GGTGATGAGT TTGCATTCCA GTATCGTCTG TACTGGAGTG CGCAACCGCC TGTTCATTGC CCATTAGCGC GCGTTATGGC GACGCGTACC GGCATGGGTG GTTTCCCGGA AGGTTGGGCT CCAGGTGAAC ACTATCCCGA AAAATGGGCG CGTCGTTTTG CCGTCGATTT CGTTGGTGGT GATCTGAAAG CTGCCGCGCC AAAAGGCATT GAGCCGGTGA TTACGCTTTC CAGTGGGGAA GCGAAGCAAA TCGAAATTCT CTATATTGAA CCCATTGATG GTTATCGTAT TCAGTTTGAC TGGTATCCGA CTTCGGACTC CACTGATCCG GTCGATATGC GGATGTATCT GCGTTGTCAG GGCGACGCTA TCAGTGAAAC ATGGCTGTAT CAGTATTTCC CGCCAGCGCC CGATAAACGT CAGTATGTTG ACGACCGCGT GATGAGTTAA
|
Protein sequence | MAAVCGTSGI ASLFSQAAFA ADSDIADGQT QRFDFSILQS MAHDLAQTAW RGAPRPLPDT LATMTPQAYN SIQYDAEKSL WHNVENRQLD AQFFHMGMGF RRRVRMFSVD PATHLAREIH FRPELFKYND AGVDTKQLEG QSDLGFAGFR VFKAPELARR DVVSFLGASY FRAVDDTYQY GLSARGLAID TYTDSKEEFP DFTAFWFDTV KPGATTFTVY ALLDSASITG AYKFTIHCEK SQVIMDVENH LYARKDIKQL GIAPMTSMFS CGTNERRMCD TIHPQIHDSD RLSMWRGNGE WICRPLNNPQ KLQFNAYTDN NPKGFGLLQL DRDFSHYQDI MGWYNKRPSL WVEPRNKWGK GTIGLMEIPT TGETLDNIVC FWQPEKAVKA GDEFAFQYRL YWSAQPPVHC PLARVMATRT GMGGFPEGWA PGEHYPEKWA RRFAVDFVGG DLKAAAPKGI EPVITLSSGE AKQIEILYIE PIDGYRIQFD WYPTSDSTDP VDMRMYLRCQ GDAISETWLY QYFPPAPDKR QYVDDRVMS
|
| |