Gene Information |
Locus tag | ECH74115_2029 |
Symbol | mdoD |
ID | 6967079 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1928991 |
End bp | 1930610 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643385945 |
Product | glucan biosynthesis protein D |
Protein accession | YP_002270434 |
Protein GI | 209396923 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3131] Periplasmic glucans biosynthesis protein |
TIGRFAM ID | |
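The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1928991, 1930610)
print(length)                     # 1620
print(protein_length_aa(length))  # 539
```

Both results match the Gene Length and Protein Length rows of the record.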
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.511288 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0000000000014294 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCGCCG TGTGCGGTAC CAGCGGCATT GCTTCTCTTT TTTCTCAGGC GGCATTCGCG GCAGATTCTG ATATTGCCGA CGGGCAAACC CAGCGTTTTG ACTTCTCCAT TCTACAGTCA ATGGCGCACG ACTTAGCGCA AACAGCGTGG CGTGGTGCGC CTCGTCCGTT ACCTGACACG CTGGCGACAA TGACGCCGCA GGCTTATAAC AGTATTCAAT ACGACGCCGA AAAATCGCTC TGGCATAACG TTGAGAACCG TCAACTGGAC GCTCAGTTCT TCCATATGGG AATGGGATTC CGTCGCCGCG TTCGTATGTT TTCTGTAGAT CCAGCAACAC ATCTGGCGCG TGAAATTCAC TTTCGCCCGG AGTTGTTCAA ATACAACGAT GCAGGTGTTG ATACCAAACA ATTAGAAGGG CAAAGCGATC TCGGCTTTGC CGGTTTTCGC GTGTTTAAAG CCCCCGAACT GGCGCGCCGT GATGTAGTAT CATTCCTCGG AGCGAGTTAT TTCCGCGCCG TTGATGACAC ATATCAATAC GGTTTGTCGG CCCGCGGCCT GGCGATTGAC ACTTACACCG ACAGTAAAGA AGAGTTCCCC GACTTTACCG CCTTCTGGTT TGATACGGTA AAACCGGGGG CAACTACCTT TACCGTTTAT GCGTTGCTCG ATAGCGCCAG CATTACTGGT GCCTATAAGT TCACTATCCA TTGTGAGAAA AGTCAGGTGA TTATGGATGT GGAAAATCAC CTGTATGCGC GCAAAGACAT TAAACAGCTG GGCATTGCGC CGATGACCAG TATGTTCAGC TGCGGTACTA ATGAACGTCG GATGTGCGAC ACCATTCATC CGCAAATTCA TGACTCTGAT CGTCTGTCCA TGTGGCGGGG CAACGGCGAG TGGATTTGCC GTCCGCTGAA TAATCCGCAA AAATTGCAGT TCAATGCTTA CACCGACAAC AACCCGAAAG GGTTTGGTTT ATTGCAACTG GATCGTGATT TCTCCCATTA TCAGGACATT ATGGGCTGGT ATAACAAACG CCCAAGTCTG TGGGTGGAAC CGCGTAACAA GTGGGGTAAG GGCACCATCG GCCTGATGGA AATCCCAACA ACGGGCGAAA CGCTGGATAA CATTGTCTGC TTCTGGCAGC CAGAAAAAGC TGTAAAGGCG GGTGATGAGT TTGCATTCCA GTATCGTCTG TACTGGAGTG CGCAACCGCC TGTTCATTGC CCATTAGCGC GCGTTATGGC GACGCGTACC GGCATGGGCG GTTTCCCGGA AGGTTGGGCA CCAGGTGAAC ACTATCCCGA AAAATGGGCG CGTCGTTTTG CCGTCGATTT CGTTGGTGGT GATCTGAAAG CTGCCGCACC AAAAGGCATT GAGCCGGTGA TTACGCTTTC CAGTGGGGAA GCGAAGCAAA TCGAAATTCT CTATATTGAA CCCATTGATG GTTATCGTAT TCAGTTTGAC TGGTATCCGA CTTCGGACTC CACTGATCCG GTCGATATGC GGATGTATCT GCGTTGTCAG GGCGACGCTA TCAGTGAAAC ATGGCTGTAT CAGTATTTCC CGCCAGCGCC GGATAAACGT CAGTATGTTG ACGACCGCGT GATGAGTTAA
|
Protein sequence | MAAVCGTSGI ASLFSQAAFA ADSDIADGQT QRFDFSILQS MAHDLAQTAW RGAPRPLPDT LATMTPQAYN SIQYDAEKSL WHNVENRQLD AQFFHMGMGF RRRVRMFSVD PATHLAREIH FRPELFKYND AGVDTKQLEG QSDLGFAGFR VFKAPELARR DVVSFLGASY FRAVDDTYQY GLSARGLAID TYTDSKEEFP DFTAFWFDTV KPGATTFTVY ALLDSASITG AYKFTIHCEK SQVIMDVENH LYARKDIKQL GIAPMTSMFS CGTNERRMCD TIHPQIHDSD RLSMWRGNGE WICRPLNNPQ KLQFNAYTDN NPKGFGLLQL DRDFSHYQDI MGWYNKRPSL WVEPRNKWGK GTIGLMEIPT TGETLDNIVC FWQPEKAVKA GDEFAFQYRL YWSAQPPVHC PLARVMATRT GMGGFPEGWA PGEHYPEKWA RRFAVDFVGG DLKAAAPKGI EPVITLSSGE AKQIEILYIE PIDGYRIQFD WYPTSDSTDP VDMRMYLRCQ GDAISETWLY QYFPPAPDKR QYVDDRVMS
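The sequences above are displayed in space-separated blocks of ten characters, so the spaces need to be stripped before any analysis. The following is a minimal Python sketch of that clean-up together with a GC-content calculation; the helper names are hypothetical, and the example input is only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(field: str) -> str:
    """Remove the whitespace that splits a sequence into 10-character blocks."""
    return "".join(field.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First two blocks of the gene sequence shown above (not the full gene).
example = "ATGGCCGCCG TGTGCGGTAC"
seq = clean_sequence(example)
print(len(seq))         # 20
print(gc_percent(seq))  # 70.0
```

Applied to the full gene sequence field, the same helpers would give a value to compare against the GC content row of the record.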