Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1427 |
Symbol | mdoC |
ID | 6971015 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1410512 |
End bp | 1411669 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 643385401 |
Product | glucans biosynthesis protein |
Protein accession | YP_002269895 |
Protein GI | 209400526 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0273757 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.379852 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCCAG TACCCGCGCA ACGTGAATAT TTCCTCGACT CCATCCGCGC CTGGCTGATG TTGTTAGGGA TCCCTTTTCA TATTTCTTTA ATCTATTCGA GCCATACATG GCATGTGAAT AGCGCCGAAC CGTCATTGTG GCTGACCCTT TTTAATGACT TCATCCACTC GTTCCGCATG CAGGTATTTT TCGTTATATC CGGGTACTTT TCCTACATGC TTTTTTTACG CTATCCCTTG AAAAAATGGT GGAAAGTACG TGTCGAACGT GTAGGGATCC CGATGTTAAC AGCCATCCCC CTACTGACAT TGCCGCAATT TATTATGCTG CAATACGTCA AAGGGAAAGC GGAAAGTTGG CCTGGGCTGT CATTGTATGA CAAATATAAT ACGTTGGCCT GGGAATTAAT ATCACATCTG TGGTTTTTAC TGGTGTTAGT GGTCATGACG ACGCTGTGCG TATGGATATT TAAACGCATC AGAAATAATT TAGAAAATTC TGATAAAACG AATAAAAAAT TCTCGATGGT AAAACTATCG GTGATTTTTT TGTGCCTCGG CATCGGTTAT GCGGTAATAA GAAGAACGAT TTTTATTGTG TATCCACCCA TTCTGAGTAA TGGCACGTTC AATTTTATTG TCATGCAAAC GCTATTTTAT TTACCGTTCT TTATCCTCGG CGCACTGGCT TTCATTTTCC CTCATCTTAA AGCCTTGTTT ACCACGCCGT CTCGTGGCTG TACCCTCGCT GCAGCATTGG CGTTTGTCGC TTATTTACTC AACCAGCGCT ATGGCAGTGG CGATGCCTGG ATGTACGAAA CCGAGTCGGT GATCACCATG GTCCTCGGTC TGTGGATGGT GAATGTGGTC TTCTCCTTTG GCCACCGTTT GCTTAACTTC CAGTCAGCGC GGGTGACTTA TTTTGTTAAC GCATCGCTGT TTATCTATCT GGTTCACCAC CCGTTAACGC TGTTTTTCGG CGCATACATT ACACCGCACA TCACCTCCAA CTGGCTTGGT TTTCTCTGTG GCCTGATATT CGTAGTAGGG ATTGCGATAA TTCTGTATGA AATTCATTTG CGCATCCCGT TACTGAAGTT TTTGTTCTCT GGTAAACCGG TTGTTAAGCG TGAGAACGAT AAAGCACCAG CCCGTTAA
|
Protein sequence | MNPVPAQREY FLDSIRAWLM LLGIPFHISL IYSSHTWHVN SAEPSLWLTL FNDFIHSFRM QVFFVISGYF SYMLFLRYPL KKWWKVRVER VGIPMLTAIP LLTLPQFIML QYVKGKAESW PGLSLYDKYN TLAWELISHL WFLLVLVVMT TLCVWIFKRI RNNLENSDKT NKKFSMVKLS VIFLCLGIGY AVIRRTIFIV YPPILSNGTF NFIVMQTLFY LPFFILGALA FIFPHLKALF TTPSRGCTLA AALAFVAYLL NQRYGSGDAW MYETESVITM VLGLWMVNVV FSFGHRLLNF QSARVTYFVN ASLFIYLVHH PLTLFFGAYI TPHITSNWLG FLCGLIFVVG IAIILYEIHL RIPLLKFLFS GKPVVKREND KAPAR
|
| |