Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2084 |
Symbol | mdoC |
ID | 6142973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2098356 |
End bp | 2099513 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641616960 |
Product | glucans biosynthesis protein |
Protein accession | YP_001744136 |
Protein GI | 170684209 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.72315 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.86966 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCCAG TACCCGCGCA ACGTGAATAT TTCCTCGACT CCATCCGCGC CTGGCTGATG TTGTTAGGGA TCCCTTTTCA TATTTCTTTA ATCTATTCGA GCCATACATG GCATGTGAAT AGCGCCGAAC CGTCATTATG GCTGACCCTT TTTAATGACT TCATCCACTC GTTCCGCATG CAGGTATTTT TCGTTATATC CGGGTACTTT TCCTACATGC TTTTTTTACG CTATCCCTTG AAAAAATGGT GGAAAGTACG TGTCAAACGT GTAGGGATCC CGATGTTAAC AGCCATCCCC CTACTGACAT TGCCGCAATT TATTATGCTG CAATACGTCA AAGGGAAAGC GGAAAGTTGG CCTGGACTGT CATTGTATGA CAAATATAAT ACGTTGGCCT GGGAATTAAT ATCACACCTG TGGTTTTTAC TGGTGTTAGT AGTCATGACG ACGCTGTGCG TATGGATATT TAAACGCATC AGAAATAATT TAGAAAATTC TGATAAAACG AATAAAAAAT TCTCGATGGT AAAACTATCG GTGATTTTTT TGTGCCTCGG CATCGGTTAT GCGGTAATAA GAAGAACGAT TTTTATTGTG TATCCGCCCA TTCTGAGTAA CGGCATGTTC AATTTTATTG TCATGCAAAC GCTATTTTAT TTGCCGTTCT TTATCCTCGG CGCACTGGCT TTCATTTTCC CTCATCTTAA AGCCTTGTTT ACCACGCCGT CTCGTGGCTG TACCCTTGCA GCAGCATTGG CATTTGTCGC TTACTTACTC AACCAGCGCT ATGGCAGTGG CGATGCCTGG ATGTACGAAA CCGAGTCTGT GATCACCATG GTCCTCGGTC TGTGGATGGT GAATGTGGTC TTCTCCTTCG GCCACCGTTT GCTTAACTTC CAGTCAGCGC GGGTGACTTA CTTTGTTAAT GCATCGCTGT TTATCTATCT GGTTCACCAC CCGTTAACGC TGTTTTTCGG CGCGTACATT ACACCGCACA TCACCTCCAA CTGGCTTGGT TTTCTCTGTG GCCTGATATT CGTAGTAGGG ATTGCGATAA TTCTGTATGA AATTCATTTA CGCATCCCGT TACTGAAGTT TTTGTTCTCT GGTAAACCGG TTGTTAAGCG TGAGAACGAT AAAGCACCAG CCCGTTAA
|
Protein sequence | MNPVPAQREY FLDSIRAWLM LLGIPFHISL IYSSHTWHVN SAEPSLWLTL FNDFIHSFRM QVFFVISGYF SYMLFLRYPL KKWWKVRVKR VGIPMLTAIP LLTLPQFIML QYVKGKAESW PGLSLYDKYN TLAWELISHL WFLLVLVVMT TLCVWIFKRI RNNLENSDKT NKKFSMVKLS VIFLCLGIGY AVIRRTIFIV YPPILSNGMF NFIVMQTLFY LPFFILGALA FIFPHLKALF TTPSRGCTLA AALAFVAYLL NQRYGSGDAW MYETESVITM VLGLWMVNVV FSFGHRLLNF QSARVTYFVN ASLFIYLVHH PLTLFFGAYI TPHITSNWLG FLCGLIFVVG IAIILYEIHL RIPLLKFLFS GKPVVKREND KAPAR
|
| |