Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1606 |
Symbol | mic |
ID | 6144397 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1595265 |
End bp | 1596485 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616483 |
Product | transcriptional regulator Mic |
Protein accession | YP_001743661 |
Protein GI | 170679760 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTTGCTG AAAACCAGCC TGGGCACATT GATCAAATAA AGCAGACCAA CGCGGGCGCG GTATATCGCC TGATTGATCA GCTTGGTCCA GTCTCGCGTA TCGATCTTTC CCGTCTGGCG CAACTGGCTC CTGCCAGTAT CACTAAAATT GTCCGTGAGA TGCTCGAAGC ACACCTGGTG CAAGAGCTGG AAATCAAAGA AGCGGGGAAC CGTGGCCGTC CGGCGGTGGG GCTGGTGGTT GAAACTGAAG CCTGGCACTA TCTTTCTCTG CGCATTAGTC GCGGGGAGAT TTTCCTTGCT CTGCGCGATC TGAGCAGCAA ACTGGTGGTG GAGGAGGCGC AGGAACTGGC GTTAAAAGAT GACTCACCAT TGCTGGATCG TATCATTTCC CATATCGATC AGTTTTTTAT CCGCCACCAG AAAAAACTTG AGCGTCTAAC TTCGATTGCC ATAACCTTGC CGGGAATTAT TGATACGGAA AATGGCATTG TACATCGCAT GCCGTTCTAC GAGGATGTAA AAGAGATGCC GCTCGGCGAG GCGCTGGAGC AGCATACCGG CGTACCGGTT TATATCCAGC ATGATATCAG CGCATGGACG ATGGCAGAGG CCTTGTTTGG TGCCTCACGC GGGGCGCGCG ATGTGATTCA GGTGGTTATC GATCACAACG TGGGGGCGGG CGTCATTACC GATGGTCATC TGCTACACGC CGGCAGCAGT AGCCTCGTGG AAATAGGTCA CACGCAGGTC GACCCGTATG GGAAACGCTG TTATTGCGGG AATCACGGCT GCCTCGAAAC CATCGCCAGT GTGGACAGTA TTCTTGAGCT GGCACAGCTG CGTCTCAATC AATCCATGAG CTCGATGTTA CATGGACAGC CGTTAACCGT GGACTCATTG TGTCAGGCGG CATTGCGCGG CGATCTACTG GCAAAAGACA TCATTACCGG GGTGGGCGCG CATGTCGGGC GCATTCTTGC CATCATGGTG AATTTATTTA ACCCACAAAA AATACTGATT GGCTCACCGT TAAGTAAAGC GGCAGATATC CTCTTCCCGG TCATCTCGGA CAGCATCCGT CAGCAGGCCC TTCCTGCGTA TAGTCAGCAC ATTAGCGTTG AGAGTACTCA ATTTTCTAAC CAGGGTACGA TGGCAGGGGC TGCGCTAGTA AAAGACGCGA TGTATAACGG TTCTTTGTTG ATTCGTCTGT TGCAGGGTTA A
|
Protein sequence | MVAENQPGHI DQIKQTNAGA VYRLIDQLGP VSRIDLSRLA QLAPASITKI VREMLEAHLV QELEIKEAGN RGRPAVGLVV ETEAWHYLSL RISRGEIFLA LRDLSSKLVV EEAQELALKD DSPLLDRIIS HIDQFFIRHQ KKLERLTSIA ITLPGIIDTE NGIVHRMPFY EDVKEMPLGE ALEQHTGVPV YIQHDISAWT MAEALFGASR GARDVIQVVI DHNVGAGVIT DGHLLHAGSS SLVEIGHTQV DPYGKRCYCG NHGCLETIAS VDSILELAQL RLNQSMSSML HGQPLTVDSL CQAALRGDLL AKDIITGVGA HVGRILAIMV NLFNPQKILI GSPLSKAADI LFPVISDSIR QQALPAYSQH ISVESTQFSN QGTMAGAALV KDAMYNGSLL IRLLQG
|
| |