Gene Information |
Locus tag | SbBS512_E1780 |
Symbol | mic |
ID | 6271873 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 1623453 |
End bp | 1624673 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641725851 |
Product | transcriptional regulator Mic |
Protein accession | YP_001880349 |
Protein GI | 187732834 |
COG category | [G] Carbohydrate transport and metabolism; [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
|
|
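The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length = gene_length_bp(1623453, 1624673)
print(length)                     # 1221
print(protein_length_aa(length))  # 406
```

Both results agree with the Gene Length (1221 bp) and Protein Length (406 aa) rows.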
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 0.905565 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTTGCTG AAAACCAGCC TGGGCACATT GATCAAATAA AGCAGACCAA CGCGGGCGCG GTTTATCGCC TGATTGATCA GCTTGGTCCA GTCTCGCGTA TCGATCTTTC CCGTCTGGCG CAGCTGGCTC CTGCCAGTAT CACTAAAATT GTCCGTGAGA TGCTCGAAGC ACACTTGGTG CAAGAGCTGG AAATCAAAGA AGCGGGGAAC CGTGGCCGTC CGGCGGTGGG GCTGGTGGTT GAAACTGAAG CCTGGCACTA TCTTTCTCTG CGCATTAGTC GCGGGGAGAT TTTCCTTGCT CTGCGCGATC TGAGCAGCAA ACTGGTGGTG GAAGAGTCGC AGGAACTGGC GTTAAAAGAT GACTTGCCAT TGCTGGATCG TATCATTTCC CATATCGATC AGTTTTTTAT CCGCCACCAG AAAAAACTTG AGCGTCTAAC TTCGATTGCC ATAACCCTGC CGGGAATTAT TGATACGGAA AATGGTATTG TACATCGCAT GCCGTTCTAC GAGGATGTAA AAGAGATGCC GCTCGGCGAG GCGCTGGAGC AGCATACCGG CGTACCGGTT TATATTCAGC ATGATATCAG CGCATGGACG ATGGCAGAGG CCTTGTTTGG TGCCTCACGC GGGGCGCGCG ATGTGATTCA GGTGGTTATC GATCACAACG TGGGGGCGGG CGTCATTACC GATGGTCATC TGCTACACGC AGGCAGCAGT AGTCTCGTGG AAATAGGCCA CACACAGGTC GACCCATATG GTAAACGCTG TTATTGCGGG AACCACGGCT GCCTCGAAAC CATCGCCAGT GTGGACAGTA TTCTTGAGCT GGCACAGCTG CGTCTCAATC AATCCATGAG CTCGATGTTA CATGGGCAAC CGTTAACCGT GGACTCATTG TGTCAGGCGG CATTGCGCGG CGATCTACTG GCAAAAGACA TCATTACCGG GGTGGGCGCG CATGTCGGGC GCATTCTTGC CATCATGGTG AATTTATTTA ACCCACAAAA AATACTGATT GGCTCACCGT TAAGTAAAGC GGCAGATATC CTCTTCCCTG TCATCTCGGA CAGCATACGT CAGCAGGCCC TTCCTACGTA TAGTCAGCAC ATTAGCGTTG AGAGTACTCA GTTTTCTAAC CAGGGCACGA TGGCAGGCGC TGCGCTGGTA AAAGACGCGA TGTATAACGG TTCTTTGTTG ATTCGTCTGT TGCAGGGTTA A
|
Protein sequence | MVAENQPGHI DQIKQTNAGA VYRLIDQLGP VSRIDLSRLA QLAPASITKI VREMLEAHLV QELEIKEAGN RGRPAVGLVV ETEAWHYLSL RISRGEIFLA LRDLSSKLVV EESQELALKD DLPLLDRIIS HIDQFFIRHQ KKLERLTSIA ITLPGIIDTE NGIVHRMPFY EDVKEMPLGE ALEQHTGVPV YIQHDISAWT MAEALFGASR GARDVIQVVI DHNVGAGVIT DGHLLHAGSS SLVEIGHTQV DPYGKRCYCG NHGCLETIAS VDSILELAQL RLNQSMSSML HGQPLTVDSL CQAALRGDLL AKDIITGVGA HVGRILAIMV NLFNPQKILI GSPLSKAADI LFPVISDSIR QQALPTYSQH ISVESTQFSN QGTMAGAALV KDAMYNGSLL IRLLQG
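The relationship between the two sequence rows can be illustrated with a short Python sketch. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with GTG and ends with TAA even though the gene lies on the minus strand), that spaces in the record are display grouping only, and that an initiator codon in the first position is read as methionine, as is usual for bacterial CDS under translation table 11. The helper names and the three-codon start set are illustrative assumptions, not the database's own method.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; table 11 uses the same
# assignments and differs mainly in which codons may initiate.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)
# Assumption: only the most common bacterial initiators are listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand CDS, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in the record above.
print(translate("GTGGTTGCTG AAAACCAGCC TGGGCACATT"))  # MVAENQPGHI
```

The printed residues match the first block of the Protein sequence row. Passing the full Gene sequence value to `gc_fraction` and `translate` would allow the GC content and Protein sequence rows to be checked in the same way.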