Gene EcSMS35_1606 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1606 
Symbolmic 
ID6144397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1595265 
End bp1596485 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content53% 
IMG OID641616483 
Producttranscriptional regulator Mic 
Protein accessionYP_001743661 
Protein GI170679760 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTTGCTG AAAACCAGCC TGGGCACATT GATCAAATAA AGCAGACCAA CGCGGGCGCG 
GTATATCGCC TGATTGATCA GCTTGGTCCA GTCTCGCGTA TCGATCTTTC CCGTCTGGCG
CAACTGGCTC CTGCCAGTAT CACTAAAATT GTCCGTGAGA TGCTCGAAGC ACACCTGGTG
CAAGAGCTGG AAATCAAAGA AGCGGGGAAC CGTGGCCGTC CGGCGGTGGG GCTGGTGGTT
GAAACTGAAG CCTGGCACTA TCTTTCTCTG CGCATTAGTC GCGGGGAGAT TTTCCTTGCT
CTGCGCGATC TGAGCAGCAA ACTGGTGGTG GAGGAGGCGC AGGAACTGGC GTTAAAAGAT
GACTCACCAT TGCTGGATCG TATCATTTCC CATATCGATC AGTTTTTTAT CCGCCACCAG
AAAAAACTTG AGCGTCTAAC TTCGATTGCC ATAACCTTGC CGGGAATTAT TGATACGGAA
AATGGCATTG TACATCGCAT GCCGTTCTAC GAGGATGTAA AAGAGATGCC GCTCGGCGAG
GCGCTGGAGC AGCATACCGG CGTACCGGTT TATATCCAGC ATGATATCAG CGCATGGACG
ATGGCAGAGG CCTTGTTTGG TGCCTCACGC GGGGCGCGCG ATGTGATTCA GGTGGTTATC
GATCACAACG TGGGGGCGGG CGTCATTACC GATGGTCATC TGCTACACGC CGGCAGCAGT
AGCCTCGTGG AAATAGGTCA CACGCAGGTC GACCCGTATG GGAAACGCTG TTATTGCGGG
AATCACGGCT GCCTCGAAAC CATCGCCAGT GTGGACAGTA TTCTTGAGCT GGCACAGCTG
CGTCTCAATC AATCCATGAG CTCGATGTTA CATGGACAGC CGTTAACCGT GGACTCATTG
TGTCAGGCGG CATTGCGCGG CGATCTACTG GCAAAAGACA TCATTACCGG GGTGGGCGCG
CATGTCGGGC GCATTCTTGC CATCATGGTG AATTTATTTA ACCCACAAAA AATACTGATT
GGCTCACCGT TAAGTAAAGC GGCAGATATC CTCTTCCCGG TCATCTCGGA CAGCATCCGT
CAGCAGGCCC TTCCTGCGTA TAGTCAGCAC ATTAGCGTTG AGAGTACTCA ATTTTCTAAC
CAGGGTACGA TGGCAGGGGC TGCGCTAGTA AAAGACGCGA TGTATAACGG TTCTTTGTTG
ATTCGTCTGT TGCAGGGTTA A
 
Protein sequence
MVAENQPGHI DQIKQTNAGA VYRLIDQLGP VSRIDLSRLA QLAPASITKI VREMLEAHLV 
QELEIKEAGN RGRPAVGLVV ETEAWHYLSL RISRGEIFLA LRDLSSKLVV EEAQELALKD
DSPLLDRIIS HIDQFFIRHQ KKLERLTSIA ITLPGIIDTE NGIVHRMPFY EDVKEMPLGE
ALEQHTGVPV YIQHDISAWT MAEALFGASR GARDVIQVVI DHNVGAGVIT DGHLLHAGSS
SLVEIGHTQV DPYGKRCYCG NHGCLETIAS VDSILELAQL RLNQSMSSML HGQPLTVDSL
CQAALRGDLL AKDIITGVGA HVGRILAIMV NLFNPQKILI GSPLSKAADI LFPVISDSIR
QQALPAYSQH ISVESTQFSN QGTMAGAALV KDAMYNGSLL IRLLQG