Gene EcHS_A1668 details


Gene Information

Locus tag: EcHS_A1668
Symbol: mic
ID: 5595212
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1689479
End bp: 1690699
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 52%
IMG OID: 640920816
Product: transcriptional regulator Mic
Protein accession: YP_001458372
Protein GI: 157161054
COG category: [G] Carbohydrate transport and metabolism; [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID:
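
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above; the variable names are illustrative.
start_bp = 1689479
end_bp = 1690699
reported_gene_length = 1221      # bp
reported_protein_length = 406    # aa

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the final codon is taken to be
# the stop codon, which does not contribute a residue to the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1221 0 406
```

Both derived values agree with the Gene Length and Protein Length fields reported in the record.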


Plasmid Coverage information

Num covering plasmid clones: 56
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGTTGCTG AAAACCAGCC TGGGCACATT GATCAAATAA AGCAGACCAA CGCGGGCGCG 
GTTTATCGCC TGATTGATCA GCTTGGTCCA GTCTCGCGTA TCGATCTTTC CCGTCTGGCG
CAACTGGCTC CTGCCAGTAT CACTAAAATT GTCCGTGAGA TGCTCGAAGC ACACCTGGTG
CAAGAGCTGG AAATCAAAGA AGCGGGGAAC CGTGGCCGTC CGGCGGTGGG GCTGGTGGTT
GAAACTGAAG CCTGGCACTA TCTTTCTCTG CGCATTAGTC GCGGGGAGAT TTTCCTTGCT
CTGCGCGATC TGAGCAGCAA ACTGGTGGTG GAAGAGTCGC AGGAACTGGC GTTAAAAGAT
GACTTGCCAT TGCTGGATCG TATTATTTCC CATATCGATC AGTTTTTTAT CCGCCACCAG
AAAAAACTTG AGCGTCTAAC TTCGATTGCC ATAACCTTGC CGGGAATTAT TGATACGGAA
AATGGTATTG TACATCGCAT GCCGTTCTAC GAGGATGTAA AAGAGATGCC GCTCGGCGAG
GCGCTGGAGC AGCATACCGG CGTTCCGGTT TATATTCAGC ATGATATCAG CGCATGGACG
ATGGCAGAGG CCTTGTTTGG TGCCTCACGC GGGGCGCGCG ATGTGATTCA GGTGGTTATC
GATCACAACG TGGGGGCGGG CGTCATTACC GATGGTCATC TGCTACACGC AGGCAGCAGT
AGTCTCGTGG AAATAGGCCA CACACAGGTC GACCCGTATG GGAAACGCTG TTATTGCGGG
AATCACGGCT GCCTCGAAAC CATCGCCAGC GTGGACAGTA TTCTTGAGCT GGCACAGCTG
CGTCTTAATC AATCCATGAG CTCGATGTTA CATGGACAAC CGTTAACCGT GGACTCATTG
TGTCAGGCGG CATTGCGCGG CGATCTACTG GCAAAAGACA TCATTACCGG GGTGGGCGCG
CATGTCGGGC GCATTCTTGC CATCATGGTG AATTTATTTA ACCCACAAAA AATACTGATT
GGCTCACCGT TAAGTAAAGC GGCAGATATC CTCTTCCCGG TCATCTCAGA CAGCATCCGT
CAGCAGGCCC TTCCTGCGTA TAGTCAGCAC ATCAGCGTTG AGAGTACTCA GTTTTCTAAC
CAGGGCACGA TGGCAGGCGC TGCACTGGTA AAAGACGCGA TGTATAACGG TTCTTTGTTG
ATTCGTCTGT TGCAGGGTTA A
 
Protein sequence
MVAENQPGHI DQIKQTNAGA VYRLIDQLGP VSRIDLSRLA QLAPASITKI VREMLEAHLV 
QELEIKEAGN RGRPAVGLVV ETEAWHYLSL RISRGEIFLA LRDLSSKLVV EESQELALKD
DLPLLDRIIS HIDQFFIRHQ KKLERLTSIA ITLPGIIDTE NGIVHRMPFY EDVKEMPLGE
ALEQHTGVPV YIQHDISAWT MAEALFGASR GARDVIQVVI DHNVGAGVIT DGHLLHAGSS
SLVEIGHTQV DPYGKRCYCG NHGCLETIAS VDSILELAQL RLNQSMSSML HGQPLTVDSL
CQAALRGDLL AKDIITGVGA HVGRILAIMV NLFNPQKILI GSPLSKAADI LFPVISDSIR
QQALPAYSQH ISVESTQFSN QGTMAGAALV KDAMYNGSLL IRLLQG
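
The relationship between the two sequences can be illustrated by translating the start of the gene sequence. The record lists translation table 11, which uses the standard amino-acid assignments and allows alternative initiation codons such as GTG to be read as methionine. The following is a minimal sketch, not the tool that produced this record: `translate_cds` is a hypothetical helper, and its start-codon set is a deliberately small subset of the initiators permitted by table 11.

```python
from itertools import product

# Standard amino-acid assignments in NCBI base order (T, C, A, G).
# Translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a small subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a CDS, reading a recognised first codon as Met (hypothetical helper)."""
    seq = "".join(dna.split()).upper()   # drop the spaces between 10-base blocks
    usable = len(seq) - len(seq) % 3     # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")          # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":                    # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, copied from the first sequence line above.
first_60 = "GTGGTTGCTG AAAACCAGCC TGGGCACATT GATCAAATAA AGCAGACCAA CGCGGGCGCG"
print(translate_cds(first_60))  # MVAENQPGHIDQIKQTNAGA
```

The output matches the first 20 residues of the protein sequence, including the leading M that comes from the GTG start codon.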