Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_2036 |
Symbol | |
ID | 6067852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 2247928 |
End bp | 2249148 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641601448 |
Product | ROK family protein |
Protein accession | YP_001725007 |
Protein GI | 170020053 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTTGCTG AAAACCAGCC TGGGCACATT GATCAAATAA AGCAGACCAA CGCGGGCGCG GTTTATCGCC TGATTGATCA GCTTGGTCCA GTCTCGCGTA TCGATCTTTC CCGTCTGGCG CAACTGGCTC CTGCCAGTAT CACTAAAATT GTCCGTGAGA TGCTCGAAGC ACACCTGGTG CAAGAGCTGG AAATCAAAGA AGCGGGGAAC CGTGGCCGTC CGGCGGTGGG GCTGGTGGTT GAAACTGAAG CCTGGCACTA TCTTTCTCTG CGCATTAGTC GCGGGGAGAT TTTCCTTGCT CTGCGCGATC TGAGCAGCAA ACTGGTGGTG GAAGAGTCGC AGGAACTGGC GTTAAAAGAT GACTTGCCAT TGCTGGATCG TATTATTTCC CATATCGATC AGTTTTTTAT CCGCCACCAG AAAAAACTTG AGCGTCTAAC TTCGATTGCC ATAACCTTGC CGGGAATTAT TGATACGGAA AATGGTATTG TACATCGCAT GCCGTTCTAC GAGGATGTAA AAGAGATGCC GCTCGGCGAG GCGCTGGAGC AGCATACCGG CGTTCCGGTT TATATTCAGC ATGATATCAG CGCATGGACG ATGGCAGAGG CCTTGTTTGG TGCCTCACGC GGGGCGCGCG ATGTGATTCA GGTGGTTATC GATCACAACG TGGGGGCGGG CGTCATTACC GATGGTCATC TGCTACACGC AGGCAGCAGT AGTCTCGTGG AAATAGGCCA CACACAGGTC GACCCGTATG GGAAACGCTG TTATTGCGGG AATCACGGCT GCCTCGAAAC CATCGCCAGC GTGGACAGTA TTCTTGAGCT GGCACAGCTG CGTCTTAATC AATCCATGAG CTCGATGTTA CATGGACAAC CGTTAACCGT GGACTCATTG TGTCAGGCGG CATTGCGCGG CGATCTACTG GCAAAAGACA TCATTACCGG GGTGGGCGCG CATGTCGGGC GCATTCTTGC CATCATGGTG AATTTATTTA ACCCACAAAA AATACTGATT GGCTCACCGT TAAGTAAAGC GGCAGATATC CTCTTCCCGG TCATCTCAGA CAGCATCCGT CAGCAGGCCC TTCCTGCGTA TAGTCAGCAC ATCAGCGTTG AGAGTACTCA GTTTTCTAAC CAGGGCACGA TGGCAGGCGC TGCACTGGTA AAAGACGCGA TGTATAACGG TTCTTTGTTG ATTCGTCTGT TGCAGGGTTA A
|
Protein sequence | MVAENQPGHI DQIKQTNAGA VYRLIDQLGP VSRIDLSRLA QLAPASITKI VREMLEAHLV QELEIKEAGN RGRPAVGLVV ETEAWHYLSL RISRGEIFLA LRDLSSKLVV EESQELALKD DLPLLDRIIS HIDQFFIRHQ KKLERLTSIA ITLPGIIDTE NGIVHRMPFY EDVKEMPLGE ALEQHTGVPV YIQHDISAWT MAEALFGASR GARDVIQVVI DHNVGAGVIT DGHLLHAGSS SLVEIGHTQV DPYGKRCYCG NHGCLETIAS VDSILELAQL RLNQSMSSML HGQPLTVDSL CQAALRGDLL AKDIITGVGA HVGRILAIMV NLFNPQKILI GSPLSKAADI LFPVISDSIR QQALPAYSQH ISVESTQFSN QGTMAGAALV KDAMYNGSLL IRLLQG
|
| |