Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2110 |
Symbol | cbl |
ID | 5595386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2098567 |
End bp | 2099517 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640921249 |
Product | transcriptional regulator Cbl |
Protein accession | YP_001458789 |
Protein GI | 157161471 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 3.3178700000000003e-19 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATTTCC AACAACTAAA GATAATCCGC GAGGCTGCAC GTCAGGATTA CAACCTGACA GAGGTTGCGA ATATGCTTTT TACCTCGCAG TCCGGCGTCA GCCGTCATAT TCGGGAACTG GAGGATGAAC TTGGCATCGA AATATTTGTT CGACGAGGTA AGCGACTGCT GGGCATGACT GAACCGGGCA AAGCATTACT GGTCATTGCA GAACGTATTC TGAATGAAGC CAGTAATGTT CGTCGGCTTG CAGACCTGTT TACCAACGAT ACGTCTGGCG TTCTCACTAT TGCAACGACG CATACTCAGG CACGTTATAG CTTGCCTGAG GTCATTAAAG CTTTTCGCGA ACTTTTCCCG GAGGTTCGGC TGGAGCTAAT CCAGGGTACG CCACAGGAAA TTGCGACGTT GTTGCAAAAT GGCGAAGCTG ATATTGGTAT CGCCAGCGAG CGTTTGAGTA ATGACCCGCA GCTCGTCGCC TTCCCGTGGT TTCGTTGGCA CCATAGTTTG CTTGTTCCAC TCGATCATCC CTTGACGCAA ATTACACCGT TGACGCTGGA ATCAATAGCG AAGTGGCCGT TAATCACTTA CCGACAGGGG ATTACGGGGC GCTCACGTAT TGATGACGCA TTTGCCCGCA AAGGTTTGCT GGCATATATT GTATTAAGTG CGCAGGATTC TGATGTCATT AAAACCTATG TTGCTCTTGG GCTGGGGATT GGATTAGTTG CCGAACAATC CAGCGGCGAA CAAGAGGAAG AGAATTTAAT CCGTCTGGAT ACGCGGCATC TTTTCGATGC CAATACTGTA TGGTTGGGAC TGAAGCGAGG GCAACTTCAA CGTAACTATG TCTGGCGCTT TCTGGAACTT TGTAATGCAG GACTGTCAGT TGAGGATATC AAGCGCCAGG TGATGGAGAA CAGTGAAGAG GAAATTGATT ATCAGATATA G
|
Protein sequence | MNFQQLKIIR EAARQDYNLT EVANMLFTSQ SGVSRHIREL EDELGIEIFV RRGKRLLGMT EPGKALLVIA ERILNEASNV RRLADLFTND TSGVLTIATT HTQARYSLPE VIKAFRELFP EVRLELIQGT PQEIATLLQN GEADIGIASE RLSNDPQLVA FPWFRWHHSL LVPLDHPLTQ ITPLTLESIA KWPLITYRQG ITGRSRIDDA FARKGLLAYI VLSAQDSDVI KTYVALGLGI GLVAEQSSGE QEEENLIRLD TRHLFDANTV WLGLKRGQLQ RNYVWRFLEL CNAGLSVEDI KRQVMENSEE EIDYQI
|
| |