Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_01848 |
Symbol | cheA |
ID | 8116069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 1918592 |
End bp | 1920556 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644848067 |
Product | hypothetical protein |
Protein accession | YP_002999640 |
Protein GI | 251785336 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0643] Chemotaxis protein histidine kinase and related kinases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0113897 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCATGG ATATAAGCGA TTTTTATCAG ACATTTTTTG ATGAAGCGGA CGAACTGTTG GCTGACATGG AGCAGCATTT GCTGGTTTTG CAGCCGGAAG CGCCAGATGC CGAACAATTG AATGCCATCT TTCGGGCTGC CCACTCGATC AAAGGAGGGG CAGGAACTTT TGGCTTCAGC GTTTTGCAGG AAACCACGCA TCTGATGGAA AACCTGCTCG ATGAAGCCAG ACGAGGTGAG ATGCAACTCA ACACCGACAT TATCAATCTG TTTTTGGAAA CGAAGGACAT CATGCAAGAA CAGCTCGACG CTTATAAACA GTCGCAAGAG CCGGATGCCG CCAGCTTCGA TTATATCTGC CAGGCCTTGC GTCAACTGGC ATTAGAAGCG AAAGGCGAAA CGCCATCCGC AGTGACCCGA TTAAGTGTGG TTGCCAAAAG TGAACCGCAA GATGAGCAGA GTCGCAGTCA GTCGCCGCGA CGAATTATCC TTTCGCGCCT GAAGGCCGGG GAAGTCGACC TGCTGGAAGA AGAACTGGGA CATCTGACAA CGTTAACTGA CGTGGTGAAA GGGGCGGATT CGCTCTCGGC AATATTACCG GGCGACATCG CCGAAGATGA CATCACAGCG GTACTCTGTT TTGTGATTGA AGCCGATCAG ATTACCTTTG AAACAGTAGA AGTCTCGCCA AAAATATCCA CCCCACCAGT GCTTAAACTG GCAGCCGAAC AAGCGCCAAC CGGCCGCGTG GAGCGGGAAA AAACGACGCG CAGCAATGAA TCCACCAGCA TCCGTGTAGC GGTAGAAAAG GTTGATCAAT TAATTAACCT CGTCGGCGAG CTGGTTATCA CCCAGTCCAT GCTTGCCCAG CGTTCCAGCG AACTGGACCC GGTTAATCAT GGTGATTTGA TAACCAGCAT GGGGCAGTTA CAACGTAACG CCCGTGATTT GCAGGAATCA GTGATGTCGA TTCGCATGAT GCCGATGGAA TATGTTTTTA GTCGCTATCC CCGGCTGGTG CGTGATCTGG CGGGAAAACT CGGCAAGCAG GTAGAACTGA CGCTGGTGGG CAGTTCTACT GAACTCGACA AAAGCCTGAT AGAACGCATT ATCGACCCGC TGACCCACCT GGTACGCAAT AGCCTCGATC ACGGTATTGA ACTGCCAGAA AAACGGCTCG CCGCAGGTAA AAACAGCGTC GGAAATTTAA TTCTGTCTGC CGAACATCAG GGCGGCAACA TTTGCATTGA AGTGACCGAC GATGGGGCGG GGCTAAACCG TGAGCGAATT CTGGCAAAAG CGGCCTCGCA AGGTTTGACT GTCAGCGAAA ACATGAGCGA CGACGAAGTC GCGATGCTGA TATTTGCACC TGGCTTCTCC ACGGCAGAGC AGGTCACCGA CGTCTCCGGG CGCGGCGTCG GCATGGACGT CGTTAAACGT AATATCCAGG AGATGGGCGG TCATGTCGAA ATCCAGTCGA AGCAGGGTAC TGGCACTACG ATCCGCATTT TACTGCCGCT GACGCTGGCC ATCCTCGACG GCATGTCCGT ACGCGTTGCG GATGAAGTTT TCATTCTGCC GCTGAATGCT GTTATGGAAT CACTGCAACC CCGTGAAGCC GATCTCCATC CACTGGCCGG CGGCGAGCGG GTGCTGGAAG TGCGGGGTGA ATATCTGCCC ATCGTCGAAC TGTGGAAAGT GTTCAACGTC GCGGGCGCGA AAACCGAAGC CACCCAGGGA ATTGTGGTGA TCTTACAAAG TGGCGGTCGC CGCTACGCCT TGCTGGTGGA TCAATTAATT GGTCAACACC AGGTTGTGGT TAAAAACCTT GAAAGTAACT ATCGCAAAGT CCCCGGCATT TCTGCTGCGA CCATTCTTGG CGACGGCAGC GTGGCACTGA TTGTTGATGT CTCCGCCTTG CAGGCGATAA ACCGCGAACA ACGTATGGCG AACACCTCCG CCTGA
|
Protein sequence | MSMDISDFYQ TFFDEADELL ADMEQHLLVL QPEAPDAEQL NAIFRAAHSI KGGAGTFGFS VLQETTHLME NLLDEARRGE MQLNTDIINL FLETKDIMQE QLDAYKQSQE PDAASFDYIC QALRQLALEA KGETPSAVTR LSVVAKSEPQ DEQSRSQSPR RIILSRLKAG EVDLLEEELG HLTTLTDVVK GADSLSAILP GDIAEDDITA VLCFVIEADQ ITFETVEVSP KISTPPVLKL AAEQAPTGRV EREKTTRSNE STSIRVAVEK VDQLINLVGE LVITQSMLAQ RSSELDPVNH GDLITSMGQL QRNARDLQES VMSIRMMPME YVFSRYPRLV RDLAGKLGKQ VELTLVGSST ELDKSLIERI IDPLTHLVRN SLDHGIELPE KRLAAGKNSV GNLILSAEHQ GGNICIEVTD DGAGLNRERI LAKAASQGLT VSENMSDDEV AMLIFAPGFS TAEQVTDVSG RGVGMDVVKR NIQEMGGHVE IQSKQGTGTT IRILLPLTLA ILDGMSVRVA DEVFILPLNA VMESLQPREA DLHPLAGGER VLEVRGEYLP IVELWKVFNV AGAKTEATQG IVVILQSGGR RYALLVDQLI GQHQVVVKNL ESNYRKVPGI SAATILGDGS VALIVDVSAL QAINREQRMA NTSA
|
| |