Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_02891 |
Symbol | aer |
ID | 8116634 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 3082842 |
End bp | 3084362 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644849079 |
Product | hypothetical protein |
Protein accession | YP_003000652 |
Protein GI | 251786348 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTCTC ATCCGTATGT CACCCAGCAA AATACCCCGC TGGCGGACGA TACCACTCTG ATGTCCACTA CCGATCTGCA AAGCTATATC ACTCATGCTA ATGACACTTT TGTGCAGGTG AGCGGCTATA CCTTGCAAGA GTTACAAGGG CAGCCGCACA ACATGGTGCG TCACCCGGAT ATGCCAAAAG CGGCGTTTGC GGATATGTGG TTCACCCTGA AAAAAGGGGA GCCCTGGAGC GGCATCGTGA AAAATCGCCG CAAAAATGGT GACCATTATT GGGTGCGGGC CAATGCGGTA CCGATGGTGC GCGAGGGAAA AATCAGTGGC TATATGTCGA TTCGTACCCG GGCGACGGAT GAAGAGATCG CGGCGGTGGA GCCGCTGTAC AAAGCGTTGA ACGCCGGACG TACCAGTAAG CGTATTCATA AAGGCCTGGT GGTGCGTAAA GGCTGGCTGG GTAAACTGCC TTCATTACCG CTTCGCTGGC GGGCGCGTGG AGTGATGACC CTGATGTTTA TCTTGCTGGC GGCCATGCTT TGGTTTGTTG CTGCCCCGGT GGTGACGTAT ATCCTCTGTG CGTTAGTGGT ATTGTTGGCA AGCGCCTGTT TTGAATGGCA GATTGTGCGC CCGATAGAAA ATGTTGCCCA TCAGGCACTG AAGGTGGCGA CCGGAGAACG TAATAGTGTT GAGCATCTGA ATCGCAGCGA TGAGCTGGGG CTGACATTAC GTGCGGTAGG GCAACTTGGC CTGATGTGCC GTTGGCTAAT TAACGATGTC TCAAGCCAGG TGTCCAGTGT CAGAAATGGC AGTGAGACGC TGGCGAAAGG CACCGATGAA CTGAACGAAC ATACCCAGCA GACAGTTGAT AACGTTCAGC AAACGGTGGC GACCATGAAC CAAATGGCGG CGTCGGTGAA ACAGAACTCT GCCACGGCGT CGGCTGCCGA TAAACTGTCA ATCACTGCCA GTAATGCGGC AGTGCAGGGT GGGGAGGCGA TGACCACGGT GATCAAGACA ATGGACGATA TCGCCGACAG TACCCAGCGC ATTGGCACCA TTACTTCGCT GATTAACGAT ATTGCGTTTC AGACCAATAT TCTGGCCCTG AATGCGGCGG TGGAAGCGGC GCGTGCCGGC GAACAGGGCA AAGGTTTTGC AGTGGTGGCA GGGGAAGTGC GTCATTTAGC CAGCCGCAGC GCTAATGCTG CCAACGATAT TCGCAAGCTG ATTGATGCCA GTGCTGATAA GGTGCAATCC GGTTCGCAGC AGGTACACGC CGCCGGACGG ACGATGGAAG ATATTGTGGC ACAGGTGAAA AACGTCACCC AGTTGATCGC CCAGATTAGC CATTCAACGC TGGAACAGGC CGATGGGCTT TCCAGCCTGA CCCGTGCAGT GGATGAGCTT AACCTGATCA CCCAGAAAAA TGCCGAGCTG GTGGAAGAGA GTGCGCAGGT GTCGGCGATG GTGAAACACC GCGCCAGCCG ACTGGAAGAC GCGGTGACGG TACTGCATTA A
|
Protein sequence | MSSHPYVTQQ NTPLADDTTL MSTTDLQSYI THANDTFVQV SGYTLQELQG QPHNMVRHPD MPKAAFADMW FTLKKGEPWS GIVKNRRKNG DHYWVRANAV PMVREGKISG YMSIRTRATD EEIAAVEPLY KALNAGRTSK RIHKGLVVRK GWLGKLPSLP LRWRARGVMT LMFILLAAML WFVAAPVVTY ILCALVVLLA SACFEWQIVR PIENVAHQAL KVATGERNSV EHLNRSDELG LTLRAVGQLG LMCRWLINDV SSQVSSVRNG SETLAKGTDE LNEHTQQTVD NVQQTVATMN QMAASVKQNS ATASAADKLS ITASNAAVQG GEAMTTVIKT MDDIADSTQR IGTITSLIND IAFQTNILAL NAAVEAARAG EQGKGFAVVA GEVRHLASRS ANAANDIRKL IDASADKVQS GSQQVHAAGR TMEDIVAQVK NVTQLIAQIS HSTLEQADGL SSLTRAVDEL NLITQKNAEL VEESAQVSAM VKHRASRLED AVTVLH
|
| |