Gene Information |
Locus tag | B21_03833 |
Symbol | ybl202 |
ID | 8116698 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 4108060 |
End bp | 4109325 |
Gene Length | 1266 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644849992 |
Product | hypothetical protein |
Protein accession | YP_003001565 |
Protein GI | 251787261 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | |
| |
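The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive (the record does not say so explicitly). The names `span_length` and `check_record` are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the Gene Information table.
START_BP = 4108060
END_BP = 4109325
GENE_LENGTH_BP = 1266
PROTEIN_LENGTH_AA = 422


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def check_record(start: int, end: int, gene_len: int, protein_len: int) -> dict:
    """Compare the coordinate span, gene length, and protein length."""
    codons, remainder = divmod(gene_len, 3)
    return {
        "span_matches_gene_length": span_length(start, end) == gene_len,
        "length_is_multiple_of_three": remainder == 0,
        "codons": codons,
        "codons_match_protein_length": codons == protein_len,
    }


result = check_record(START_BP, END_BP, GENE_LENGTH_BP, PROTEIN_LENGTH_AA)
for name, value in result.items():
    print(f"{name}: {value}")
```

Under that assumption the span is 4109325 - 4108060 + 1 = 1266 bp, which equals the listed gene length, and 1266 / 3 = 422 codons, which equals the listed protein length. This suggests that every codon in the listed span is translated, so the span does not appear to include a stop codon.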
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0772561 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTTTTA TGCAACGTTC TAAAGACTCC TTAGCTAAAT GGTTAAGCGC GATCCTCCCC GTGGTCATTG TTGGGCTGGT GGGGCTGTTT GCGGTGACGG TGATTCGCGA TTATGGGCGC GAGACTGCCG CCGCCAGACA AACGCTGCTG GAAAAAGGCA GTGTACTTAT TCGCGCTCTT GAATCCGGCT CGCGCGTCGG CATGGGGATG CGCATGCATC ATGCGCAGCA GCAGGCCTTA CTGGAAGAAA TGGCCGGGCA GCCTGGAGTA CGTTGGTTTG CGGTCACGGA TGAACAAGGA ACAATCGTGA TGCATAGCAA CTCCGGCATG GTGGGAAAAC AGCTTTATTC CCCGCAGGAA ATGCAGCAGT TACATCCGGG AGATGAAGAA GCGTGGCGGC GGATCGATAG CGCAGACGGC GAGCCTGTTC TGGAAATTTA TCGCCAGTTT CAACCGATGT TTGCTGCTGG AATGCACCGG ATGCGCCATA TGCAGCAGTA TGCCGCGACA CCACAAGCAA TTTTCATCGC TTTCGACGCC AGTAATATTG TGAGTGCCGA AGATCGTGAG CAGAGAAACA CCCTGATTAT CCTCTTCGCC CTGGCGACGG TCTTGCTGGC AAGCGTGTTG TCATTCTTCT GGTATCGCCG CTATCTGCGC TCGCGCCAGC TGTTGCAGGA TGAAATGAAG CGCAAAGAGA AGCTGGTGGC ACTGGGGCAT CTTGCAGCAG GCGTTGCCCA CGAAATCCGT AATCCACTTT CCTCAATTAA AGGGCTGGCG AAATACTTTG CCGAACGCGC GCCAGCAGGG GGAGAAGCGC ATCAATTGGC GCAGGTGATG GCGAAAGAAG CCGACCGTTT AAACCGTGTG GTAAGCGAGT TGCTGGAACT GGTTAGGCCA ACGCATCTGG CTTTGCAGGC GGTGGATCTC AACACGCTGA TTAACCACTC ATTACAGCTG GTAAGCCAGG ATGCAAACAG CCGGGAGATC CAGTTACGCT TTACCGCCAA CGACACATTA CCGGTAATTC AGGCCGACCC AGACAGGCTG ACTCAGGTCC TGTTGAATCT CTATCTCAAT GCTATTCAGG CGATTGGTCA GCATGGTGTG ATTAGCGTGA CGGCCAGCGA AAGCGGCGCG GGCGTGAAAA TCAGCGTTAC CGACAGCGGT AAGGGAATTG CGGCAGGTCA GCTTGAAGCC ATCTTCACTC CGTACTTCAC CACCAAAGCC GAAGGCACCG GATTGGGGCT GGCGGTCGTG CATAAT
|
Protein sequence | MRFMQRSKDS LAKWLSAILP VVIVGLVGLF AVTVIRDYGR ETAAARQTLL EKGSVLIRAL ESGSRVGMGM RMHHAQQQAL LEEMAGQPGV RWFAVTDEQG TIVMHSNSGM VGKQLYSPQE MQQLHPGDEE AWRRIDSADG EPVLEIYRQF QPMFAAGMHR MRHMQQYAAT PQAIFIAFDA SNIVSAEDRE QRNTLIILFA LATVLLASVL SFFWYRRYLR SRQLLQDEMK RKEKLVALGH LAAGVAHEIR NPLSSIKGLA KYFAERAPAG GEAHQLAQVM AKEADRLNRV VSELLELVRP THLALQAVDL NTLINHSLQL VSQDANSREI QLRFTANDTL PVIQADPDRL TQVLLNLYLN AIQAIGQHGV ISVTASESGA GVKISVTDSG KGIAAGQLEA IFTPYFTTKA EGTGLGLAVV HN
|
| |
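The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch only, not the database's own pipeline: it builds the codon-to-amino-acid assignments of translation table 11 (which match the standard code; the tables differ in their permitted start codons), strips the spaces used for display grouping, and translates in frame from the first base. The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first 30 bases of the record are used to keep the example short.

```python
from itertools import product

# Codon assignments in TCAG order; table 11 uses the same amino acid
# assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove display whitespace (10-character groups) and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base; trailing partial codons are ignored.

    Raises KeyError for characters other than A, C, G, T.
    """
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three groups of the gene sequence and first group of the protein
# sequence, copied from the record.
gene_prefix = "ATGCGTTTTA TGCAACGTTC TAAAGACTCC"
protein_prefix = "MRFMQRSKDS"

print(translate(gene_prefix))                    # MRFMQRSKDS
print(translate(gene_prefix) == protein_prefix)  # True
print(translate("CATAAT"))                       # HN, the last two residues listed
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the complete protein sequence and with the 55% GC content reported in the record.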