Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_01962 |
Symbol | yegE |
ID | 8113296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 2041587 |
End bp | 2044904 |
Gene Length | 3318 bp |
Protein Length | 1105 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 644848176 |
Product | hypothetical protein |
Protein accession | YP_002999749 |
Protein GI | 251785445 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3447] Predicted integral membrane sensor domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box [TIGR00254] diguanylate cyclase (GGDEF) domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAAC AATCACAGCA TGTATTAATT GCCCTGCCCC ACCCGCTGCT TCACCTGGTC AGTTTAGGTT TAGTCTCGTT TATCTTTACC CTTTTCTCGC TTGAGCTTTC GCAGTTTGGC ACCCAACTCG CCCCACTGTG GTTCCCGACG TCCATCATGA TGGTGGCGTT TTATCGCCAT GCCGGGCGCA TGTGGCCGGG AATTGCGCTG AGCTGTTCGC TGGGAAATAT CGCCGCATCC ATCCTGCTTT TTTCCACCAG CTCGCTGAAC ATGACCTGGA CGACCATCAA TATTGTTGAA GCCGTGGTCG GGGCAGTGCT GCTACGTAAA TTGCTGCCGT GGTATAACCC CTTGCAAAAT CTGGCTGACT GGCTGCGTCT GGCACTCGGC AGCGCCATTG TTCCACCTCT GTTGGGGGGT GTTCTGGTTG TCCTGCTGAC GCCCGGAGAC GATCCTCTCA GGGCATTTTT GATATGGGTA CTGTCAGAAT CCATCGGCGC TCTGGCACTG GTGCCGCTGG GATTGTTATT TAAACCACAC TATCTGCTGC GCCATCGCAA CCCACGGTTG CTTTTTGAGT CGCTGCTCAC ATTAGCCATC ACACTGACGT TAAGCTGGCT TTCGATGCTG TATCTGCCGT GGCCTTTTAC TTTCATTATT GTGCTGTTGA TGTGGAGCGC CGTGCGCCTG CCACGAATGG AAGCCTTTTT GATCTTCCTT ACCACGGTGA TGATGGTGTC GCTGATGATG GCCGCGGATC CCTCCCTGCT TGCTACGCCG CGTACGTACC TGATGACCCA TATGCCGTGG CTACCGTTTT TGCTGATCCT GCTGCCCGCC AACATCATGA CGATGGTGAT GTATGCCTTT CGTGCGGAAC GCAAACACAT TTCCGAAAGC GAAACCCGTT TTCGGAACGC GATGGAATAT TCCGCCATCG GCATGGCATT AGTGGGCACC GAGGGACAAT GGCTGCAATC CAACAAAGCA CTCTGCCAGT TTCTCGGTTA CAGTCAGGAA GAGCTGCGCG GACTCACCTT TCAGCAACTG ACCTGGCCGG AGGATCTCAA TAAAGATCTC CAACAGGTTG AAAAGCTGAT AAGCGGTGAA ATAAACACCT ATTCAATGGA AAAACGTTAC TACAACCGCA ATGGCGATGT TGTCTGGGCG TTGCTTGCCG TCTCACTGGT GCGCCACACG GATGGCACGC CGCTCTATTT TATCGCTCAG ATTGAAGACA TTAACGAGCT AAAACGCACC GAACAGGTGA ATCAGCAACT GATGGAACGC ATCACGCTGG CTAACGAAGC GGGCGGGATT GGCATCTGGG AGTGGGAGTT GAAGCCGAAT ATTTTTAGCT GGGATAAGCG GATGTTCGAG CTGTATGAAA TTCCTCCGCA TATCAAACCG AACTGGCAGG TGTGGTACGA GTGCGTGCTG CCGGAAGATC GCCAGCATGC CGAAAAAGTG ATTCGTGATT CGTTGCAATC ACGCTCGCCC TTTAAGCTGG AATTTCGCAT TGCCGTGAAA GATGGCATCC GCCATATCCG CGCCCTCGCC AACCGGGTAC TGAATAAAGA AGGCGAAGTC GAACGCCTGC TCGGCATTAA TATGGATATG ACTGAGGTTA AACAGCTTAA CGAGGCATTG TTTCAGGAAA AAGAGCGCCT GCACATTACG CTTGATTCCA TCGGTGAAGC CGTGGTCTGT ATTGATATGG CGATGAAAAT TACCTTTATG AATCCAGTGG CGGAGAAGAT GAGCGGCTGG ACGCAGGAAG AAGCGTTAGG TGTTCCGCTC CTGACGGTGT TGCATATTAC TTTTGGCGAC AACGGACCAT TAATGGAGAA CATTTACAGT GCCGACACCT CACGTTCCGC GATCGAACAA GATGTGGTGT TGCACTGTCG GAGCGGCGGC AGTTACGACG TGCATTACAG TATTACGCCG TTAAGTACTC TGGACGGCAG CAATATTGGT TCGGTTCTGG TGATTCAGGA CGTTACCGAA TCACGCAAAA TGCTGCGCCA GCTGAGCTAC AGCGCCTCCC ATGATGCACT GACGCATCTC GCCAACCGCG CCAGTTTTGA GAAACAACTG CGTATCCTGC TGCAAACGGT AAACAGTACG CATCAGCGAC ATGCACTGGT GTTTATCGAT CTTGATCGCT TTAAAGCGGT GAATGACAGC GCCGGGCATG CGGCGGGCGA CGCTTTGCTG CGCGAACTGG CGTCGTTGAT GCTGAGTATG CTGCGCTCCA GTGACGTGCT GGCGCGGCTC GGCGGTGATG AATTTGGTCT GCTGCTGCCA GACTGCAATG TTGAAAGCGC TCGTTTTATC GCTACACGTA TTATCAGTGC TGTGAATGAC TATCACTTTA TATGGGAAGG CCGTGTGCAT CAGGTAGGTG CCAGTGCCGG GATTACCTTG ATTGATGACA ACAATCATCA GGCGGCAGAA GTGATGTCGC AGGCTGATAT CGCCTGTTAT GCCTCCAAAA ATGGTGGCCG GGGCCGGGTG ACGGTTTACG AACCGCAGCA AGCTGCCGCA CATAGCGAAC GGGCGGCGAT GTCGCTTGAT GAACAGTGGC GGATGATTAA AGAGAATCAG TTGATGATGC TCGCCCACGG TGTCGCTTCG CCACGGATCC CGGAAGCGCG TAATTTGTGG CTGATTTCAC TTAAGCTCTG GAGTTGCGAA GGCGAGATTA TTGATGAACA AACATTTCGT CGTAGCTTCA GCGATCCGGC GCTTAGCCAT GCTCTTGACC GCCGGGTATT CCACGAATTT TTCCAGCAGG CCGCAAAAGC GGTTGCCAGT AAAGGCATAA GCATCTCCCT CCCCCTTTCC GTTGCCGGTT TGAGTAGCGC CACGCTGGTG AATGATCTGC TTGAGCAGCT GGAAAATAGC CCTCTACCAC CACGGTTATT ACATCTGATT ATTCCGGCTG AAGCGATTTT AGATCACGCA GAAAGCGTGC AAAAACTGCG GCTGGCGGGA TGTCGGATAG TGCTCAGCCA GGTGGGCCGC GATCTGCAAA TCTTCAACTC GCTGAAAGCG AATATGGCAG ATTACCTGCT ACTTGATGGT GAGTTATGCG CCAACGTGCA GGGTAATTTG ATGGATGAGA TGCTGATTAC GATTATTCAG GGGCACGCTC AGCGACTCGG GATGAAAACC ATCGCCGGGC CAGTCGTTTT ACCCTTAGTG ATGGATACGC TTTCTGGCAT CGGCGTCGAT CTGATTTATG GTGAGGTGAT TGCCGATGCC CAACCGCTGG ATTTGCTGGT GAATAGCAGT TATTTCGCGA TTAACTGA
|
Protein sequence | MSKQSQHVLI ALPHPLLHLV SLGLVSFIFT LFSLELSQFG TQLAPLWFPT SIMMVAFYRH AGRMWPGIAL SCSLGNIAAS ILLFSTSSLN MTWTTINIVE AVVGAVLLRK LLPWYNPLQN LADWLRLALG SAIVPPLLGG VLVVLLTPGD DPLRAFLIWV LSESIGALAL VPLGLLFKPH YLLRHRNPRL LFESLLTLAI TLTLSWLSML YLPWPFTFII VLLMWSAVRL PRMEAFLIFL TTVMMVSLMM AADPSLLATP RTYLMTHMPW LPFLLILLPA NIMTMVMYAF RAERKHISES ETRFRNAMEY SAIGMALVGT EGQWLQSNKA LCQFLGYSQE ELRGLTFQQL TWPEDLNKDL QQVEKLISGE INTYSMEKRY YNRNGDVVWA LLAVSLVRHT DGTPLYFIAQ IEDINELKRT EQVNQQLMER ITLANEAGGI GIWEWELKPN IFSWDKRMFE LYEIPPHIKP NWQVWYECVL PEDRQHAEKV IRDSLQSRSP FKLEFRIAVK DGIRHIRALA NRVLNKEGEV ERLLGINMDM TEVKQLNEAL FQEKERLHIT LDSIGEAVVC IDMAMKITFM NPVAEKMSGW TQEEALGVPL LTVLHITFGD NGPLMENIYS ADTSRSAIEQ DVVLHCRSGG SYDVHYSITP LSTLDGSNIG SVLVIQDVTE SRKMLRQLSY SASHDALTHL ANRASFEKQL RILLQTVNST HQRHALVFID LDRFKAVNDS AGHAAGDALL RELASLMLSM LRSSDVLARL GGDEFGLLLP DCNVESARFI ATRIISAVND YHFIWEGRVH QVGASAGITL IDDNNHQAAE VMSQADIACY ASKNGGRGRV TVYEPQQAAA HSERAAMSLD EQWRMIKENQ LMMLAHGVAS PRIPEARNLW LISLKLWSCE GEIIDEQTFR RSFSDPALSH ALDRRVFHEF FQQAAKAVAS KGISISLPLS VAGLSSATLV NDLLEQLENS PLPPRLLHLI IPAEAILDHA ESVQKLRLAG CRIVLSQVGR DLQIFNSLKA NMADYLLLDG ELCANVQGNL MDEMLITIIQ GHAQRLGMKT IAGPVVLPLV MDTLSGIGVD LIYGEVIADA QPLDLLVNSS YFAIN
|
| |