Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECD_02383 |
Symbol | hyfR |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21(DE3) |
Kingdom | Bacteria |
Replicon accession | CP001509 |
Strand | + |
Start bp | 2476387 |
End bp | 2478393 |
Gene Length | 2007 bp |
Protein Length | 668 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | DNA-binding transcriptional activator, formate sensing |
Protein accession | ACT44203 |
Protein GI | 253978533 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGACG AGGCGATGTT TGCCCCGCCG CAAGGAATAA CAATTGAAGC GGTAAACGGA ATGCTCGCGG AGCGGTTAGC GCAGAAACAC GGTAAGGCGT CTTTATTACG CGCCTTCATC CCGCTGCCGC CGCCGTTCAG CCCGGTACAA CTTATTGAAC TGCATGTTCT CAAAAGCAAC TTCTATTACC GCTACCATGA TGATGGCAGC GATGTGACGG CAACAACAGA GTATCAGGGC GAGATGGTCG ATTATTCGCG TCACGCCGTC CTTCTCGGCA GTAGTGGAAT GGCGGAGCTA CGCTTTATTC GCACCCACGG CAGTCGTTTT ACTCCCCAGG ATTGCACACT GTTTAACTGG CTGGCGCGGA TAATCACCCC GGTTCTGCAA TCATGGCTCA ATGATGAAGA ACAGCAGGTG GCGCTGCGTT TGCTGGAGAA AGATCGCGAT CATCATCGGG TACTGGTTGA TATTACTAAT GCAGTGCTGT CACATCTTGA TCTCGACGAT CTGATCGCTG ACGCCGCTCG TGAGATCCAT CATTTTTTCG GTCTGGCTTC AGTCAGTATG GTACTGGGCG ATCATCGAAA GAACGAGAAG TTTAGCCTGT GGTGCAGCGA TCTTTCTGCC TCACATTGTG CGTGTCTGCC ACGCAATATG CCTGGCGACA GTGTATTGCT GACACAAACG CTACAAACCC GACAACCGAC CTTGACGCAC CGTGCAGACG ATCTGTTTCT CTGGCAACGC GACCCGTTAT TACTCTTACT TGCATCTAAC GGCTGCGAAT CTGCGCTCCT TATACCGCTT ACCTTTGGCA ACCATACACC GGGTGCATTG TTGCTGGCGC ATACCTCTTC CACTCTCTTT AGTGAGGAAA ACTGCCAGCT ACTACAACAC ATAGCCGATC GCATCGCTAT TGCCGTTGGC AATGCCGATG CCTGGCGTAG CATGACCGAT TTGCAGGAAA GTTTGCAGCA AGAAAACCAC CAGCTTAGCG AGCAGCTCCT TTCGAATCTG GGCATCGGTG ACATTATCTA TCAAAGCCAG GCAATGGAAG ACCTACTCCA GCAGGTAGAT ATTGTGGCGA AGAGCGACAG TACGGTGTTG ATTTGCGGTG AAACCGGAAC CGGCAAAGAG GTGATCGCCA GAGCGATCCA TCAACTTAGC CCGCGACGCG ACAAGCCGCT GGTCAAAATC AACTGCGCTG CCATCCCCGC CAGTCTTCTG GAAAGTGAGT TATTCGGTCA TGACAAAGGG GCGTTTACTG GTGCGATTAA TACCCATCGT GGTCGTTTTG AAATTGCCGA TGGCGGCACG TTGTTTCTCG ATGAAATTGG CGATCTGCCG TTAGAACTTC AGCCTAAACT GCTGCGCGTA TTGCAGGAAC GGGAGATTGA GCGTCTCGGC GGGAGTAGAA CGATCCCGGT AAATGTCAGA GTCATTGCCG CCACCAACCG TGATTTGTGG CAAATGGTTG AAGATCGCCA GTTTCGCAGC GATCTCTTTT ATCGCCTGAA TGTCTTCCCA CTGGAATTGC CGCCGCTGCG CGACCGTCCG GAAGATATCC CTCTTTTAGC AAAGCATTTC ACGCAAAAAA TGGCGCGCCA TATGAATCGC GCAATTGACG CCATCCCGAC CGAGGCACTA CGCCAGTTGA TGTCGTGGGA TTGGCCGGGC AACGTGCGCG AGCTGGAAAA CGTGATTGAG CGGGCGGTAC TGTTGACTCG TGGTAACAGT CTGAATTTAC ATCTAAATGT CCGACAAAGC CGTTTACTGC CGACGCTAAA TGAAGATTCA GCGCTTCGCA GTTCAATGGC GCAGTTGCTG CACCCGACGA CGCCAGAGAA TGACGAAGAA GAACGTCAGC GCATTGTTCA GGTATTGCGA GAAACCAATG GCATTGTTGC CGGGCCCCGT GGCGCGGCGA CACGATTAGG GATGAAGCGC ACCACGCTGC TGTCACGAAT GCAGCGTCTG GGGATCTCGG TTCGCGAGGT GTTGTAA
|
Protein sequence | MSDEAMFAPP QGITIEAVNG MLAERLAQKH GKASLLRAFI PLPPPFSPVQ LIELHVLKSN FYYRYHDDGS DVTATTEYQG EMVDYSRHAV LLGSSGMAEL RFIRTHGSRF TPQDCTLFNW LARIITPVLQ SWLNDEEQQV ALRLLEKDRD HHRVLVDITN AVLSHLDLDD LIADAAREIH HFFGLASVSM VLGDHRKNEK FSLWCSDLSA SHCACLPRNM PGDSVLLTQT LQTRQPTLTH RADDLFLWQR DPLLLLLASN GCESALLIPL TFGNHTPGAL LLAHTSSTLF SEENCQLLQH IADRIAIAVG NADAWRSMTD LQESLQQENH QLSEQLLSNL GIGDIIYQSQ AMEDLLQQVD IVAKSDSTVL ICGETGTGKE VIARAIHQLS PRRDKPLVKI NCAAIPASLL ESELFGHDKG AFTGAINTHR GRFEIADGGT LFLDEIGDLP LELQPKLLRV LQEREIERLG GSRTIPVNVR VIAATNRDLW QMVEDRQFRS DLFYRLNVFP LELPPLRDRP EDIPLLAKHF TQKMARHMNR AIDAIPTEAL RQLMSWDWPG NVRELENVIE RAVLLTRGNS LNLHLNVRQS RLLPTLNEDS ALRSSMAQLL HPTTPENDEE ERQRIVQVLR ETNGIVAGPR GAATRLGMKR TTLLSRMQRL GISVREVL
|
| |