Gene ECD_02383 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_02383 
SymbolhyfR 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp2476387 
End bp2478393 
Gene Length2007 bp 
Protein Length668 aa 
Translation table11 
GC content53% 
IMG OID 
ProductDNA-binding transcriptional activator, formate sensing 
Protein accessionACT44203 
Protein GI253978533 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGACG AGGCGATGTT TGCCCCGCCG CAAGGAATAA CAATTGAAGC GGTAAACGGA 
ATGCTCGCGG AGCGGTTAGC GCAGAAACAC GGTAAGGCGT CTTTATTACG CGCCTTCATC
CCGCTGCCGC CGCCGTTCAG CCCGGTACAA CTTATTGAAC TGCATGTTCT CAAAAGCAAC
TTCTATTACC GCTACCATGA TGATGGCAGC GATGTGACGG CAACAACAGA GTATCAGGGC
GAGATGGTCG ATTATTCGCG TCACGCCGTC CTTCTCGGCA GTAGTGGAAT GGCGGAGCTA
CGCTTTATTC GCACCCACGG CAGTCGTTTT ACTCCCCAGG ATTGCACACT GTTTAACTGG
CTGGCGCGGA TAATCACCCC GGTTCTGCAA TCATGGCTCA ATGATGAAGA ACAGCAGGTG
GCGCTGCGTT TGCTGGAGAA AGATCGCGAT CATCATCGGG TACTGGTTGA TATTACTAAT
GCAGTGCTGT CACATCTTGA TCTCGACGAT CTGATCGCTG ACGCCGCTCG TGAGATCCAT
CATTTTTTCG GTCTGGCTTC AGTCAGTATG GTACTGGGCG ATCATCGAAA GAACGAGAAG
TTTAGCCTGT GGTGCAGCGA TCTTTCTGCC TCACATTGTG CGTGTCTGCC ACGCAATATG
CCTGGCGACA GTGTATTGCT GACACAAACG CTACAAACCC GACAACCGAC CTTGACGCAC
CGTGCAGACG ATCTGTTTCT CTGGCAACGC GACCCGTTAT TACTCTTACT TGCATCTAAC
GGCTGCGAAT CTGCGCTCCT TATACCGCTT ACCTTTGGCA ACCATACACC GGGTGCATTG
TTGCTGGCGC ATACCTCTTC CACTCTCTTT AGTGAGGAAA ACTGCCAGCT ACTACAACAC
ATAGCCGATC GCATCGCTAT TGCCGTTGGC AATGCCGATG CCTGGCGTAG CATGACCGAT
TTGCAGGAAA GTTTGCAGCA AGAAAACCAC CAGCTTAGCG AGCAGCTCCT TTCGAATCTG
GGCATCGGTG ACATTATCTA TCAAAGCCAG GCAATGGAAG ACCTACTCCA GCAGGTAGAT
ATTGTGGCGA AGAGCGACAG TACGGTGTTG ATTTGCGGTG AAACCGGAAC CGGCAAAGAG
GTGATCGCCA GAGCGATCCA TCAACTTAGC CCGCGACGCG ACAAGCCGCT GGTCAAAATC
AACTGCGCTG CCATCCCCGC CAGTCTTCTG GAAAGTGAGT TATTCGGTCA TGACAAAGGG
GCGTTTACTG GTGCGATTAA TACCCATCGT GGTCGTTTTG AAATTGCCGA TGGCGGCACG
TTGTTTCTCG ATGAAATTGG CGATCTGCCG TTAGAACTTC AGCCTAAACT GCTGCGCGTA
TTGCAGGAAC GGGAGATTGA GCGTCTCGGC GGGAGTAGAA CGATCCCGGT AAATGTCAGA
GTCATTGCCG CCACCAACCG TGATTTGTGG CAAATGGTTG AAGATCGCCA GTTTCGCAGC
GATCTCTTTT ATCGCCTGAA TGTCTTCCCA CTGGAATTGC CGCCGCTGCG CGACCGTCCG
GAAGATATCC CTCTTTTAGC AAAGCATTTC ACGCAAAAAA TGGCGCGCCA TATGAATCGC
GCAATTGACG CCATCCCGAC CGAGGCACTA CGCCAGTTGA TGTCGTGGGA TTGGCCGGGC
AACGTGCGCG AGCTGGAAAA CGTGATTGAG CGGGCGGTAC TGTTGACTCG TGGTAACAGT
CTGAATTTAC ATCTAAATGT CCGACAAAGC CGTTTACTGC CGACGCTAAA TGAAGATTCA
GCGCTTCGCA GTTCAATGGC GCAGTTGCTG CACCCGACGA CGCCAGAGAA TGACGAAGAA
GAACGTCAGC GCATTGTTCA GGTATTGCGA GAAACCAATG GCATTGTTGC CGGGCCCCGT
GGCGCGGCGA CACGATTAGG GATGAAGCGC ACCACGCTGC TGTCACGAAT GCAGCGTCTG
GGGATCTCGG TTCGCGAGGT GTTGTAA
 
Protein sequence
MSDEAMFAPP QGITIEAVNG MLAERLAQKH GKASLLRAFI PLPPPFSPVQ LIELHVLKSN 
FYYRYHDDGS DVTATTEYQG EMVDYSRHAV LLGSSGMAEL RFIRTHGSRF TPQDCTLFNW
LARIITPVLQ SWLNDEEQQV ALRLLEKDRD HHRVLVDITN AVLSHLDLDD LIADAAREIH
HFFGLASVSM VLGDHRKNEK FSLWCSDLSA SHCACLPRNM PGDSVLLTQT LQTRQPTLTH
RADDLFLWQR DPLLLLLASN GCESALLIPL TFGNHTPGAL LLAHTSSTLF SEENCQLLQH
IADRIAIAVG NADAWRSMTD LQESLQQENH QLSEQLLSNL GIGDIIYQSQ AMEDLLQQVD
IVAKSDSTVL ICGETGTGKE VIARAIHQLS PRRDKPLVKI NCAAIPASLL ESELFGHDKG
AFTGAINTHR GRFEIADGGT LFLDEIGDLP LELQPKLLRV LQEREIERLG GSRTIPVNVR
VIAATNRDLW QMVEDRQFRS DLFYRLNVFP LELPPLRDRP EDIPLLAKHF TQKMARHMNR
AIDAIPTEAL RQLMSWDWPG NVRELENVIE RAVLLTRGNS LNLHLNVRQS RLLPTLNEDS
ALRSSMAQLL HPTTPENDEE ERQRIVQVLR ETNGIVAGPR GAATRLGMKR TTLLSRMQRL
GISVREVL