Gene EcSMS35_2856 details


Gene Information

Locus tag: EcSMS35_2856
Symbol: fhlA
ID: 6145750
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2930230
End bp: 2932308
Gene Length: 2079 bp
Protein Length: 692 aa
Translation table: 11
GC content: 52%
IMG OID: 641617725
Product: formate hydrogenlyase transcriptional activator
Protein accession: YP_001744880
Protein GI: 170679601
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains
TIGRFAM ID:
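
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 2930230
END_BP = 2932308


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(START_BP, END_BP)
print(length)                  # 2079
print(protein_length(length))  # 692
```

Under those assumptions both results agree with the Gene Length and Protein Length fields of the record.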


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value: 0.573281
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCATATA CACCGATGAG TGATCTCGGA CAACAAGGGT TGTTCGACAT CACTCGGACA 
CTATTGCAGC AGCCCGATCT GGCCTCGCTG TGTGAGGCTC TTTCGCAACT GGTAAAGCGT
TCTGCGCTCG CCGACAACGC GGCTATTGTG TTGTGGCAAG CGCAGACTCA ACGTGCGTCT
TATTACGCGT CGCGTGAAAA AGACACCCCC ATTAAATATG AAGACGAAAC TGTTCTGGCA
CACGGTCCGG TACGCAGCAT TTTGTCGCGC CCTGATACGC TGCATTGCAG TTACGAAGAA
TTTTGTGAAA CCTGGCCGCA GCTGGCCGCA GGTGGACTAT ACCCCAAATT TGGTCACTAT
TGCCTGATGC CACTGGCGGC GGAAGGGCAT ATTTTTGGTG GCTGTGAATT TATTCGTTAT
GACGATCGCC CCTGGAGCGA AAAAGAGTTC AATCGTCTGC AAACATTTAC GCAGATCGTT
TCGGTCGTCA CCGAACAAAT CCAGAGCCGC GTCGTTAACA ATGTCGACTA TGAGTTGTTA
TGCCGGGAGC GCGATAACTT CCGCATCCTG GTTGCCATTA CCAACGCGGT GCTTTCCCGC
CTGGATATGG ACGAACTGGT CAGCGAAGTC GCCAAAGAAA TCCATTACTA CTTCGACATT
GACGATATCA GCATCGTCTT ACGCAGCCAC CGTAAAAACA AACTCAACAT CTACTCCACT
CACTATCTTG ATAAACAGCA TCCCGCCCAC GAACAGAGCG AAGTCGATGA AGCCGGAACC
CTCACCGAAC GCGTCTTCAA AAGTAAAGAG ATGCTGTTGA TTAATCTCCA CGAGCGGGAC
GATTTAGCCC CCTATGAACG CATGTTGTTC GACACCTGGG GCAACCAGAT TCAAACCTTG
TGCCTGTTAC CGCTGATGTC TGGCGACACC ATGCTGGGCG TGCTGAAACT GGCGCAATGC
GAAGAGAAAG TGTTTACCAC TACCAATCTG AATTTACTGC GCCAGATTGC CGAACGTGTG
GCAATCGCTG TCGATAACGC CCTCGCCTAT CAGGAAATCC ATCGTCTGAA AGAACGGCTG
GTTGATGAAA ACCTCGCCCT GACCGAGCAG CTCAACAATG TTGATAGTGA ATTTGGCGAG
ATTATTGGCC GCAGCGAAGC CATGTACAGC GTGCTTAAAC AAGTTGAAAT GGTGGCGCAA
AGTGACAGTA CCGTGCTGAT CCTCGGTGAA ACTGGCACGG GTAAAGAGCT GATTGCCCGT
GCTATCCATA ATCTCAGTGG GCGTAATAAT CGCCGCATGG TAAAAATGAA CTGCGCGGCG
ATGCCTGCCG GATTGCTGGA GAGCGATCTG TTTGGTCATG AGCGTGGGGC TTTTACCGGT
GCCAGCGCCC AGCGCATTGG TCGTTTTGAA CTGGCGGATA AAAGCTCCCT GTTCCTCGAC
GAAGTGGGCG ATATGCCGCT GGAGTTACAG CCGAAGTTGC TGCGTGTATT ACAGGAACAG
GAGTTTGAAC GCCTCGGCAG CAACAAAATC ATTCAGACGG ACGTACGTTT AATTGCCGCG
ACTAACCGCG ATCTGAAAAA AATGGTCGCC GACCGAGAGT TCCGTAGTGA TCTCTATTAC
CGCCTGAACG TATTCCCGAT CCACCTGCCG CCACTACGCG AGCGTCCGGA AGATATTCCG
CTGCTGGCGA AAGCCTTTAC CTTCAAAATT GCCCGTCGTC TGGGGCGCAA TATCGACAGC
ATTCCTGCCG AGACGCTGCG CATCCTGAGC AACATGGAAT GGCCGGGCAA CGTGCGTGAG
CTGGAAAACG TCATCGAGCG CGCGGTATTG CTAACACGCG GCAACGTGCT GCAGCTGTCA
TTGCCAGATA TTGCTTTACC AGAACCTGAA ACGCCGCCTG CCGCAACGGT TGTCGCCCAG
GAGGGCGAAG ATGAATATCA GTTGATTGTT CGCGTGCTGA AAGAAACAAA CGGCGTGGTT
GCCGGGCCTA AAGGCGCTGC GCAACGTCTG GGGCTGAAAC GCACGACTCT GCTGTCACGG
ATGAAGCGAC TGGGAATTGA TAAATCGGCA TTGATTTAA
 
Protein sequence
MSYTPMSDLG QQGLFDITRT LLQQPDLASL CEALSQLVKR SALADNAAIV LWQAQTQRAS 
YYASREKDTP IKYEDETVLA HGPVRSILSR PDTLHCSYEE FCETWPQLAA GGLYPKFGHY
CLMPLAAEGH IFGGCEFIRY DDRPWSEKEF NRLQTFTQIV SVVTEQIQSR VVNNVDYELL
CRERDNFRIL VAITNAVLSR LDMDELVSEV AKEIHYYFDI DDISIVLRSH RKNKLNIYST
HYLDKQHPAH EQSEVDEAGT LTERVFKSKE MLLINLHERD DLAPYERMLF DTWGNQIQTL
CLLPLMSGDT MLGVLKLAQC EEKVFTTTNL NLLRQIAERV AIAVDNALAY QEIHRLKERL
VDENLALTEQ LNNVDSEFGE IIGRSEAMYS VLKQVEMVAQ SDSTVLILGE TGTGKELIAR
AIHNLSGRNN RRMVKMNCAA MPAGLLESDL FGHERGAFTG ASAQRIGRFE LADKSSLFLD
EVGDMPLELQ PKLLRVLQEQ EFERLGSNKI IQTDVRLIAA TNRDLKKMVA DREFRSDLYY
RLNVFPIHLP PLRERPEDIP LLAKAFTFKI ARRLGRNIDS IPAETLRILS NMEWPGNVRE
LENVIERAVL LTRGNVLQLS LPDIALPEPE TPPAATVVAQ EGEDEYQLIV RVLKETNGVV
AGPKGAAQRL GLKRTTLLSR MKRLGIDKSA LI
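
The relationship between the gene and protein sequences above can be illustrated in plain Python. This is a sketch under stated assumptions, not the method the database used: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is built from the standard NCBI amino-acid string, which translation table 11 shares for elongation codons. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI standard layout).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups and the final nine bases of the gene sequence.
first_block = "ATGTCATATA CACCGATGAG TGATCTCGGA"
last_block = "TTGATTTAA"
print(translate(first_block))  # MSYTPMSDLG
print(translate(last_block))   # LI
```

The two printed fragments match the first ten and last two residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the whole protein and the GC content field.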