Gene EcHS_A2625 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2625 
Symbol 
ID5590985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2633903 
End bp2635849 
Gene Length1947 bp 
Protein Length648 aa 
Translation table11 
GC content53% 
IMG OID640921742 
Productformate hydrogenlyase transcriptional activator 
Protein accessionYP_001459269 
Protein GI157161951 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGCGG AGCGGTTAGC GCAGAAACAC GGTAAGGCGT CTTTATTACG CGCCTTCATC 
CCGCTGCCGC CGCCGTTCAG CCCGGTACAA CTTATTGAAC TGCATGTTCT CAAAAGCAAC
TTCTATTACC GCTACCATGA TGATGGCAGC GATGTGACGG CAACAACAGA GTATCAGGGC
GAGATGGTCG ATTATTCGCG TCACGCCGTC CTTCTCGGCA GTAGTGGAAT GGCGGAGCTA
CGCTTTATTC GCACCCACGG CAGTCGTTTT ACTCCCCAGG ATTGCACACT GTTTAACTGG
CTGGCGCGGA TAATCACCCC GGTTCTGCAA TCATGGCTCA ATGATGAAGA ACAGCAGGTG
GCGCTGCGTT TGCTGGAGAA AGATCGCGAT CATCATCGGG TACTGGTTGA TATTACTAAT
GCAGTGCTGT CACATCTTGA TCTCGACGAT CTGATCGCTG ACGTCGCTCG TGAGATCCAT
CATTTTTTCG GTCTGGCTTC AGTCAGTATG GTACTGGGCG ATCATCGAAA GAACGAGAAG
TTTAGCCTGT GGTGCAGCGA TCTTTCTGCC TCACATTGTG CGTGTCTGCC ACGCAATATG
CCTGGCGACA GTGTATTGCT GACACAAACG CTACAAACCC GACAACCGAC CTTGACGCAC
CGTGCAGACG ATCTGTTTCT CTGGCAACGC GACCCGTTAT TACTCTTACT TGCATCTAAC
GGCTGCGAAT CTGCGCTCCT TATACCGCTT ACCTTTGGCA ACCATACACC GGGTGCATTG
TTGCTGGCGC ATACCTCTTC CACTCTCTTT AGTGAGGAAA ACTGCCAGCT ACTACAACAC
ATAGCCGATC GCATCGCTAT TGCCGTTGGC AATGCCGATG CCTGGCGTAG CATGACCGAT
TTGCAGGAAA GTTTGCAGCA AGAAAACCAC CAGCTTAGCG AGCAGCTCCT TTCGAATCTG
GGCATCGGTG ACATTATCTA TCAAAGCCAG GCAATGGAAG ACCTACTCCA GCAGTTAGAT
ATTGTGGCGA AGAGCGACAG TACGGTGTTG ATTTGCGGTG AAACCGGAAC CGGCAAAGAG
GTGATCGCCA GAGCGATCCA TCAACTTAGC CCGCGACGCG ACAAGCCGCT GGTCAAAATC
AACTGCGCTG CCATCCCCGC CAGTCTTCTG GAAAGTGAGT TATTCGGTCA TGACAAAGGG
GCGTTTACTG GTGCGATTAA TACCCATCGT GGTCGTTTTG AAATTGCCGA TGGCGGCACG
TTGTTTCTCG ATGAAATTGG CGATCTGCCG TTAGAACTTC AGCCTAAACT GCTGCGCGTA
TTGCAGGAAC GGGAGATTGA GCGTCTCGGC GGGAGTAGAA CGATCCCGGT AAATGTCAGA
GTCATTGCCG CCACCAACCG TGATTTGTGG CAAATGGTTG AAGATCGCCA GTTTCGCAGC
GATCTCTTTT ATCGCCTGAA TGTCTTCCCA CTGGAATTGC CGCCGCTGCG CGACCGTCCG
GAAGATATCC CTCTTTTAGC AAAGCATTTC ACGCAAAAAA TGGCGCGCCA TATGAATCGC
GCAATTGACG CCATCCCGAC CGAGGCACTA CGCCAGTTGA TGTCGTGGGA TTGGCCGGGC
AACGTGCGCG AGCTGGAAAA CGTGATTGAG CGGGCGGTAC TGTTGACTCG TGGTAACAGT
CTGAATTTAC ATCTAAATGT CCGACAAAGC CGTTTACTGC CGACGCTAAA TGAAGATTCA
GCGCTTCGCA GTTCAATGGC GCAGTTGCTG CACCCGACGA CGCCAGAGAA TGACGAAGAA
GAACGTCAGC GCATTGTTCA GGTATTGCGA GAAACCAATG GCATTGTTGC CGGGCCCCGT
GGCGCGGCGA CACGATTAGG GATGAAGCGC ACCACGCTGC TGTCACGAAT GCAGCGTCTG
GGGATCTCGG TTCGCGAGGT GTTGTAA
 
Protein sequence
MLAERLAQKH GKASLLRAFI PLPPPFSPVQ LIELHVLKSN FYYRYHDDGS DVTATTEYQG 
EMVDYSRHAV LLGSSGMAEL RFIRTHGSRF TPQDCTLFNW LARIITPVLQ SWLNDEEQQV
ALRLLEKDRD HHRVLVDITN AVLSHLDLDD LIADVAREIH HFFGLASVSM VLGDHRKNEK
FSLWCSDLSA SHCACLPRNM PGDSVLLTQT LQTRQPTLTH RADDLFLWQR DPLLLLLASN
GCESALLIPL TFGNHTPGAL LLAHTSSTLF SEENCQLLQH IADRIAIAVG NADAWRSMTD
LQESLQQENH QLSEQLLSNL GIGDIIYQSQ AMEDLLQQLD IVAKSDSTVL ICGETGTGKE
VIARAIHQLS PRRDKPLVKI NCAAIPASLL ESELFGHDKG AFTGAINTHR GRFEIADGGT
LFLDEIGDLP LELQPKLLRV LQEREIERLG GSRTIPVNVR VIAATNRDLW QMVEDRQFRS
DLFYRLNVFP LELPPLRDRP EDIPLLAKHF TQKMARHMNR AIDAIPTEAL RQLMSWDWPG
NVRELENVIE RAVLLTRGNS LNLHLNVRQS RLLPTLNEDS ALRSSMAQLL HPTTPENDEE
ERQRIVQVLR ETNGIVAGPR GAATRLGMKR TTLLSRMQRL GISVREVL