Gene EcE24377A_3357 details

Gene Information

Locus tag: EcE24377A_3357
Symbol:
ID: 5586099
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 3374501
End bp: 3376156
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 55%
IMG OID: 640926988
Product: helicase/Zfx/Zfy transcription activation region domain-containing protein
Protein accession: YP_001464359
Protein GI: 157156708
COG category:
COG ID:
TIGRFAM ID:
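
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single untranslated stop codon at the end of the CDS. The record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3374501
end_bp = 3376156

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the 1656 bp and 551 aa reported in the record.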


Plasmid Coverage information

Num covering plasmid clones: 44
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTAATC GCCTGAAAAA ACTGCTTCCC GGTAACAGCA ATACCAGTAG TGCTGAGACA 
ACCGCCCCGG AAACCGCACG CCAGCCGGAA CATCTGCCGG AAGGTTTTTA TATGCCCGGG
ACTGCAGAGG AGCTGACGTC CACACCACGC AGGAAACAGT GCCTGAAGCA GTTATGGGAA
AACAGCAGTA TGCCATCTGA CGTGTATCAG CAGTTCTGCC TGACACCAAT ACAAAAACTC
CTGATGGCGG CGCAGAACGT TCCCGCCGCC AGAGACTCCC GGTGGGCAGA TGCCAACGGT
TTTGGCGACC TGACACTGCA GTTCACCACC TATGCCGTTC GTCTGGCCAG AGGATATATG
TTTCCGCCCG GTGCCACACC GGAAGAACAG GCCGCACAGT CTGGTGTATG GAATGCCGTG
GTGTTCTGGT CGGCACTGTT TTATCACCTG CCACTTCTTG CACACCTGGA GGGAGAACTG
GTCAGCGGAA AGTTATGGCA GCCGGGAATG TCTTCACCGG GCGAGGCATT TCGTTTTCGC
TACAGACAGC AACGTCTGCA GGGGGCAGAG GCTCAGCAAC TGGCAGCGGT GATGGCCGGG
CAGCTGTTGC CGGAGGGGGC GACAGCCTGG CTGGCCACTG TGCCGGGAGC ATTACAGAAT
CTGGCGGGGG CGGTCTGGCA TCAGCATCCG GAGATGGCAT TGATTCGTTC AGTCCTGAAA
ACAGCGGCAG AAGAGGTGGA GAGTCCGCTC CTCGCATTAC AGGTGACGGA AGCCGTAACA
GCACCTTTGC TCCCGGAAAA CACTGTGCAG CCTGAAGACA ATGTGCCATC TGACAGCCAG
CCAGAAACAT CAACAGAGGT CAGTGCGCCG GAAATGTCGG CGGCCGTCCC GGAGGTAGGC
GAATTTACGC TGCAGCCCTC TGTTTCAGGA ACCGATGAGG CCGAAGCAGT CGTCCCGGAT
ACGTTGCAGT CTGCAACAGG CGCGGAGGAA AAGGCTCCGG AGGAGCAGAG TGTCCATGAT
GATACCGATA TGCTGCTGAG TCTGTTTTCA GCAGTCAGTG ATGACACTGA GCCCACGGAG
GCTGATGTGG CAGAACCTGT TGAAAATAAC GAGGCAGTTT CTGATGAATC AGGTTGTATA
AACAGTGAAC AGGCTGGCGC GGAAAGTGAT CCTGCGCAAG ACACGGGGAT TTTTGGCTCT
GTTTTATGTA TCAGTGAGCC GGCTCAGGAG ATAAAAAAAT CACCTGAGCA CTCACAGGGC
CGGAACAGTA CGGAAAATGT CAGGGCTTCA GGCAGTAGTG GTGAATTTGT TGAATGGCTC
AGACATGGAC TGGATTCGGG AGAGATCCCG GTGAATCAGC CTGATGCCAG AGTTCATCTG
ATTGCCGGAT ATGCTTTTCT GCGTGTGCCG GATGTGTTTT ACCTGTATCT GAAACAGACG
GGGAGTAACC ACGATCGCCG TTATGTTCAG TCCGTATTTG AGCGTGCGGG ACTTCACCGG
GTTCGTTCCG GGGAGCGTTT TGTTCAGGCC AGGTTGTATG ATTCGGCGGA ACGAAAAGGG
CGTTATCAAC CTGTCAGCGG TTACCTGGTG AAAAGCCGCA GTCTGTTCAG CGGAAAAGGG
CTCCCCGGAG ACAGCCCGTT TATCACATTT CCGTGA
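
The GC content field under Gene Information is the share of G and C bases in a nucleotide sequence such as the one above. A minimal sketch of that calculation follows, assuming the sequence is pasted as displayed, with spaces and line breaks between the 10-base groups. The function name `gc_percent` is hypothetical and not part of any database tooling.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases in seq."""
    # Keep only nucleotide letters so spaces and newlines are ignored.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# Example input: the first displayed line of the gene sequence only.
# Pass the full 1656-base sequence to compare against the record's value.
first_line = "ATGCTTAATC GCCTGAAAAA ACTGCTTCCC GGTAACAGCA ATACCAGTAG TGCTGAGACA"
print(round(gc_percent(first_line), 1))
```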
 
Protein sequence
MLNRLKKLLP GNSNTSSAET TAPETARQPE HLPEGFYMPG TAEELTSTPR RKQCLKQLWE 
NSSMPSDVYQ QFCLTPIQKL LMAAQNVPAA RDSRWADANG FGDLTLQFTT YAVRLARGYM
FPPGATPEEQ AAQSGVWNAV VFWSALFYHL PLLAHLEGEL VSGKLWQPGM SSPGEAFRFR
YRQQRLQGAE AQQLAAVMAG QLLPEGATAW LATVPGALQN LAGAVWHQHP EMALIRSVLK
TAAEEVESPL LALQVTEAVT APLLPENTVQ PEDNVPSDSQ PETSTEVSAP EMSAAVPEVG
EFTLQPSVSG TDEAEAVVPD TLQSATGAEE KAPEEQSVHD DTDMLLSLFS AVSDDTEPTE
ADVAEPVENN EAVSDESGCI NSEQAGAESD PAQDTGIFGS VLCISEPAQE IKKSPEHSQG
RNSTENVRAS GSSGEFVEWL RHGLDSGEIP VNQPDARVHL IAGYAFLRVP DVFYLYLKQT
GSNHDRRYVQ SVFERAGLHR VRSGERFVQA RLYDSAERKG RYQPVSGYLV KSRSLFSGKG
LPGDSPFITF P