Gene ECD_02114 details

Gene Information

Locus tag: ECD_02114
Symbol: yejH
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 2175481
End bp: 2177241
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: predicted ATP-dependent helicase
Protein accession: ACT43937
Protein GI: 253978267
COG category:
COG ID:
TIGRFAM ID:
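
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming exactly one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record: Start bp 2175481, End bp 2177241.
length = gene_length_bp(2175481, 2177241)
print(length, protein_length_aa(length))  # prints: 1761 586
```

Under these assumptions the computed values agree with the listed Gene Length (1761 bp) and Protein Length (586 aa).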


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0523071
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTTTTA CACTTCGCCC ATATCAGCAA GAAGCCGTGG ATGCCACGCT CAACCATTTT 
CGTCGTCATA AAACCCCTGC CGTTATCGTG CTGCCCACCG GCGCAGGTAA AAGCCTGGTG
ATAGCGGAAC TGGCACGGCT GGCTCGTGGT CGCGTGCTGG TGCTGGCACA CGTTAAAGAA
CTGGTGGCGC AAAACCATGC AAAGTATCAG GCGCTGGGGC TGGAAGCCGA TATTTTTGCC
GCCGGGCTAA AGCGCAAAGA GAGCCACGGT AAAGTGGTAT TTGGCAGCGT GCAGTCTGTC
GCCCGTAATC TTGATGCCTT TCAGGGTGAA TTTTCGCTGT TGATTGTCGA TGAATGTCAC
CGTATTGGTG ACGATGAAGA GAGCCAGTAT CAGCAAATCC TCACTCACCT GACAAAAGTG
AATCCCCACT TACGCCTGCT GGGGCTGACT GCCACGCCTT TTCGATTGGG CAAAGGCTGG
ATCTACCAGT TTCATTATCA CGGCATGGTA CGCGGCGATG AGAAAGCCCT TTTCCGTGAC
TGCATTTATG AGCTGCCGCT GCGTTATATG ATTAAACACG GCTATCTGAC GCCGCCAGAA
CGACTGGATA TGCCAGTAGT GCAATACGAT TTCAGCCGCT TGCAGGCACA GAGTAACGGG
CTGTTCAGCG AAGCCGATCT CAACCGTGAG CTGAAAAAAC AACAACGTAT TACCCCGCAC
ATCATCAGCC AGATTATGGA GTTTGCTGCA ACGCGCAAAG GGGTGATGAT TTTTGCCGCG
ACGGTTGAAC ACGCAAAAGA GATTGTGGGA TTACTGCCTG CCGAAGATGC AGCACTGATT
ACTGGCGACA CCCCCGGCGC TGAGCGCGAT GTGTTAATTG AAAATTTTAA AGCCCAGCGT
TTTCGCTATC TGGTCAACGT CGCGGTACTG ACCACCGGAT TTGACGCCCC GCACGTCGAT
CTTATCGCCA TTCTGCGCCC TACCGAATCA GTGAGTCTTT ACCAACAAAT TGTCGGGCGC
GGTCTGCGTC TCGCTCCGGG CAAGACTGAT TGCTTAATTC TTGATTATGC GGGTAATCCT
CACGATCTCT ACGCGCCGGA AGTTGGTACA CCAAAAGGCA AAAGTGACAA CGTTCCGGTA
CAGGTTTTCT GCCCTGCCTG CGGTTTTGCC AACACCTTTT GGGGGAAAAC GACCGCCGAC
GGGACATTGA TTGAACACTT TGGTCGTCGC TGTCAGGGAT GGTTTGAAGA TGACGACGGT
CATCGCGAAC AATGTGACTT CCGTTTCCGT TTTAAAAATT GCCCGCAATG TAACGCGGAA
AACGATATTG CCGCCCGCCG CTGCCGCGAA TGTGACACCG TACTGGTTGA TCCGGACGAT
ATGTTAAAAG CGGCGCTACG ACTGAAAGAC GCGCTGGTAT TACGCTGTAG CGGCATGTCT
TTGCAACATG GGCACGACGA GAAAGGCGAA TGGTTGAAAA TCACCTATTA CGATGAAGAC
GGCGCGGATG TGAGTGAGCG TTTCCGTCTG CAAACACCTG CCCAGCGTAC CGCCTTCGAG
CAGCTTTTTA TCCGCCCGCA TACGCGCACA CCGGGCATCC CGCTGCGCTG GATCACCGCC
GCCGATATCC TCGCCCAGCA AGCCTTATTG CGACACCCGG ATTTTGTCGT CGCCCGCATG
AAAGGCCAGT ACTGGCAGGT GCGTGAAAAA GTGTTCGATT ACGAAGGTCG TTTTCGTCTG
GCGCACGAAT TACGCGGTTA A
 
Protein sequence
MIFTLRPYQQ EAVDATLNHF RRHKTPAVIV LPTGAGKSLV IAELARLARG RVLVLAHVKE 
LVAQNHAKYQ ALGLEADIFA AGLKRKESHG KVVFGSVQSV ARNLDAFQGE FSLLIVDECH
RIGDDEESQY QQILTHLTKV NPHLRLLGLT ATPFRLGKGW IYQFHYHGMV RGDEKALFRD
CIYELPLRYM IKHGYLTPPE RLDMPVVQYD FSRLQAQSNG LFSEADLNRE LKKQQRITPH
IISQIMEFAA TRKGVMIFAA TVEHAKEIVG LLPAEDAALI TGDTPGAERD VLIENFKAQR
FRYLVNVAVL TTGFDAPHVD LIAILRPTES VSLYQQIVGR GLRLAPGKTD CLILDYAGNP
HDLYAPEVGT PKGKSDNVPV QVFCPACGFA NTFWGKTTAD GTLIEHFGRR CQGWFEDDDG
HREQCDFRFR FKNCPQCNAE NDIAARRCRE CDTVLVDPDD MLKAALRLKD ALVLRCSGMS
LQHGHDEKGE WLKITYYDED GADVSERFRL QTPAQRTAFE QLFIRPHTRT PGIPLRWITA
ADILAQQALL RHPDFVVARM KGQYWQVREK VFDYEGRFRL AHELRG
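
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that, assuming the sequence is on the coding strand and pasted in the 10-base blocks shown above. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares; alternative start codons permitted by table 11 are not handled, which does not matter here because this gene begins with ATG. The names `CODON_TABLE`, `clean`, and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in TCAG order; table 11 uses the same amino-acid
# assignments as the standard code and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as displayed in the record.
first_line = "ATGATTTTTA CACTTCGCCC ATATCAGCAA GAAGCCGTGG ATGCCACGCT CAACCATTTT"
print(translate(first_line))  # prints: MIFTLRPYQQEAVDATLNHF
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after `clean`) extends the same check to the whole record.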