Gene ECH74115_3981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3981 
SymbolhypD 
ID6969187 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3680265 
End bp3681386 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content56% 
IMG OID643387750 
Producthydrogenase expression/formation protein HypD 
Protein accessionYP_002272193 
Protein GI209396571 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0409] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00075] hydrogenase expression/formation protein HypD 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTTTG TTGATGAATA TCGCGCGCCG GAACAGGTGA TGCAGTTAAT TGAGCATCTG 
CGCGAACGTG CTTCACATCT CTCTTACACC GCCGAACGCC CTCTGCGGAT TATGGAAGTG
TGCGGTGGTC ATACCCACGC CATTTTTAAA TTCGGCCTCG ACCAGTTACT GCCTGAAAAC
GTTGAGTTTA TCCACGGTCC GGGTTGCCCG GTGTGCGTAC TGCCGATGGG CAGAATCGAC
ACCTGCGTGG AGATTGCCAG CCATCCGGAA GTCATCTTCT GTACCTTTGG CGACGCCATG
CGCGTGCCGG GGAAACAGGG ATCGCTGTTG CAGGCAAAAG CACGCGGTGC CGATGTGCGC
ATCGTCTATT CGCCGATGGA TGCGTTGAAA CTGGCGCAGG AGAATCCAAC CCGCAAAGTG
GTGTTCTTCG GCTTAGGTTT TGAAACCACC ATGCCGACCA CCGCCATCAC TCTGCAACAG
GCGAAAGCGC GTGATGTGCA GAATTTTTAC TTCTTCTGCC AGCATATTAC GCTTATCCCG
ACGCTGCGCA GTTTGCTGGA ACAGCCGGAT AACGGTATCG ACGCGTTCCT CGCGCCGGGC
CACGTTAGTA TGGTTATCGG CACTGATGCC TATAATTTTA TCGCCAGCGA TTTTCAGCGT
CCGCTGGTGG TGGCTGGTTT CGAACCCCTT GATCTACTGC AAGGCGTGGT CATGCTGGTG
GAGCAGAAAA TAGCGGCCCA CAGCAAGGTA GAGAATCAGT ATCGTCGGGT GGTACCGGAT
GCCGGTAACC TGCTGGCGCA ACAGGCGATT GCCGATGTGT TCTGTGTCAA CGGCGACAGC
GAATGGCGCG GCTTAGGCGT GATTGAATCT TCTGGCGTGC ACCTGACGCC GGATTATCAA
CGATTCGATG CCGAAGCACA TTTCCGCCCG GCACCGCAGC AGGTCTGCGA TGACCCGCGC
GCGCGTTGTG GCGAAGTCTT GACGGGCAAA TGTAAGCCGC ATCAATGCCC GCTGTTTGGT
AACACCTGTA ATCCTCAAAC CGCGTTTGGT GCGCTGATGG TTTCCTCCGA AGGAGCGTGC
GCCGCGTGGT ATCAGTATCG TCAGCAGGAG AGTGAAGCGT GA
 
Protein sequence
MRFVDEYRAP EQVMQLIEHL RERASHLSYT AERPLRIMEV CGGHTHAIFK FGLDQLLPEN 
VEFIHGPGCP VCVLPMGRID TCVEIASHPE VIFCTFGDAM RVPGKQGSLL QAKARGADVR
IVYSPMDALK LAQENPTRKV VFFGLGFETT MPTTAITLQQ AKARDVQNFY FFCQHITLIP
TLRSLLEQPD NGIDAFLAPG HVSMVIGTDA YNFIASDFQR PLVVAGFEPL DLLQGVVMLV
EQKIAAHSKV ENQYRRVVPD AGNLLAQQAI ADVFCVNGDS EWRGLGVIES SGVHLTPDYQ
RFDAEAHFRP APQQVCDDPR ARCGEVLTGK CKPHQCPLFG NTCNPQTAFG ALMVSSEGAC
AAWYQYRQQE SEA