Gene ECH74115_3962 details


Gene Information

Locus tag: ECH74115_3962
Symbol: hypF
ID: 6970084
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3663337
End bp: 3665637
Gene Length: 2301 bp
Protein Length: 766 aa
Translation table: 11
GC content: 58%
IMG OID: 643387731
Product: carbamoyltransferase HypF
Protein accession: YP_002272174
Protein GI: 209397726
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0068] Hydrogenase maturation factor
TIGRFAM ID: [TIGR00143] [NiFe] hydrogenase maturation protein HypF
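
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes inclusive 1-based coordinates and a single stop codon that counts toward the gene length but not the protein length; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3663337, 3665637)
print(length, protein_length(length))
```

With the values listed for ECH74115_3962 this prints `2301 766`, matching the Gene Length and Protein Length fields.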


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCGGCT GGACGGGTTA CGCAATTCAT GATGGCGGGA ATAACTCAAT GGCAAAAAAC 
ACATCTTGCG GTGTCCAACT GCGTATTCGA GGCAAAGTGC AGGGCGTCGG TTTTCGTCCG
TTTGTCTGGC AGCTGGCACA GCAATTAAAT CTTCACGGCG ATGTCTGTAA TGACGGCGAT
GGCGTAGAAG TCCGGCTGCT GGAAGACCCG GAAACGTTTC TTGTTCAATT GCATCAGCAC
TGCCCGCCGC TGGCGCGTAT TGATAGCGTT GAGCGTGAAC CGTTTATCTG GTCACAACTG
CCCACTGAGT TCACCATCCG CCAGAGCGCG GGCGGCGCCA TGAATACGCA AATTGTTCCC
GATGCCGCCA CTTGCCCTGC TTGCCTTGCC GAAATGAATA CCCCAGGCGA ACGGCGTTAT
CGTTATCCAT TTATCAACTG TACCCACTGC GGCCCGCGCT TCACCATTAT TCGCGCCATG
CCTTACGATC GCCCGTTTAC CGTGATGGCG GCGTTTCCGC TGTGTCCAGC CTGTGATAAA
GAGTACCGCG ACCCGCTCGA TCGTCGCTTC CACGCCCAGC CGGTGGCCTG CCCGGAGTGT
GGCCCGTATC TTGAATGGGT AAGTCATGGT GAACATGCAG AACAAGAGGC GGCATTACAG
GCGGCTATCG CACAGTTAAA AATGGGCAAC ATTGTCGCCA TCAAAGGGAT TGGCGGATTT
CATCTTGCCT GCGATGCACG TAACAGTAAC GCGGTGGCGA CACTTCGGGC GCGCAAACAT
CGCCCGGCGA AACCGCTGGC AGTCATGTTG CCAGTGGCTG ACGGTTTACC AGACGCTGCG
CGCCAGTTGC TTACCACGCC CGCCGCGCCG ATTGTGCTGG TGGATAAAAA ATACGTTCCT
GAGCTTTGTG ATGATATCGC CCCTGGCCTT AACGAAGTCG GGGTAATGTT GCCTGCGAAT
CCGCTTCAGC ATTTGCTGTT ACAGGAACTA CAATGCCCGC TGGTGATGAC CTCTGGCAAC
CTGAGCGGTA AACCACCGGC TATCAGCAAC GAACAGGCGC TGGAGGATTT GCAGGGCATT
GCCGACGGAT TCTTAATACA CAACCGCGAC ATTGTGCAGC GGATGGATGA TTCGGTGGTG
CGCGAAAGCG GCGAAATGCT GCGCCGTTCG CGGGGGTATG TGCCGGATGC GCTGGCTTTG
CCTCCGGGCT TTAAAAATGT TCCGCCTGTA CTGTGTCTCG GCGCGGATCT GAAAAATACC
TTCTGCCTGG TGCGCGGTGA ACAAGTGGTG TTGAGCCAGC ATCTGGGCGA TTTAAGTGAC
GATGGCATCC AGACGCAGTG GCGCGAAGCG TTACGCCTGA TGCAAAACAT CTACAATTTC
ACTCCGCAAT ACGTTGTGCA TGACGCACAT CCGGGCTATG TCTCCTGCCA GTGGGCGAGC
GAAATGAATC TGCCGACGCA AACGGTGCTG CATCATCATG CCCACGCAGC GGCGTGTCTG
GCAGAGCATC AGTGGCCGCT GGATGGCGGT GATGTCATTG CTTTGACGCT CGACGGTATC
GGTATGGGGG AGAACGGCGC TTTGTGGGGC GGCGAGTGCC TGCGGGTGAA CTATCGCGAA
TGTGAGCACC TGGGCGGCTT GCCTGCAGTG GCGCTTCCGG GTGGCGATTT GGCGGCGAAG
CAGCCGTGGC GTAACCTGCT GGCGCAGTGC CTGCGCTTTG TGCCGGAGTG GCAGAATTAC
CCTGAAACGG CAAGTGTGCA ACAGCAAAAC TGGAGCGTAC TGGCGCGGGC CATTGAGCGT
GGAATTAACG CGCCGCTGGC GTCATCGTGT GGGCGTTTGT TCGATGCTGT GGCGGCGGCA
CTGGGCTGTG CGCCAGCCAC GTTAAGTTAT GAAGGTGAAG CGGCTTGTGC TCTGGAGGCG
CTAGCAGCCT CATGCGACGG AGTGACGCAT CCGGTGACGA TGCCGCGGGT GGACAATCAA
CTGGATCTCG CCACTTTCTG GCAGCAGTGG CTGAACTGGC AGGCACCGGT TAATCAACGC
GCGTGGGCGT TTCATGATGC GCTGGCGCAG GGTTTTGCCG CGTTGATGCG TGAGCAGGCC
ACGATGCGTG GTATCACTAC GCTGGTATTT AGCGGCGGGG TTATTCATAA CCGTTTGCTG
CGTGCACGTC TGGCGCATTA TCTCGCTGAT TTCACATTGC TGTTTCCACA GAGTTTACCG
GCGGGTGATG GCGGTTTGTC TCTGGGGCAG GGGGTTATTG CTGCGGCGCG TTGGTTAGCG
GGTGAAGTCC AGAACGGATA A
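
GC content is the fraction of bases in the gene sequence that are G or C. The following is a minimal sketch for recomputing it from the sequence as displayed here, assuming the text is pasted with its space-separated 10-base blocks and line breaks intact; `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Return the G+C fraction of a DNA sequence, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()  # strip block spacing and newlines
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base blocks of the gene sequence above.
print(round(gc_content("GTGTCCGGCT GGACGGGTTA") * 100))
```

Passing the full 2301 bp sequence in place of the two blocks gives the value to compare against the GC content field.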
 
Protein sequence
MSGWTGYAIH DGGNNSMAKN TSCGVQLRIR GKVQGVGFRP FVWQLAQQLN LHGDVCNDGD 
GVEVRLLEDP ETFLVQLHQH CPPLARIDSV EREPFIWSQL PTEFTIRQSA GGAMNTQIVP
DAATCPACLA EMNTPGERRY RYPFINCTHC GPRFTIIRAM PYDRPFTVMA AFPLCPACDK
EYRDPLDRRF HAQPVACPEC GPYLEWVSHG EHAEQEAALQ AAIAQLKMGN IVAIKGIGGF
HLACDARNSN AVATLRARKH RPAKPLAVML PVADGLPDAA RQLLTTPAAP IVLVDKKYVP
ELCDDIAPGL NEVGVMLPAN PLQHLLLQEL QCPLVMTSGN LSGKPPAISN EQALEDLQGI
ADGFLIHNRD IVQRMDDSVV RESGEMLRRS RGYVPDALAL PPGFKNVPPV LCLGADLKNT
FCLVRGEQVV LSQHLGDLSD DGIQTQWREA LRLMQNIYNF TPQYVVHDAH PGYVSCQWAS
EMNLPTQTVL HHHAHAAACL AEHQWPLDGG DVIALTLDGI GMGENGALWG GECLRVNYRE
CEHLGGLPAV ALPGGDLAAK QPWRNLLAQC LRFVPEWQNY PETASVQQQN WSVLARAIER
GINAPLASSC GRLFDAVAAA LGCAPATLSY EGEAACALEA LAASCDGVTH PVTMPRVDNQ
LDLATFWQQW LNWQAPVNQR AWAFHDALAQ GFAALMREQA TMRGITTLVF SGGVIHNRLL
RARLAHYLAD FTLLFPQSLP AGDGGLSLGQ GVIAAARWLA GEVQNG
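
The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is an accepted initiation codon and is read as methionine at the start position. The sketch below illustrates this with the first 60 bases of the gene; the codon table is built from the standard NCBI amino acid ordering, the start codon set is the one assumed here for table 11, and `translate_cds` is a hypothetical helper, not a database function.

```python
from itertools import product

# Standard code in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons assumed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an alternative start codon as M."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # initiator methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


first_60 = "GTGTCCGGCT GGACGGGTTA CGCAATTCAT GATGGCGGGA ATAACTCAAT GGCAAAAAAC"
print(translate_cds(first_60))
```

This prints `MSGWTGYAIHDGGNNSMAKN`, the first 20 residues of the protein sequence above.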