Gene EcSMS35_2835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2835 
SymbolhypF 
ID6145832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2910351 
End bp2912639 
Gene Length2289 bp 
Protein Length762 aa 
Translation table11 
GC content58% 
IMG OID641617704 
Product[NiFe] hydrogenase maturation protein HypF 
Protein accessionYP_001744859 
Protein GI170680945 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0068] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00143] [NiFe] hydrogenase maturation protein HypF 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.0257983 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCGGCA GGGCGGGTTA CGAAATTCAT GATGGCGGGA ATAACTCAAT GGCAAAAAAC 
ATATCTTGCG GTGTCCAACT GCGTATTCGT GGCAAAGTGC AGGGCGTCGG TTTTCGTCCG
TTTGTCTGGC AACTGGCACA GCAATTAAAT CTTCACGGCG ATGTCTGTAA TGACGGCGAT
GGCGTAGAAG TCCGGCTGCT GGAAGCCCCG GAAACGTTTC TTGTTCAATT GCATCAGCAC
TGCCCGCCGC TGGCGCGTAT TGATAGCGTC GAGCGTGAGC CGTTTATCTG GTCACAACTG
CCCACTGAGT TCACTATCCG CCAGAGCACA GGCGGCACCA TGAATACGCA AATTGTCCCG
GATGCCGCCA CTTGCCCTGC TTGCCTTGCC GAAATGAATA CCCCGGGCGA GCGGCGTTAT
CGTTATCCGT TTATCAACTG TACCCACTGC GGCCCGCGTT TCACCATTAT TCGCGCGATG
CCTTACGACC GCCCGTTTAC CGTGATGGCG GCGTTTCCGC TGTGTCCGGC TTGTGACAAA
GAGTACCGCG ACCCGCTCGA TCGTCGCTTC CACGCCCAGC CGGTGGCCTG CCCGGAGTGT
GGCCCACATC TTGAATGGGT AAGTCATGGT GAACATGCAG AACAAGAGGC GGCATTACAG
GCGGCTATCG CACAGTTAAA AATGGGCAAC ATTGTCGCCA TCAAAGGGAT TGGCGGATTT
CATCTCGCCT GCGATGCGCG TAACAGTAAC GCGGTGGCGA CACTACGGGC ACGTAAACAT
CGCCCGGCGA AACCGCTGGC GGTCATGTTG CCAGTGGCTG ACGGTTTACC AGACGCTGCG
CGCCAGTTGC TTTCCACGCC CGCCGCGCCG ATTGTGCTGG TGGATAAAAA ATACATACCT
GAGCTTTGTG ATGATATCGC CCCTGGCCTT AACGAAGTTG GGGTGATGTT GCCTGCGAAC
CCGCTCCAGC ATTTGCTGTT ACAGGAACTG CAATGCCCGC TGGTGATGAC CTCCGGCAAC
CTGAGCGGTA AACCACCAGC TATCAGCAAC GAACAGGCGC TGGCGGATTT GCAGGGCATT
GCCGACGGAT TCTTGATACA TAACCGCGAC ATCGTGCAGC GGATGGATGA TTCGGTGGTG
CGCGAAAGCG GCGAAATGCT GCGCCGTTCG CGGGGGTATG TGCCGGATGC GCTGGCTTTG
CCTCCGGGCT TTAAAAATGT TCCGCCTGTG CTGTGTCTCG GCGCGGATCT GAAAAATACC
TTCTGCCTGG TGCGCGGTGA ACAAGCGGTG TTGAGTCAGC ATCTGGGCGA TTTAAGTGAC
GATGGCATCC AGATGCAGTG GCGCGAAGCG TTACGCCTGA TGCAAAACAT CTACGATTTT
ACCCCGCAAT ACGTTGTGCA TGACGCACAT CCGGGCTATG TCTCCAGCCA GTGGGCGCGC
GAAATGAATC TGCCGACGCA AACGGTGCTG CATCATCATG CCCACGCGGC GGCGTGTCTG
GCAGAACATC ACTGGCCACT GGAGGGCGGT GATGTCATTG CTTTGACGCT CGACGGTATC
GGTATGGGGG AGAACGGCGC TTTGTGGGGC GGCGAGTGTC TGCGGGTGAA CTATCGCGAA
TGCCAGCACC TGGGCGGCTT GCCAGCGGTG GCGCTTGCTG GGGGCGATTT GGCTGCAAAG
CAGCCGTGGC GTAACCTGCT GGCGCAGTGC CTGCGCTTTG TGCCGGAGTG GCAGAATTAC
CCTGAAACGG CAAGTGTGCA ACAGCAAAAC TGGAGCGTGC TGGCGCGGGC CATTGAGCGT
GGAATTAACG CGCCGCTGGC GTCATCGTGT GGGCGTTTGT TTGATGCTGT GGCGGCGGCA
CTGGGCTGTG CGCCAGCCAC GTTAAGTTAT GAAGGTGAAG CGGCTTGTGC TCTGGAGGCG
CTCGCAGCTT CATGCGACGG AGTGACGCAT CCGGTGACGA TGCCGCTGGT GGACAATCAA
CTGGATCTCG CCACTTTCTG GCAGCAGTGG CTGAACTGGC AGGCACCGGT TAATCAACGC
GCGTGGGCGT TTCATGATGC GCTGGCGCAG GGTTTTGCCG CGTTGATGCG TGAGCAGGCC
ACGATGCGCG GTATCACTAC GCTGGTATTT AGCGGTGGGG TTATTCATAA CCGATTGCTG
CGTACACGCC TGGCGCATTA TCTCGCTGAT TTCACATTAC TCTTTCCGCA GAGTTTACCG
GCGGGTGATG GCGGCTTGTC TCTGGGGCAG GGGGTTATTG CTGCGGCGCG CATCCTGCAT
GCTGCATAG
 
Protein sequence
MSGRAGYEIH DGGNNSMAKN ISCGVQLRIR GKVQGVGFRP FVWQLAQQLN LHGDVCNDGD 
GVEVRLLEAP ETFLVQLHQH CPPLARIDSV EREPFIWSQL PTEFTIRQST GGTMNTQIVP
DAATCPACLA EMNTPGERRY RYPFINCTHC GPRFTIIRAM PYDRPFTVMA AFPLCPACDK
EYRDPLDRRF HAQPVACPEC GPHLEWVSHG EHAEQEAALQ AAIAQLKMGN IVAIKGIGGF
HLACDARNSN AVATLRARKH RPAKPLAVML PVADGLPDAA RQLLSTPAAP IVLVDKKYIP
ELCDDIAPGL NEVGVMLPAN PLQHLLLQEL QCPLVMTSGN LSGKPPAISN EQALADLQGI
ADGFLIHNRD IVQRMDDSVV RESGEMLRRS RGYVPDALAL PPGFKNVPPV LCLGADLKNT
FCLVRGEQAV LSQHLGDLSD DGIQMQWREA LRLMQNIYDF TPQYVVHDAH PGYVSSQWAR
EMNLPTQTVL HHHAHAAACL AEHHWPLEGG DVIALTLDGI GMGENGALWG GECLRVNYRE
CQHLGGLPAV ALAGGDLAAK QPWRNLLAQC LRFVPEWQNY PETASVQQQN WSVLARAIER
GINAPLASSC GRLFDAVAAA LGCAPATLSY EGEAACALEA LAASCDGVTH PVTMPLVDNQ
LDLATFWQQW LNWQAPVNQR AWAFHDALAQ GFAALMREQA TMRGITTLVF SGGVIHNRLL
RTRLAHYLAD FTLLFPQSLP AGDGGLSLGQ GVIAAARILH AA