Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2630 |
Symbol | hyfC |
ID | 6144484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2688127 |
End bp | 2689074 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641617501 |
Product | hydrogenase-4 component C |
Protein accession | YP_001744666 |
Protein GI | 170683202 |
COG category | [C] Energy production and conversion |
COG ID | [COG0650] Formate hydrogenlyase subunit 4 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACAAA CTCTTTGCGA CGGATATCTG ATCATTTTTG CGTTAGCACA GGCCGTGATT CTGCTGGCGC TAACCCCGCT TTTTACGGGG ATTTCCCGGC AGATACGCGC GCGTATGCAC TCCCGTCGCG GACCGGGGAT CTGGCAGGAT TATCGCGATA TCCACAAACT GTTTAAACGC CAGGAAGTTG CGCCGACATC TTCAGGTCTG ATGTTCCGCC TGATGCCGTG GGTATTAATC AGCAGCATGC TGGTGCTGGC GATGGCCCTA CCACTATTTA TTACTGTTTC CCCTTTTGCG GGCGGCGGCG ATCTGATCAC CCTTATCTAT CTTCTTGCCC TATTTCGTTT TTTCTTTGCT CTTTCCGGGC TGGATACCGG AAGCCCCTTT GCGGGAATCG GTGCCAGTCG CGAGTTAACG CTCGGCATTC TGGTCGAACC CATGCTCATT CTCTCACTGC TGGTATTGGC GCTGATAGCG GATTCCACGC ATATCGAGAT GATCAGCAAG ACGCTGGCGA CGGGCTGGAA CTCGCCGCTA ACCACCGTAC TGGCGTTACT GGCCTGTGGT TTTGCCTGCT TCATTGAGAT GGGAAAAATT CCCTTTGATG TTGCTGAAGC AGAACAGGAA TTACAGGAAG GCCCGCTGAC CGAATATTCC GGTGCCGGGC TGGCGCTGGC GAAGTTGGGG CTGGGGCTGA AACAGGTCGT GATGGCATCA CTGTTTGTGG CCCTGTTTCT GCCCTTTGGG CGCGCGCAAG AGCTTTCTCT CACCTGCCTG CTGACTTCAC TTGTCGTTAC GCTGCTCAAG GTTTTGCTGA TTTTTGTTCT GGCCTCTATC GCAGAAAACA CGCTGGCACG CGGGCGTTTT TTACTCATTC ACCATGTGAC CTGGCTTGGC TTCAGCCTTG CTGCGCTTGC CTGGGTCTTC TGGTTAACCG GTCTGTAA
|
Protein sequence | MRQTLCDGYL IIFALAQAVI LLALTPLFTG ISRQIRARMH SRRGPGIWQD YRDIHKLFKR QEVAPTSSGL MFRLMPWVLI SSMLVLAMAL PLFITVSPFA GGGDLITLIY LLALFRFFFA LSGLDTGSPF AGIGASRELT LGILVEPMLI LSLLVLALIA DSTHIEMISK TLATGWNSPL TTVLALLACG FACFIEMGKI PFDVAEAEQE LQEGPLTEYS GAGLALAKLG LGLKQVVMAS LFVALFLPFG RAQELSLTCL LTSLVVTLLK VLLIFVLASI AENTLARGRF LLIHHVTWLG FSLAALAWVF WLTGL
|
| |