Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2631 |
Symbol | hyfD |
ID | 6147422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2689091 |
End bp | 2690530 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641617502 |
Product | hydrogenase 4 subunit D |
Protein accession | YP_001744667 |
Protein GI | 170682661 |
COG category | [C] Energy production and conversion [P] Inorganic ion transport and metabolism |
COG ID | [COG1009] NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.960654 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAATC TTGCTCTGAC GACGTTATTG CTGCCTTTTA TCGGCGCACT GGTCGTTTCG TTTTCGCCAC AACGTCGGGC CGCCGAATGG GGGATTTTGT TCGCCGCGCT GACCACGCTG TGCATGTTGT CGCTGATCTC CGCGTTTTAT CAGGCCGATA AAGTTGCCGT CACGTTGACG TTGGTCAACG TGGGGGATGT GGCGTTGTTT GGCCTGGTCA TTGATCGCGT GAGTACGCTG ATTCTGTTTG TGGTGGTGTT CCTCGGTTTG CTGGTCACGA TCTACTCCAC GGGTTATCTG ACGGATAAAA ATCGCGAACA CCCGCATAAC GGCACGAATC GTTATTACGC ATTTTTGCTG GTGTTTATCG GCGCGATGGC GGGACTGGTA CTCTCCTCAA CGCTGCTCGG TCAGTTGTTG TTTTTTGAAA TTACGGGCGG CTGCTCCTGG GCGTTGATCA GTTATTACCA GAGCGATAAA GCGCAGCGTT CAGCACTAAA AGCGTTACTT ATCACTCATA TCGGTTCGCT GGGGTTGTAT CTTGCCGCCG CCACGCTGTT TTTGCAGACC GGAACGTTTG CGCTTAGCGC GATGAGCGAG TTACACGGCG ACGCACGTTA TCTGGTTTAT GGCGGCATTC TGTTTGCCGC GTGGGGGAAA TCGGCCCAGC TACCGATGCA AGCGTGGCTA CCGGATGCAA TGGAAGCGCC AACACCGATC AGCGCCTATC TCCACGCCGC ATCGATGGTG AAAGTGGGCG TTTACATTTT TGCCCGTGCC ATTATCGACG GCGGCAATAT CCCGCATGTG ATTGGCGGCG TTGGCATGGT TATGGCACTG GTCACCATTC TTTACGGCTT CCTGATGTAT TTGCCACAGC AGGATATGAA GCGGTTGCTG GCCTGGTCGA CCATCACTCA ACTTGGCTGG ATGTTTTTCG GCTTGTCGCT CTCCATCTTC GGCTCGCGGC TGTCGCTGGA GGGCAGCATC GCCTACATCG TCAACCACGC GTTCGCTAAA AGCCTGTTTT TCCTTGTAGC AGGTGCGCTG AGTTACAGCT GCGGCACGCG CTTGTTGCCG CGTCTGCGTG GCGTATTGCA CACCCTGCCG TTGCCAGGCG TGGGTTTCTG CGTAGCCGCG CTGGCGATTA CTGGCGTACC GCCGTTCAAC GGCTTCTTCA GTAAATTCCC GCTGTTTGCT GCCGGTTTTT CGTTGTCAGT GGAGTACTGG ATCCTGCTGC CCGCCATGAT TCTGCTGATG ATTGAATCGG TCGCCAGTTT CGCCTGGTTT ATTCGCTGGT TTGGTCGCGT CGTGCCTGGC AAACCGAGCG AGGCCGTCGC CGATGCCGCA CCGCTGCCAG GATCAATGCG CCTGGTGTTG ATTGTACTGA TTGTGATGTC GCTGATTTCC AGCGTAATCG CCGCGACCTG GTTGCAGTAA
|
Protein sequence | MENLALTTLL LPFIGALVVS FSPQRRAAEW GILFAALTTL CMLSLISAFY QADKVAVTLT LVNVGDVALF GLVIDRVSTL ILFVVVFLGL LVTIYSTGYL TDKNREHPHN GTNRYYAFLL VFIGAMAGLV LSSTLLGQLL FFEITGGCSW ALISYYQSDK AQRSALKALL ITHIGSLGLY LAAATLFLQT GTFALSAMSE LHGDARYLVY GGILFAAWGK SAQLPMQAWL PDAMEAPTPI SAYLHAASMV KVGVYIFARA IIDGGNIPHV IGGVGMVMAL VTILYGFLMY LPQQDMKRLL AWSTITQLGW MFFGLSLSIF GSRLSLEGSI AYIVNHAFAK SLFFLVAGAL SYSCGTRLLP RLRGVLHTLP LPGVGFCVAA LAITGVPPFN GFFSKFPLFA AGFSLSVEYW ILLPAMILLM IESVASFAWF IRWFGRVVPG KPSEAVADAA PLPGSMRLVL IVLIVMSLIS SVIAATWLQ
|
| |