Gene Information |
Locus tag | EcSMS35_2633 |
Symbol | |
ID | 6143090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2691197 |
End bp | 2692777 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641617504 |
Product | hydrogenase 4 subunit F |
Protein accession | YP_001744669 |
Protein GI | 170683161 |
COG category | [C] Energy production and conversion; [P] Inorganic ion transport and metabolism |
COG ID | [COG0651] Formate hydrogenlyase subunit 3/Multisubunit Na+/H+ antiporter, MnhD subunit |
TIGRFAM ID | |
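The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates (an assumption, though it matches the values listed here) and a single terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return cds_length_bp // 3 - 1


length_bp = gene_length(2691197, 2692777)
print(length_bp)                  # 1581
print(protein_length(length_bp))  # 526
```

Both results agree with the Gene Length and Protein Length fields above.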
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGTTATT CTGTGATGTT CGCTTTACTC CTGCTCACGC CGCTGCTTTT TTCGCTGCTC TGTTTTGCCT GCCGGAAACG GGGGCACTCT CCGACTCGCG CGGTGACAGT ATTACATAGC TTAGGGATCA CACTGCTGCT GATTCTGGCA CTCTGGGTGG TCCAAACAGC CGCTGATGCG GGAGAAATAT TCGCTGCGGG ACTGTGGCTT CATATTGATG GTCTGGGCGG TTTGTTCCTC GCCATTCTTG GTGTGATTGG CTTTCTCACC GGTGTTTACT CGATTGGCTA CATGCGTCAT GAAGTGGAGC ACGGCGAGCT TTCACCCGTT ACGCTGTGCG ATTACTACGG TTTCTTCCAT CTGTTTTTGT TCACCATGCT GCTGGTTGTT ACCAGCAATA ACCTGATTGT GATGTGGGCG GCGATCGAAG CCACCACCTT AAGCTCGGCG TTTCTGGTAG GCATTTACGG TCAGCGTTCA TCGCTGGAAG CTGCATGGAA GTACATCATT ATTTGTACTG TTGGTGTCGC TTTTGGTCTG TTCGGTACCG TGCTGGTATA CGCCAACGCC GCCAGCGTTA TGCCGCAGGC AGAAATGGCG ATATTCTGGA GCGAGGTTCT TAAGCAATCG TCCTTGCTTG ACCCAACATT AATGCTGTTG GCCTTTGTGT TTGTGCTAAT TGGCTTTGGC ACTAAAACCG GGCTATTCCC CATGCACGCC TGGCTGCCAG ATGCTCACAG TGAAGCGCCG AGTCCGGTCA GCGCCCTGCT CTCCGCCGTA TTGCTGAACT GCGCGCTGTT GGTGCTGATT CGCTATTACA TCATTATTTG CCAGGCCATC GGCAGCGATT TCCCCAACCG GTTGTTGCTC ATCTTCGGCA TGTTGTCGGT TGCCGTGGCG GCATTTTTCA TTCTGGTACA GCGGGACATT AAGCGTCTGC TGGCGTACTC CAGCGTGGAG AACATGGGAC TGGTCGCGGT GGCGTTAGGC ATTGGCGGGC CGCTGGGAAT TTTTGCCGCG CTGCTGCACA CCTTAAACCA CAGTCTGGCA AAAACGCTGC TGTTCTGCGG TTCCGGCAAT GTACTGCTCA AGTACGGCAC GCGCGATCTC AACGTCGTCT GCGGAATGCT CAAAATCATG CCATTTACCG CCGTGCTGTT TGGCGGCGGT GCGCTGGCGC TGGCCGGGAT GCCGCCCTTC AACATTTTTC TTAGCGAATT TATGACCGTT ACCGCAGGGC TGGCGCGTAA TCATCTGCTG CTTATCGTCC TGCTGTTATT GCTGTTAACG CTGGTGCTGG CGGGCCTGGT ACGGATGGCT GCGCGGGTGT TAATGGCGAA ACCGCCGCAG GCCGTTAACC GGGGTGAACT TGGCTGGTTG ACCACCTCGC CAATGGTGAT TCTGCTGGTC ATGATGCTGG CGATGGGAAC GCATATTCCA CAACCTGTCA TCAGGATCCT GGCGGGCGCT TCCACTATAG TCCTCTCAGG GACGCACGAC CTGCCTGCAC AACGTAGCAC CTGGCATGAT TTTTTGCCTT CAGGCACCGC ATCTGTTTCG GAGAAACACA GTGAACGTTA A
Protein sequence | MSYSVMFALL LLTPLLFSLL CFACRKRGHS PTRAVTVLHS LGITLLLILA LWVVQTAADA GEIFAAGLWL HIDGLGGLFL AILGVIGFLT GVYSIGYMRH EVEHGELSPV TLCDYYGFFH LFLFTMLLVV TSNNLIVMWA AIEATTLSSA FLVGIYGQRS SLEAAWKYII ICTVGVAFGL FGTVLVYANA ASVMPQAEMA IFWSEVLKQS SLLDPTLMLL AFVFVLIGFG TKTGLFPMHA WLPDAHSEAP SPVSALLSAV LLNCALLVLI RYYIIICQAI GSDFPNRLLL IFGMLSVAVA AFFILVQRDI KRLLAYSSVE NMGLVAVALG IGGPLGIFAA LLHTLNHSLA KTLLFCGSGN VLLKYGTRDL NVVCGMLKIM PFTAVLFGGG ALALAGMPPF NIFLSEFMTV TAGLARNHLL LIVLLLLLLT LVLAGLVRMA ARVLMAKPPQ AVNRGELGWL TTSPMVILLV MMLAMGTHIP QPVIRILAGA STIVLSGTHD LPAQRSTWHD FLPSGTASVS EKHSER
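The sequences above are printed in space-separated blocks of ten residues. Below is a minimal Python sketch of how one might strip that formatting and recompute summary values such as length and GC content. The helper names are hypothetical, and the short string used here is only an illustrative fragment (the first two blocks of the gene sequence), not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between 10-residue blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Illustrative fragment: the first two blocks of the gene sequence.
fragment = clean_sequence("ATGAGTTATT CTGTGATGTT")
print(len(fragment))         # 20
print(gc_content(fragment))  # 30.0
```

Applied to the complete gene sequence, the same helpers could be used to compare the recomputed length and GC percentage against the Gene Length and GC content fields, provided the sequence text was copied intact.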