Gene Information |
Locus tag | EcolC_1190 |
Symbol | |
ID | 6065804 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 1304245 |
End bp | 1305825 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641600606 |
Product | hydrogenase 4 subunit F |
Protein accession | YP_001724184 |
Protein GI | 170019230 |
COG category | [C] Energy production and conversion [P] Inorganic ion transport and metabolism |
COG ID | [COG0651] Formate hydrogenlyase subunit 3/Multisubunit Na+/H+ antiporter, MnhD subunit |
TIGRFAM ID | |
|
|
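The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1304245, 1305825)
print(length, protein_length_aa(length))  # prints: 1581 526
```

Both results match the Gene Length (1581 bp) and Protein Length (526 aa) fields.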
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.647759 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTATT CTGTGATGTT CGCTTTACTC CTGCTCACGC CGCTGCTTTT TTCGCTGCTC TGTTTTGCCT GCCGGAAACG GAGACTTTCT GCGACTCGCA CGGTGACCGT ATTACATAGC TTAGGGATCA CACTGCTGCT GATTCTGGCA CTCTGGGTGG TCCAAACTGC CGCTGATGCA GGAGAAATAT TCGCTGCGGG ACTGTGGCTT CATATTGATG GTCTGGGCGG TTTGTTCCTC GCCATTCTTG GTGTGATTGG CTTTCTCACC GGTATTTACT CGATTGGCTA CATGCGTCAT GAAGTGGCAC ACGGCGAGCT TTCACCCGTT ACGCTGTGCG ATTACTACGG TTTCTTCCAT CTGTTTTTGT TCACCATGCT GCTGGTTGTT ACCAGCAATA ACCTGATTGT GATGTGGGCG GCGATCGAAG CCACCACCTT AAGCTCGGCG TTTCTGGTAG GCATTTACGG TCAGCGTTCA TCGCTGGAAG CTGCATGGAA GTACATCATT ATTTGTACTG TTGGTGTCGC TTTTGGTCTG TTCGGTACCG TGCTGGTATA CGCCAACGCC GCCAGCGTTA TGCCGCAGGC AGAAATGGCG ATATTCTGGA GCGAGGTTCT TAAGCAATCG TCCTTGCTTG ACCCAACACT AATGCTGTTG GCCTTTGTGT TTTTGCTAAT TGGCTTTGGT ACCAAAACCG GGCTATTTCC CATGCACGCC TGGCTGCCGG ATGCTCACAG TGAAGCGCCG AGTCCGGTCA GCGCCCTGCT CTCCGCCGTA TTGCTGAACT GCGCGCTGTT GGTGCTGATT CGCTATTACA TCATTATTTG CCAAGCCATC GGCAGCGATT TCCCCAACCG GTTGTTGCTC ATCTTCGGCA TGTTGTCGGT TGCCGTGGCG GCATTTTTCA TTCTGGTACA GCGGGACATT AAGCGTCTGC TGGCGTACTC CAGCGTGGAG AACATGGGGC TGGTCGCGGT GGCGCTAGGC ATTGGCGGGC CGCTGGGAAT TTTTGCCGCG CTGCTGCACA TCTTAAACCA CAGTCTGGCA AAAACGCTGC TGTTCTGCGG TTCCGGCAAT GTACTGCTCA AGTACGGCAC GCGCGATCTC AACGTCGTCT GTGGGATGCT CAAAATCATG CCATTTACCG CCGTGCTGTT TGGCGGCGGT GCGCTGGCGC TGGCAGGGAT GCCGCCCTTC AACATTTTTC TTAGCGAATT TATGACCATT ACCGCCGGAC TGGCACGTAA TCACCTGCTG ATTATCGTCC TGCTGTTATT GCTGTTAACG CTGGTGCTGG CGGGCCTGGT ACGGATGGCT GCGCGGGTGT TAATGGCGAA ACCGCCGCAG GCCGTTAACC GGGGTGATCT CGGCTGGTTG ACCACCTCGC CAATGGTGAT TCTGCTGGTC ATGATGCTGG CGATGGGAAC GCATATTCCA CAACCTGTCA TCAGGATCCT GGCGGGCGCT TCCACTATAG TCCTCTCAGG GACGCACGAT CTGCCTGCAC AACGTAGCAC CTGGCATGAT TTTTTGCCTT CAGGCACCGC ATCTGTTTCG GAGAAACACA GTGAACGTTA A
|
Protein sequence | MSYSVMFALL LLTPLLFSLL CFACRKRRLS ATRTVTVLHS LGITLLLILA LWVVQTAADA GEIFAAGLWL HIDGLGGLFL AILGVIGFLT GIYSIGYMRH EVAHGELSPV TLCDYYGFFH LFLFTMLLVV TSNNLIVMWA AIEATTLSSA FLVGIYGQRS SLEAAWKYII ICTVGVAFGL FGTVLVYANA ASVMPQAEMA IFWSEVLKQS SLLDPTLMLL AFVFLLIGFG TKTGLFPMHA WLPDAHSEAP SPVSALLSAV LLNCALLVLI RYYIIICQAI GSDFPNRLLL IFGMLSVAVA AFFILVQRDI KRLLAYSSVE NMGLVAVALG IGGPLGIFAA LLHILNHSLA KTLLFCGSGN VLLKYGTRDL NVVCGMLKIM PFTAVLFGGG ALALAGMPPF NIFLSEFMTI TAGLARNHLL IIVLLLLLLT LVLAGLVRMA ARVLMAKPPQ AVNRGDLGWL TTSPMVILLV MMLAMGTHIP QPVIRILAGA STIVLSGTHD LPAQRSTWHD FLPSGTASVS EKHSER
|
| |
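The sequences above are displayed in space-separated blocks of ten characters. The following sketch shows one way to strip that spacing, compute GC content, and translate the gene sequence for comparison with the protein sequence. It assumes the gene sequence is already given in coding orientation (it starts with ATG and ends with TAA even though the gene is on the minus strand). Translation table 11 assigns the same amino acids to codons as the standard code and differs in permitted start codons, which does not matter here because the gene starts with ATG. The helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last blocks of the gene sequence shown above.
print(translate("ATGAGTTATT CTGTGATG"))       # prints: MSYSVM
print(translate("GAGAAACACA GTGAACGTTA A"))   # prints: EKHSER
```

Passing the full Gene sequence field to `translate` and comparing the result with `clean` applied to the Protein sequence field gives a quick consistency check; `gc_fraction` on the same input can be compared with the GC content field.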