Gene EcSMS35_2633 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2633 
Symbol 
ID6143090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2691197 
End bp2692777 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content55% 
IMG OID641617504 
Producthydrogenase 4 subunit F 
Protein accessionYP_001744669 
Protein GI170683161 
COG category[C] Energy production and conversion
[P] Inorganic ion transport and metabolism 
COG ID[COG0651] Formate hydrogenlyase subunit 3/Multisubunit Na+/H+ antiporter, MnhD subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTATT CTGTGATGTT CGCTTTACTC CTGCTCACGC CGCTGCTTTT TTCGCTGCTC 
TGTTTTGCCT GCCGGAAACG GGGGCACTCT CCGACTCGCG CGGTGACAGT ATTACATAGC
TTAGGGATCA CACTGCTGCT GATTCTGGCA CTCTGGGTGG TCCAAACAGC CGCTGATGCG
GGAGAAATAT TCGCTGCGGG ACTGTGGCTT CATATTGATG GTCTGGGCGG TTTGTTCCTC
GCCATTCTTG GTGTGATTGG CTTTCTCACC GGTGTTTACT CGATTGGCTA CATGCGTCAT
GAAGTGGAGC ACGGCGAGCT TTCACCCGTT ACGCTGTGCG ATTACTACGG TTTCTTCCAT
CTGTTTTTGT TCACCATGCT GCTGGTTGTT ACCAGCAATA ACCTGATTGT GATGTGGGCG
GCGATCGAAG CCACCACCTT AAGCTCGGCG TTTCTGGTAG GCATTTACGG TCAGCGTTCA
TCGCTGGAAG CTGCATGGAA GTACATCATT ATTTGTACTG TTGGTGTCGC TTTTGGTCTG
TTCGGTACCG TGCTGGTATA CGCCAACGCC GCCAGCGTTA TGCCGCAGGC AGAAATGGCG
ATATTCTGGA GCGAGGTTCT TAAGCAATCG TCCTTGCTTG ACCCAACATT AATGCTGTTG
GCCTTTGTGT TTGTGCTAAT TGGCTTTGGC ACTAAAACCG GGCTATTCCC CATGCACGCC
TGGCTGCCAG ATGCTCACAG TGAAGCGCCG AGTCCGGTCA GCGCCCTGCT CTCCGCCGTA
TTGCTGAACT GCGCGCTGTT GGTGCTGATT CGCTATTACA TCATTATTTG CCAGGCCATC
GGCAGCGATT TCCCCAACCG GTTGTTGCTC ATCTTCGGCA TGTTGTCGGT TGCCGTGGCG
GCATTTTTCA TTCTGGTACA GCGGGACATT AAGCGTCTGC TGGCGTACTC CAGCGTGGAG
AACATGGGAC TGGTCGCGGT GGCGTTAGGC ATTGGCGGGC CGCTGGGAAT TTTTGCCGCG
CTGCTGCACA CCTTAAACCA CAGTCTGGCA AAAACGCTGC TGTTCTGCGG TTCCGGCAAT
GTACTGCTCA AGTACGGCAC GCGCGATCTC AACGTCGTCT GCGGAATGCT CAAAATCATG
CCATTTACCG CCGTGCTGTT TGGCGGCGGT GCGCTGGCGC TGGCCGGGAT GCCGCCCTTC
AACATTTTTC TTAGCGAATT TATGACCGTT ACCGCAGGGC TGGCGCGTAA TCATCTGCTG
CTTATCGTCC TGCTGTTATT GCTGTTAACG CTGGTGCTGG CGGGCCTGGT ACGGATGGCT
GCGCGGGTGT TAATGGCGAA ACCGCCGCAG GCCGTTAACC GGGGTGAACT TGGCTGGTTG
ACCACCTCGC CAATGGTGAT TCTGCTGGTC ATGATGCTGG CGATGGGAAC GCATATTCCA
CAACCTGTCA TCAGGATCCT GGCGGGCGCT TCCACTATAG TCCTCTCAGG GACGCACGAC
CTGCCTGCAC AACGTAGCAC CTGGCATGAT TTTTTGCCTT CAGGCACCGC ATCTGTTTCG
GAGAAACACA GTGAACGTTA A
 
Protein sequence
MSYSVMFALL LLTPLLFSLL CFACRKRGHS PTRAVTVLHS LGITLLLILA LWVVQTAADA 
GEIFAAGLWL HIDGLGGLFL AILGVIGFLT GVYSIGYMRH EVEHGELSPV TLCDYYGFFH
LFLFTMLLVV TSNNLIVMWA AIEATTLSSA FLVGIYGQRS SLEAAWKYII ICTVGVAFGL
FGTVLVYANA ASVMPQAEMA IFWSEVLKQS SLLDPTLMLL AFVFVLIGFG TKTGLFPMHA
WLPDAHSEAP SPVSALLSAV LLNCALLVLI RYYIIICQAI GSDFPNRLLL IFGMLSVAVA
AFFILVQRDI KRLLAYSSVE NMGLVAVALG IGGPLGIFAA LLHTLNHSLA KTLLFCGSGN
VLLKYGTRDL NVVCGMLKIM PFTAVLFGGG ALALAGMPPF NIFLSEFMTV TAGLARNHLL
LIVLLLLLLT LVLAGLVRMA ARVLMAKPPQ AVNRGELGWL TTSPMVILLV MMLAMGTHIP
QPVIRILAGA STIVLSGTHD LPAQRSTWHD FLPSGTASVS EKHSER