Gene EcSMS35_2848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2848 
SymbolhycC 
ID6145730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2923307 
End bp2925133 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content58% 
IMG OID641617717 
Productformate hydrogenlyase subunit 3 
Protein accessionYP_001744872 
Protein GI170679710 
COG category[C] Energy production and conversion
[P] Inorganic ion transport and metabolism 
COG ID[COG0651] Formate hydrogenlyase subunit 3/Multisubunit Na+/H+ antiporter, MnhD subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.264273 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCAA TTTCCCTGAT CAATAGCGGC GTGGCGTGGT TTGTCGCCGC CGCTGTTCTG 
GCATTTCTCT TTTCTTTTCA AAAGGCGCTA AGCGGCTGGA TAGCCGGAAT TGGCGGCGCG
GTTGGTAGCC TGTATACGGC AGCCGCGGGC TTTACTGTGC TGACTGGCGC GGTTGGCGTG
AGCGGTGCGC TGTCGCTGGT AAGCTACGAT GTGCAAATCT CTCCGCTTAA CGCGATTTGG
CTGATTACGC TCGGTCTGTG CGGTCTGTTT GTCAGCCTCT ACAACATTGA CTGGCATCGC
CACGCGCAGG TGAAGTGCAA CGGCTTGCAG ATCAATATGT TGATGGCTGC CGCCGTCTGC
GCCGTCATTG CCAGCAACCT CGGCATGTTC GTGGTAATGG CCGAAGTCAT GGCCTTGTGT
GCGGTGTTCC TCACCAGCAA CAGCAAAGAG GGCAAACTGT GGTTTGCGCT GGGGCGTCTT
GGCACTCTGC TGCTGGCGAT TGCTTGCTGG CTGCTGTGGC AGCGTTACGG CACGCTGGAT
CTGCGCCTGC TGGATATGCG TATGCAACAG TTGCCGCTCG GTTCCGATAT CTGGCTGCTC
GGTGTGATTG GCTTTGGCCT GCTGGCCGGG ATTATTCCGC TGCACGGCTG GGTGCCGCAG
GCACATGCTA ACGCCTCTGC GCCAGCTGCC GCGTTGTTTT CCACGGTAGT CATGAAAATT
GGCCTGCTGG GCATTTTAAC CCTGTCACTG CTGGGCGGTA ATGCACCGCT GTGGTGGGGG
ATCGCACTGC TGGTACTCGG CATGATCACC GCATTCGTCG GCGGTCTGTA TGCGTTGATG
GAACATAATA TCCAACGTCT GCTGGCTTAC CACACCCTGG AGAATATCGG CATTATCCTG
CTGGGGCTGG GCGCTGGCGT AACGGGTATC GCGCTCGAAC AACCGGCGCT GATTGCCCTT
GGTCTGGTCG GTGGTCTGTA CCATCTGCTT AACCATAGCC TGTTCAAAAG CGTGCTTTTC
CTCGGGGCGG GGAGCGTCTG GTTCCGTACC GGTCATCGCG ATATCGAAAA ACTCGGTGGT
ATTGGCAAGA AAATGCCGGT TATCTCCATC GCCATGTTAG TCGGGCTGAT GGCAATGGCT
GCGCTGCCGC CACTGAACGG GTTTGCCGGG GAATGGGTTA TCTATCAATC CTTCTTCAAA
CTGAGCAATA GTGGCGCGTT TGTTGCCCGT CTTCTCGGGC CGCTGCTTGC CGTGGGGCTG
GCAATTACCG GTGCGCTGGC GGTGATGTGT ATGGCGAAAG TTTATGGCGT TACTTTCCTC
GGCGCGCCGC GTACCAAAGA GGCCGAAAAC GCCACCTGTG CGCCGCTCCT GATGAGCGTA
AGCGTAGTGG CACTGGCGAT TTGCTGCGTA ATTGGCGGTG TTGCTGCGCC GTGGCTACTG
CCGATGCTCT CTGCTGCTGT ACCTCTGCCG CTGGAGCCTG CTAACACCAC CGTTTCTCAA
CCCATGATCA CGTTGCTGCT GATTGCCTGC CCGCTGCTGC CATTCATCAT TATGGCGATT
TGCAAAGGCG ATCGTTTGCC GTCGCGTTCC CGCGGCGCGG CCTGGGTGTG CGGCTACGAT
CATGAAAAAT CAATGGTGAT TACCGCTCAC GGTTTTGCCA TGCCGGTGAA ACAGGCGTTT
GCGCCGGTGC TGAAACTACG CAAATGGCTG AATCCGGTGT CGCTGGTGCC GGGCTGGCAG
TGCGAGGGGA GTGCGTTGCT GTTCCGCCGG ATGGCGCTGG TTGAACTGGC GGTGCTGGTG
GTGATTATTG TTTCACGAGG AGCCTGA
 
Protein sequence
MSAISLINSG VAWFVAAAVL AFLFSFQKAL SGWIAGIGGA VGSLYTAAAG FTVLTGAVGV 
SGALSLVSYD VQISPLNAIW LITLGLCGLF VSLYNIDWHR HAQVKCNGLQ INMLMAAAVC
AVIASNLGMF VVMAEVMALC AVFLTSNSKE GKLWFALGRL GTLLLAIACW LLWQRYGTLD
LRLLDMRMQQ LPLGSDIWLL GVIGFGLLAG IIPLHGWVPQ AHANASAPAA ALFSTVVMKI
GLLGILTLSL LGGNAPLWWG IALLVLGMIT AFVGGLYALM EHNIQRLLAY HTLENIGIIL
LGLGAGVTGI ALEQPALIAL GLVGGLYHLL NHSLFKSVLF LGAGSVWFRT GHRDIEKLGG
IGKKMPVISI AMLVGLMAMA ALPPLNGFAG EWVIYQSFFK LSNSGAFVAR LLGPLLAVGL
AITGALAVMC MAKVYGVTFL GAPRTKEAEN ATCAPLLMSV SVVALAICCV IGGVAAPWLL
PMLSAAVPLP LEPANTTVSQ PMITLLLIAC PLLPFIIMAI CKGDRLPSRS RGAAWVCGYD
HEKSMVITAH GFAMPVKQAF APVLKLRKWL NPVSLVPGWQ CEGSALLFRR MALVELAVLV
VIIVSRGA