Gene EcolC_0989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0989 
Symbol 
ID6067766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1074821 
End bp1076647 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content58% 
IMG OID641600397 
Productformate hydrogenlyase subunit 3 
Protein accessionYP_001723985 
Protein GI170019031 
COG category[C] Energy production and conversion
[P] Inorganic ion transport and metabolism 
COG ID[COG0651] Formate hydrogenlyase subunit 3/Multisubunit Na+/H+ antiporter, MnhD subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.180245 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCAA TTTCCCTGAT CAATAGCGGC GTGGCGTGGT TTGTCGCCGC CGCTGTTCTG 
GCATTTCTCT TTTCTTTTCA AAAAGCGTTA AGTGGCTGGA TAGCTGGAAT TGGCGGCGCG
GTTGGTAGTC TGTATACGGC AGCCGCGGGC TTCACTGTAC TGACTGGCGC GGTTGGCGTG
AGCGGTGCGC TGTCGCTGGT AAGCTACGAT GTGCAAATCT CTCCGCTTAA CGCGATTTGG
CTGATTACGC TCGGTCTGTG CGGTCTGTTT GTCAGCCTCT ACAACATTGA CTGGCATCGC
CACGCGCAGG TGAAGTGCAA CGGCTTGCAG ATCAATATGT TGATGGCTGC CGCCGTCTGC
GCCGTCATTG CCAGCAACCT CGGCATGTTC GTGGTAATGG CCGAAATCAT GGCCCTGTGC
GCGGTGTTCC TCACCAGCAA CAGCAAAGAG GGCAAACTGT GGTTTGCGCT GGGGCGTCTT
GGCACTCTGC TGCTGGCGAT TGCTTGCTGG CTGCTGTGGC AGCGTTACGG CACGCTGGAT
CTGCGCCTGC TGGATATGCG TATGCAACAG CTGCCGCTCG GTTCCGATAT CTGGCTGCTC
GGAGTGATTG GCTTTGGCCT GCTGGCCGGG ATTATTCCGC TGCACGGCTG GGTGCCGCAG
GCACATGCGA ACGCCTCTGC GCCAGCTGCC GCGTTGTTTT CTACGGTAGT CATGAAAATT
GGCCTGCTGG GCATTTTAAC CCTGTCACTG CTGGGCGGTA ATGCACCGCT GTGGTGGGGG
ATCGCGCTGC TGGTGCTCGG CATGATCACC GCGTTTGTCG GTGGTCTGTA TGCGCTGATG
GAGCACAACA TCCAGCGCCT GCTGGCTTAC CACACCCTGG AAAATATCGG CATCATCCTG
CTGGGGCTGG GCGCTGGCGT AACGGGTATC GCGCTCGAAC AACCGGCGCT GATTGCTCTT
GGCCTGGTCG GTGGTCTGTA CCATCTGCTT AACCATAGCC TGTTCAAAAG CGTACTGTTC
CTCGGGGCGG GGAGCGTCTG GTTCCGTACC GGTCATCGCG ATATCGAAAA ACTCGGTGGT
ATTGGCAAGA AAATGCCGGT TATCTCCATC GCCATGTTAG TCGGGCTGAT GGCAATGGCT
GCGCTGCCGC CGCTGAATGG TTTTGCCGGG GAATGGGTTA TCTATCAATC ATTTTTCAAA
CTGAGCAATA GTGGCGCGTT TGTTGCCCGT CTGCTGGGGC CGCTGCTCGC TGTGGGGCTG
GCAATTACCG GTGCGCTGGC GGTGATGTGT ATGGCGAAAG TCTATGGCGT CACGTTCCTC
GGCGCGCCGC GTACCAAAGA AGCCGAAAAC GCCACCTGTG CGCCGCTCCT GATGAGCGTA
AGCGTAGTGG CACTGGCGAT TTGCTGCGTA ATTGGCGGTG TTGCTGCGCC GTGGCTACTG
CCGATGCTCT CTGCTGCTGT ACCTCTGCCG CTGGAGCCTG CTAACACCAC CGTTTCTCAA
CCGATGATCA CGTTGCTGCT GATTGCCTAC CCGCTGCTGC CATTCATCAT TATGGCGATT
TGCAAAGGCG ATCGTTTGCC ATCGCGTTCC CGCGGTGCGG CCTGGGTGTG CGGTTACGAC
CACGAAAAAT CAATGGTGAT TACCGCTCAC GGTTTTGCCA TGCCGGTGAA ACAGGCGTTT
GCGCCGGTGC TGAAACTACG CAAATGGCTG AATCCGGTGT CTCTGGTGCC GGGCTGGCAG
TGCGAGGGGA GTGCGTTGCT GTTCCGCCGG ATGGCGCTGG TTGAACTGGC GGTACTGGTG
GTGATTATTG TTTCACGAGG AGCCTGA
 
Protein sequence
MSAISLINSG VAWFVAAAVL AFLFSFQKAL SGWIAGIGGA VGSLYTAAAG FTVLTGAVGV 
SGALSLVSYD VQISPLNAIW LITLGLCGLF VSLYNIDWHR HAQVKCNGLQ INMLMAAAVC
AVIASNLGMF VVMAEIMALC AVFLTSNSKE GKLWFALGRL GTLLLAIACW LLWQRYGTLD
LRLLDMRMQQ LPLGSDIWLL GVIGFGLLAG IIPLHGWVPQ AHANASAPAA ALFSTVVMKI
GLLGILTLSL LGGNAPLWWG IALLVLGMIT AFVGGLYALM EHNIQRLLAY HTLENIGIIL
LGLGAGVTGI ALEQPALIAL GLVGGLYHLL NHSLFKSVLF LGAGSVWFRT GHRDIEKLGG
IGKKMPVISI AMLVGLMAMA ALPPLNGFAG EWVIYQSFFK LSNSGAFVAR LLGPLLAVGL
AITGALAVMC MAKVYGVTFL GAPRTKEAEN ATCAPLLMSV SVVALAICCV IGGVAAPWLL
PMLSAAVPLP LEPANTTVSQ PMITLLLIAY PLLPFIIMAI CKGDRLPSRS RGAAWVCGYD
HEKSMVITAH GFAMPVKQAF APVLKLRKWL NPVSLVPGWQ CEGSALLFRR MALVELAVLV
VIIVSRGA