Gene EcolC_1194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1194 
Symbol 
ID6065193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1308904 
End bp1310922 
Gene Length2019 bp 
Protein Length672 aa 
Translation table11 
GC content55% 
IMG OID641600609 
Producthydrogenase 4 subunit B 
Protein accessionYP_001724187 
Protein GI170019233 
COG category[C] Energy production and conversion
[P] Inorganic ion transport and metabolism 
COG ID[COG0651] Formate hydrogenlyase subunit 3/Multisubunit Na+/H+ antiporter, MnhD subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGCCC TGCAATTATT AACCTGGTCG CTGATCCTCT ATCTGTTTGC CAGTCTGGCT 
TCGCTGTTTT TACTCGGTCT GGACAGACTG GCTATTAAGC TTTCCGGCAT CACATCGCTG
GTGGGCGGCG TGATTGGCAT CATCAGCGGA ATTACGCAAT TACATGCTGG TGTAACTTTA
GTCGCCCATT TTGCCACCCC TTTTGACTTT GCCGATTTAA CCCTGCGAAT GGATAGCCTC
TCGGCATTTA TGGTGCTGGT TATCTCCTTG CTGGTGGTGG TTTGTTCGCT CTATTCATTG
ACTTATATGC GCGAATACGA GGGCAAAGGC GCGGCGGCGA TGGGCTTCTT TATGAATCTT
TTCATCGCAT CGATGGTTGC CCTGCTGGTG ATGGACAACG CTTTTTGGTT CATCGTGCTG
TTTGAAATGA TGTCGCTGTC TTCCTGGTTT CTGGTCATTG CCAGGCAGGA TAAAACGTCG
ATCAACGCTG GCATGCTCTA CTTTTTTATC GCCCACGCCG GATCGGTGCT GATAATGATC
GCCTTCTTGC TGATGGGGCG CGAAAGCGGC AGCCTCGATT TTGCCAGTTT CCGCACGCTT
TCACTTTCTC CGGGGCTGGC GTCGGCGGTG TTCCTGCTGG CCTTTTTCGG TTTTGGCGCG
AAAGCCGGGA TGATGCCGTT GCACAGCTGG TTGCCGCGCG CTCACCCTGC CGCACCATCG
CACGCTTCAG CGTTGATGTC TGGCGTAATG GTCAAAATCG GTATTTTCGG CATCCTGAAA
GTGGCGATGG ATCTGCTGGC GCAAACGGGT TTGCCGCTGT GGTGGGGCAT TCTGGTGATG
GCGATCGGCG CAATCTCCGC GCTCCTGGGC GTGCTGTATG CGCTGGCGGA ACAGGATATC
AAACGGCTAC TGGCCTGGAG TACCGTCGAA AACGTCGGCA TTATTTTGCT GGCGGTCGGT
GTGGCGATAG TCGGTCTGTC ACTGCACGAC CCGCTGCTCA CCGTGGTTGG ACTGCTCGGC
GCACTGTTTC ATCTGCTCAA CCATGCGCTG TTCAAAGGGC TGCTATTTCT CGGCGCGGGC
GCGATTATTT CGCGTTTGCA TACCCACGAC ATGGAAAAAA TGGGGGCACT GGCGAAACGG
ATGCCGTGGA CAGCCGCAGC ATGCCTGATT GGTTGCCTGG CGATATCAGC CCTTCCTCCG
CTGAATGGTT TTATCAGCGA ATGGTACACC TGGCAGTCGC TGTTCTCACT AAGTCGTGTG
GAAGCCGTAG CGCTACAACT TGCGGGTCCT ATTGCTATGG TAATGCTGGC AGTCACTGGT
GGGCTGGCAG TAATGTGCTT CGTAAAAATG TACGGTATTA CTTTCTGTGG TGCGCCGCGC
AGTACACACG CTGAAGAGGC ACAGGAAGTG CCAAATACGA TGATCGTCGC CATGCTACTG
CTCGCGGCAC TCTGCGTATT AATTGCGCTT AGTGCCAGTT GGCTGGCACC GAAGATAATG
CATATTGCCC ATGCGTTTAC CGATACCCCT CCCGTCACTG TCGCCAGCGG AATAGCACTT
GTACCCGGCA CGTTTCATAC ACGGGTCACT CCTTCATTAC TGTTGCTGTT ACTACTGGCG
ATGCCTTTGC TGCCTGGCCT TTACTGGCTG TGGTGTCGTT CGCGCCGCGC AGCGTTTCGT
CGCACAGGAG ATGCCTGGGC ATGCGGCTAC AGCTGGGAAA ATGCGATGGC CCCGTCAGGC
AATGGCGTGA TGCAGCCGCT GCGTGTGGTC TTTTCTGCGC TATTTCGTCT ACGACAACAG
CTCGACCCTA CGCTGAGGCT AAATAAAGGT CTTGCGCACG TCACCGCCAG GGCTCAGAGC
ACAGAACCCT TCTGGGATGA GCGGGTGATC CGCCCCATCG TGAGCGCCAC CCAACGGCTG
GCCAAAGAAA TACAGCATCT GCAAAGCGGC GACTTTCGTC TCTATTGCCT GTATGTGGTC
GCCGCACTGG TTGTGCTGCT AATCGCTATT GCCGTCTAA
 
Protein sequence
MDALQLLTWS LILYLFASLA SLFLLGLDRL AIKLSGITSL VGGVIGIISG ITQLHAGVTL 
VAHFATPFDF ADLTLRMDSL SAFMVLVISL LVVVCSLYSL TYMREYEGKG AAAMGFFMNL
FIASMVALLV MDNAFWFIVL FEMMSLSSWF LVIARQDKTS INAGMLYFFI AHAGSVLIMI
AFLLMGRESG SLDFASFRTL SLSPGLASAV FLLAFFGFGA KAGMMPLHSW LPRAHPAAPS
HASALMSGVM VKIGIFGILK VAMDLLAQTG LPLWWGILVM AIGAISALLG VLYALAEQDI
KRLLAWSTVE NVGIILLAVG VAIVGLSLHD PLLTVVGLLG ALFHLLNHAL FKGLLFLGAG
AIISRLHTHD MEKMGALAKR MPWTAAACLI GCLAISALPP LNGFISEWYT WQSLFSLSRV
EAVALQLAGP IAMVMLAVTG GLAVMCFVKM YGITFCGAPR STHAEEAQEV PNTMIVAMLL
LAALCVLIAL SASWLAPKIM HIAHAFTDTP PVTVASGIAL VPGTFHTRVT PSLLLLLLLA
MPLLPGLYWL WCRSRRAAFR RTGDAWACGY SWENAMAPSG NGVMQPLRVV FSALFRLRQQ
LDPTLRLNKG LAHVTARAQS TEPFWDERVI RPIVSATQRL AKEIQHLQSG DFRLYCLYVV
AALVVLLIAI AV