Gene EcolC_0697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0697 
Symbol 
ID6065323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp749428 
End bp750414 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content55% 
IMG OID641600103 
Producthydrogenase 2 protein HybA 
Protein accessionYP_001723699 
Protein GI170018745 
COG category[C] Energy production and conversion 
COG ID[COG0437] Fe-S-cluster-containing hydrogenase components 1 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACAGAC GTAATTTTAT TAAAGCAGCC TCCTGCGGGG CATTGCTGAC GGGCGCGCTG 
CCGTCTGTCA GTCATGCGGC TGCTGAAAAC CGCCCGCCAA TTCCGGGATC GCTGGGGATG
TTGTACGACT CGACCTTGTG CGTAGGCTGC CAGGCTTGCG TCACCAAGTG TCAGGATATC
AATTTCCCTG AACGTAACCC GCAAGGGGAA CAGACCTGGT CGAACAACGA CAAACTGTCG
CCGTATACCA ATAACATCAT TCAGGTGTGG ACCAGCGGCA CAGGGGTCAA CAAAGACCAG
GAGGAGAACG GCTACGCGTA CATTAAGAAA CAGTGTATGC ACTGCGTCGA TCCGAACTGT
GTCTCTGTGT GCCCGGTCTC TGCACTGAAA AAAGATCCGA AAACCGGCAT TGTCCATTAC
GACAAAGACG TGTGCACCGG TTGCCGTTAC TGCATGGTCG CCTGTCCGTA CAACGTGCCG
AAGTACGACT ACAACAACCC GTTTGGTGCG CTGCATAAGT GCGAGCTGTG CAACCAGAAA
GGTGTGGAAC GTCTCGATAA AGGCGGTCTG CCTGGCTGCG TAGAAGTGTG CCCGGCGGGC
GCGGTGATTT TCGGTACGCG TGAAGAGCTG ATGGCGGAGG CGAAAAAACG TCTGGCGCTG
AAGCCTGGCA GCGAATACCA CTATCCGCGT CAGACGCTGA AATCTGGCGA CACTTACCTG
CATACGGTGC CGCAATATTA TCCGCATCTG TACGGCGAGA AAGAGGGCGG CGGTACTCAG
GTTCTGGTAC TGACGGGTGT GCCTTATGAA AATCTCGACC TGCCGAAACT GGACGATCTT
TCTACCGGTG CGCGTTCCGA AAATATTCAA CACACCCTGT ATAAAGGCAT GATGCTACCA
CTGGCTGTGC TGGCGGGCTT AACCGTGCTG GTTCGTCGCA ACACCAAAAA CGACCATCAC
GACGGAGGAG ACGATCATGA GTCATGA
 
Protein sequence
MNRRNFIKAA SCGALLTGAL PSVSHAAAEN RPPIPGSLGM LYDSTLCVGC QACVTKCQDI 
NFPERNPQGE QTWSNNDKLS PYTNNIIQVW TSGTGVNKDQ EENGYAYIKK QCMHCVDPNC
VSVCPVSALK KDPKTGIVHY DKDVCTGCRY CMVACPYNVP KYDYNNPFGA LHKCELCNQK
GVERLDKGGL PGCVEVCPAG AVIFGTREEL MAEAKKRLAL KPGSEYHYPR QTLKSGDTYL
HTVPQYYPHL YGEKEGGGTQ VLVLTGVPYE NLDLPKLDDL STGARSENIQ HTLYKGMMLP
LAVLAGLTVL VRRNTKNDHH DGGDDHES