Gene EcE24377A_3463 details


Gene Information

Locus tag: EcE24377A_3463
Symbol: hybC
ID: 5586442
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 3474548
End bp: 3476251
Gene Length: 1704 bp
Protein Length: 567 aa
Translation table: 11
GC content: 53%
IMG OID: 640927090
Product: hydrogenase 2 large subunit
Protein accession: YP_001464460
Protein GI: 157157766
COG category: [C] Energy production and conversion
COG ID: [COG0374] Ni,Fe-hydrogenase I large subunit
TIGRFAM ID:
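
The coordinate and length fields in the Gene Information table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS is complete with a single terminal stop codon; the function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information table.
start_bp = 3474548
end_bp = 3476251
gene_length_bp = 1704
protein_length_aa = 567


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes an unspliced, complete CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)               # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Under these assumptions both checks pass: 3476251 - 3474548 + 1 = 1704 bp, and 1704 / 3 - 1 = 567 aa.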


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCAGA GAATTACTAT TGATCCGGTA ACCCGTATTG AGGGGCATTT ACGCATCGAT 
TGCGAAATCG AAAATGGCGT CGTTTCGAAA GCATGGGCTT CCGGTACCAT GTGGCGCGGC
ATGGAAGAGA TCGTGAAAAA CCGCGATCCG CGCGATGCAT GGATGATTGT GCAACGTATC
TGTGGCGTAT GTACTACCAC TCACGCGCTG TCTTCCGTTC GTGCGGCAGA AAGTGCGCTG
AATATCGACG TTCCGGTTAA CGCGCAATAC ATCCGTAACA TCATTCTGGC TGCGCACACC
ACGCATGACC ATATCGTTCA TTTCTATCAG CTTTCGGCGC TGGACTGGGT GGATATCACT
TCTGCACTGC AAGCTGACCC AACCAAAGCC TCCGAAATGC TGAAAGGCGT TTCGACCTGG
CATCTGAACA GCCCGGAAGA GTTCACCAAA GTTCAGAACA AGATCAAAGA TCTGGTTGCC
AGCGGTCAGT TGGGTATTTT CGCTAATGGC TACTGGGGTC ACCCAGCGAT GAAACTGCCG
CCGGAAGTGA ACCTGATTGC GGTAGCGCAC TACCTGCAGG CGCTGGAGTG CCAGCGTGAC
GCTAACCGCG TCGTGGCGCT GCTGGGCGGT AAAACGCCGC ACATTCAGAA CCTGGCAGTG
GGTGGTGTCG CGAACCCAAT CAACCTCGAC GGGTTGGGCG TGTTGAACCT TGAGCGCCTG
ATGTACATCA AGTCTTTCAT CGACAAACTG AGCGACTTTG TTGAGCAGGT TTATAAGGTT
GATACCGCGG TTATCGCCGC GTTCTACCCG GAATGGCTGA CGCGCGGTAA AGGTGCGGTG
AACTACCTGA GCGTGCCGGA ATTCCCGACC GACAGCAAAA ACGGCAGCTT CCTGTTCCCG
GGCGGCTACA TTGAGAATGC GGATCTGTCC TCGTATCGTC CGATCACTTC TCATTCCGAT
GAATACCTGA TCAAAGGCAT TCAGGAAAGC GCGAAGCACT CCTGGTATAA AGACGAAGCG
CCACAGGCGC CGTGGGAAGG TACCACCATT CCGGCTTATG ATGGTTGGTC TGACGACGGG
AAATATTCCT GGGTGAAATC ACCGACTTTC TACGGCAAAA CGGTAGAAGT GGGTCCGCTG
GCTAATATGC TGGTGAAACT GGCGGCAGGT CGAGAATCTA CCCAGAACAA ACTGAATGAA
ATCGTTGCGA TTTATCAGAA ACTGACTGGT AACACGCTGG AAGTGGCGCA ACTGCACTCT
ACGCTGGGCC GTATTATTGG TCGTACCGTT CACTGCTGCG AATTGCAGGA TATCCTGCAA
AACCAATACA GTGCACTGAT CACCAATATC GGCAAAGGCG ATCACACCAC CTTCGTGAAA
CCGAACATTC CGGCAACGGG TGAGTTCAAA GGTGTTGGCT TCCTCGAAGC GCCGCGCGGT
ATGCTCTCTC ACTGGATGGT GATCAAAGAC GGTATCATCA GCAACTACCA GGCAGTTGTT
CCATCAACCT GGAACTCTGG TCCGCGTAAC TTCAATGATG ACGTCGGTCC TTACGAGCAG
TCGCTGGTGG GTACACCGGT TGCCGATCCG AATAAACCGC TGGAAGTGGT GCGTACCATT
CACTCCTTCG ACCCGTGCAT GGCCTGTGCG GTACACGTAG TGGATGCTGA CGGCAACGAA
GTGGTTTCAG TGAAGGTTCT GTAA
 
Protein sequence
MSQRITIDPV TRIEGHLRID CEIENGVVSK AWASGTMWRG MEEIVKNRDP RDAWMIVQRI 
CGVCTTTHAL SSVRAAESAL NIDVPVNAQY IRNIILAAHT THDHIVHFYQ LSALDWVDIT
SALQADPTKA SEMLKGVSTW HLNSPEEFTK VQNKIKDLVA SGQLGIFANG YWGHPAMKLP
PEVNLIAVAH YLQALECQRD ANRVVALLGG KTPHIQNLAV GGVANPINLD GLGVLNLERL
MYIKSFIDKL SDFVEQVYKV DTAVIAAFYP EWLTRGKGAV NYLSVPEFPT DSKNGSFLFP
GGYIENADLS SYRPITSHSD EYLIKGIQES AKHSWYKDEA PQAPWEGTTI PAYDGWSDDG
KYSWVKSPTF YGKTVEVGPL ANMLVKLAAG RESTQNKLNE IVAIYQKLTG NTLEVAQLHS
TLGRIIGRTV HCCELQDILQ NQYSALITNI GKGDHTTFVK PNIPATGEFK GVGFLEAPRG
MLSHWMVIKD GIISNYQAVV PSTWNSGPRN FNDDVGPYEQ SLVGTPVADP NKPLEVVRTI
HSFDPCMACA VHVVDADGNE VVSVKVL
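
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is an illustration only, not the method used to produce this record. It builds the codon table by hand, relying on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in which start codons are allowed. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence as listed above.
first_row = "ATGAGCCAGA GAATTACTAT TGATCCGGTA ACCCGTATTG AGGGGCATTT ACGCATCGAT"
print(translate(first_row))  # MSQRITIDPVTRIEGHLRID
```

The printed residues match the first 20 residues of the protein sequence shown above. Passing the full gene sequence to `gc_percent` would allow a comparison with the GC content field in the Gene Information table.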