Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_0699 |
Symbol | |
ID | 6065733 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 751579 |
End bp | 753282 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641600105 |
Product | hydrogenase 2 large subunit |
Protein accession | YP_001723701 |
Protein GI | 170018747 |
COG category | [C] Energy production and conversion |
COG ID | [COG0374] Ni,Fe-hydrogenase I large subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGA GAATTACTAT TGATCCGGTA ACCCGTATTG AGGGGCATTT ACGCATCGAT TGCGAAATCG AAAATGGCGT CGTTTCGAAA GCATGGGCTT CCGGTACCAT GTGGCGCGGC ATGGAAGAGA TCGTGAAAAA CCGCGATCCG CGCGATGCAT GGATGATTGT GCAACGTATC TGTGGCGTAT GTACTACCAC TCACGCGCTG TCTTCCGTTC GTGCGGCAGA AAGTGCGCTG AATATCGACG TTCCGGTTAA CGCGCAATAC ATCCGTAACA TCATTCTGGC TGCGCACACC ACGCATGACC ATATCGTTCA TTTCTATCAG CTTTCGGCGC TGGACTGGGT GGACATCACT TCTGCACTGC AAGCTGACCC AACCAAAGCC TCTGAAATGC TGAAAGGCGT TTCGACCTGG CACCTGAACA GCCCGGAAGA GTTCACCAAA GTTCAGAACA AGATCAAAGA TCTGGTTGCC AGCGGTCAGT TGGGGATTTT CGCCAACGGC TACTGGGGTC ACCCGGCAAT GAAACTGCCG CCGGAAGTGA ACCTGATTGC GGTAGCGCAC TACCTGCAGG CGCTGGAGTG CCAGCGTGAC GCTAACCGCG TCGTGGCGCT GCTGGGCGGT AAAACGCCGC ACATTCAGAA CCTGGCGGTA GGCGGGGTCG CTAACCCAAT CAACCTCGAC GGTCTGGGCG TGCTGAACCT TGAGCGCCTG ATGTACATCA AGTCTTTCAT CGACAAGCTG AGCGACTTTG TTGAACAGGT TTACAAGGTC GATACCGCGG TTATCGCTGC GTTCTACCCG GAATGGCTGG AGCGTGGTAA AGGTGCGGTG AACTACCTGA GCGTGCCGGA ATTCCCGACC GACAGTAAAA ACGGCAGCTT CCTGTTCCCT GGCGGCTACA TTGAGAATGC GGATCTGTCC TCGTATCGTC CGATCACTTC TCATTCCGAT GAATACCTGA TCAAAGGCAT TCAGGAAAGC GCGAAGCACT CCTGGTATAA AGACGAAGCG CCGCAGGCAC CGTGGGAAGG TACCACCATT CCGGCTTATG ATGGTTGGTC TGACGACGGG AAATATTCCT GGGTGAAATC ACCGACTTTC TACGGCAAAA CGGTAGAAGT GGGTCCGCTG GCTAATATGC TGGTGAAACT GGCGGCAGGT CGCGAATCTA CCCAGAACAA ACTGAATGAA ATCGTTGCGA TTTATCAGAA ACTGACTGGT AACACGCTGG AAGTGGCGCA ACTGCACTCT ACGCTGGGCC GTATTATTGG TCGTACCGTT CACTGCTGCG AATTGCAGGA TATCCTGCAA AACCAATACA GTGCACTGAT CACCAATATC GGCAAAGGCG ATCACACCAC CTTCGTGAAA CCGAACATTC CGGCAACGGG TGAATTCAAA GGTGTTGGCT TCCTCGAAGC GCCGCGCGGT ATGCTCTCTC ACTGGATGGT TATTAAAGAC GGTATCATCA GCAACTACCA GGCGGTTGTT CCATCAACCT GGAACTCTGG TCCGCGTAAC TTCAATGATG ACGTCGGTCC TTACGAGCAG TCGCTGGTGG GTACACCGGT TGCCGATCCG AATAAACCGC TGGAAGTGGT GCGTACCATT CACTCCTTTG ACCCGTGCAT GGCCTGTGCG GTACACGTAG TGGATGCTGA CGGCAACGAA GTGGTTTCAG TGAAGGTTCT GTAA
|
Protein sequence | MSQRITIDPV TRIEGHLRID CEIENGVVSK AWASGTMWRG MEEIVKNRDP RDAWMIVQRI CGVCTTTHAL SSVRAAESAL NIDVPVNAQY IRNIILAAHT THDHIVHFYQ LSALDWVDIT SALQADPTKA SEMLKGVSTW HLNSPEEFTK VQNKIKDLVA SGQLGIFANG YWGHPAMKLP PEVNLIAVAH YLQALECQRD ANRVVALLGG KTPHIQNLAV GGVANPINLD GLGVLNLERL MYIKSFIDKL SDFVEQVYKV DTAVIAAFYP EWLERGKGAV NYLSVPEFPT DSKNGSFLFP GGYIENADLS SYRPITSHSD EYLIKGIQES AKHSWYKDEA PQAPWEGTTI PAYDGWSDDG KYSWVKSPTF YGKTVEVGPL ANMLVKLAAG RESTQNKLNE IVAIYQKLTG NTLEVAQLHS TLGRIIGRTV HCCELQDILQ NQYSALITNI GKGDHTTFVK PNIPATGEFK GVGFLEAPRG MLSHWMVIKD GIISNYQAVV PSTWNSGPRN FNDDVGPYEQ SLVGTPVADP NKPLEVVRTI HSFDPCMACA VHVVDADGNE VVSVKVL
|
| |