Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3280 |
Symbol | hybC |
ID | 6145135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3356611 |
End bp | 3358314 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641618110 |
Product | hydrogenase 2 large subunit |
Protein accession | YP_001745260 |
Protein GI | 170680167 |
COG category | [C] Energy production and conversion |
COG ID | [COG0374] Ni,Fe-hydrogenase I large subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGA GAATTACTAT TGATCCGGTA ACCCGTATTG AAGGGCATTT ACGCATCGAT TGCGAAATCG AAAATGGCGT CGTTTCGAAA GCATGGGCTT CCGGTACCAT GTGGCGCGGC ATGGAAGAGA TCGTGAAAAA CCGCGATCCG CGCGATGCAT GGATGATTGT GCAACGTATC TGTGGCGTAT GTACTACCAC TCACGCGCTG TCTTCCGTTC GTGCGGCAGA AAGTGCACTG AATATCGACG TTCCGGTTAA CGCGCAATAC ATCCGTAACA TCATTCTGGC TGCGCACACC ACGCATGACC ATATCGTTCA TTTCTATCAG CTCTCGGCGC TGGACTGGGT GGACATCACT TCTGCACTGC AAGCTGACCC AACCAAAGCC TCCGAAATGC TGAAAGGTGT TTCGACCTGG CACCTGAACA GCCCGGAAGA GTTCACCAAA GTTCAGAACA AGATTAAAGA TCTGGTTGCC AGCGGTCAGT TGGGGATTTT CGCCAATGGC TACTGGGGTC ACCCGGCGAT GAAACTGCCG CCAGAAGTGA ACCTGATTGC GGTAGCGCAC TACCTGCAGG CGCTGGAGTG CCAGCGTGAC GCTAACCGCG TCGTGGCGCT GCTGGGCGGT AAAACGCCAC ACATTCAGAA CCTGGCGGTA GGTGGGGTCG CTAACCCAAT CAACCTCGAC GGTCTGGGCG TGCTGAACCT TGAGCGCCTG ATGTACATCA AGTCTTTCAT CGATAAACTG AGCGACTTTG TTGAGCAGGT TTATAAGGTT GATACCGCGG TTATCGCCGC GTTCTACCCG GAATGGCTGA CGCGCGGTAA AGGTGCGGTG AACTACCTGA GCGTGCCGGA ATTCCCGACC GACAGCAAAA ACGGCAGCTT CCTGTTCCCG GGCGGCTACA TTGAGAATGC GGATCTGTCC TCGTATCGCC CGATCACTTC CCACTCCGAT GAATACCTGA TTAAAGGCAT TCAGGAGAGC GCGAAGCACT CCTGGTATAA AGACGAAGCG CCGCAGGCAC CGTGGGAAGG TACCACCATT CCGGCTTATG ATGGTTGGTC TGACGACGGT AAATATTCCT GGGTGAAATC ACCGACTTTC TACGGCAAAA CGGTAGAAGT GGGTCCGCTG GCTAACATGC TGGTGAAACT GGCGGCAGGA CGCGAGTCTA CCCAGAACAA ACTGAATGAA ATCGTTGCGA TTTATCAGAA ACTGACTGGC AACACGCTGG AAGTGGCGCA GCTGCACTCC ACGCTGGGCC GTATTATTGG TCGTACCGTT CACTGCTGTG AATTGCAGGA TATCCTGCAA AATCAATACA GTGCACTGAT CACCAATATC GGCAAAGGCG ATCACACCAC CTTTGTGAAG CCGAACATTC CGGCAACGGG TGAGTTCAAA GGCGTAGGTT TCCTCGAAGC ACCGCGCGGT ATGCTCTCTC ACTGGATGGT TATTAAAGAC GGTATCATCA GCAACTACCA GGCGGTTGTT CCATCAACCT GGAACTCTGG TCCGCGTAAC TTCAATGATG ACGTCGGTCC TTACGAGCAG TCGCTGGTGG GTACGCCGGT TGCCGATCCG AATAAACCGC TGGAAGTGGT GCGTACCATT CACTCCTTCG ACCCGTGCAT GGCCTGTGCG GTACACGTAG TGGATGCTGA CGGCAACGAA GTGGTTTCAG TGAAGGTTCT GTAA
|
Protein sequence | MSQRITIDPV TRIEGHLRID CEIENGVVSK AWASGTMWRG MEEIVKNRDP RDAWMIVQRI CGVCTTTHAL SSVRAAESAL NIDVPVNAQY IRNIILAAHT THDHIVHFYQ LSALDWVDIT SALQADPTKA SEMLKGVSTW HLNSPEEFTK VQNKIKDLVA SGQLGIFANG YWGHPAMKLP PEVNLIAVAH YLQALECQRD ANRVVALLGG KTPHIQNLAV GGVANPINLD GLGVLNLERL MYIKSFIDKL SDFVEQVYKV DTAVIAAFYP EWLTRGKGAV NYLSVPEFPT DSKNGSFLFP GGYIENADLS SYRPITSHSD EYLIKGIQES AKHSWYKDEA PQAPWEGTTI PAYDGWSDDG KYSWVKSPTF YGKTVEVGPL ANMLVKLAAG RESTQNKLNE IVAIYQKLTG NTLEVAQLHS TLGRIIGRTV HCCELQDILQ NQYSALITNI GKGDHTTFVK PNIPATGEFK GVGFLEAPRG MLSHWMVIKD GIISNYQAVV PSTWNSGPRN FNDDVGPYEQ SLVGTPVADP NKPLEVVRTI HSFDPCMACA VHVVDADGNE VVSVKVL
|
| |