Gene Information |
Locus tag | EcSMS35_2634 |
Symbol | |
ID | 6147381 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2692767 |
End bp | 2694482 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641617505 |
Product | hydrogenase-4, G subunit |
Protein accession | YP_001744670 |
Protein GI | 170680711 |
COG category | [C] Energy production and conversion |
COG ID | [COG3261] Ni,Fe-hydrogenase III large subunit; [COG3262] Ni,Fe-hydrogenase III component G |
TIGRFAM ID | |
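The length fields above follow from the coordinates by simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
# Values are copied from the record; function names are hypothetical.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(2692767, 2694482)
print(length_bp)                  # 1716
print(protein_length(length_bp))  # 571
```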
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGTTA ATTCATCGTC AAATCGTGGC GAAGCGATTC TCGCCGCCCT GAAAACGCAG TTCCCCGGCG CGGTGCTGGA TGAAGAGCGA CAAACGCCTG AACAGGTCAC CATTACGGTG AAAATCAATC TGCTGCCTGA CGTTGTGCAT TATCTTTATT ATCAACATGA TGGCTGGCTT CCAGTCCTGT TTGGCAACGA CGAGCGGACA CTTAACGGTC ATTACGCGGT TTATTATGCC CTTTCTATGG AAGGGGCCGA AAAATGCTGG ATCGTGGTGA AGGCACTGGT CGATGCCGAC AGTCGGGAGT TTCCGTCAGT CACACCGCGC GTCCCTGCCG CGGTCTGGGG CGAGCGAGAA ATTCGCGATA TGTACGGGCT GATTCCGGTT GGCCTGCCGG ATCAGCGTCG GCTGGTGTTG CCCGATGACT GGCCGGAAGA TATGCATCCG CTGCGCAAAG ACGCGATGGA TTATCGACTG CGCCCAGAAC CGACGACTGA TACCGAAACG TATCCGTTTA TCAACGAGGG CAACAGCGAT GCGCGGGTGA TCCCTGTCGG CCCGCTGCAT ATCACCTCCG ATGAACCAGG TCACTTCCGC TTGTTTGTGG ATGGCGAGCA AATTGTCGAT GCTGATTACC GCCTGTTTTA TGTCCATCGC GGCATGGAGA AACTGGCAGA AACGCGGATG GGCTACAACG AAGTGACCTT CTTATCCGAC CGCGTGTGTG GGATTTGCGG TTTTGCCCAC AGTGTGGCCT ATACCAACTC GGTTGAAAAT GCACTGGGGA TTGAGGTGCC GCAACGAGCG CATACTATTC GCTCGATTCT GCTGGAAGTC GAACGGCTGC ATAGTCATTT GCTCAACCTT GGCCTCTCCT GCCATTTTGT TGGTTTTGAT ACCGGCTTTA TGCAATTTTT CCGCGTGCGG GAAAAGTCGA TGACGATGGC GGAATTGCTG ACCGGGTCGC GTAAAACCTA CGGTCTGAAT CTGATTGGTG GTGTTCGCCG CGATATTCTC AAAGAGCAAC GTCTGCAAAC GCTGAAACTG GTGCGCGAGA TGCGCGCCGA CGTGTCGGAG CTGGTAGAGA TGCTGCTTGC CACGCCGAAT ATGGAACAAC GCACTCAGGG CATTGGCATT CTCGACCGAC AAATCGCCCG TGATTATAGC CCTGTCGGGC CGCTGATCCG CGGCAGTGGT TTTGCCCGTG ATTTGCGCTT TGATCACCCC TACGCCGACT ACGGTAATAT TCCCAAAACG CTGTTTACCT TTACCGGCGG CGATGTCTTC TCCCGCGTGA TGGTCCGTGT CAAAGAGACG TTTGATTCGC TGGCAATGCT GGAATTTGCC CTCGACAACA TGCCGGATAC CCCACTGCTG ACCGAAGGCT TTAGCTATAA ACCTCACGCA TTCGCGCTGG GCTTTGTTGA AGCGCCACGC GGTGAAGACG TGCACTGGAG CATGCTCGGT GATAACCAAA AATTGTTCCG CTGGCGCTGT CGTGCCGCCA CCTACGCCAA CTGGCCAGTG CTGCGTTACA TGCTGCGCGG CAATACCGTT TCTGACGCAC CGCTGATTAT CGGTAGCCTT GATCCCTGTT ACTCCTGTAC CGACCGTGTG ACGCTGGTTG ATGTGCGCAA GCGCCAGTCA AAAACCGTGC CGTATAAAGA GATCGAACGC TACGGCATTG ATCGTAACCG TTCGCCGCTG AAGTAA
|
Protein sequence | MNVNSSSNRG EAILAALKTQ FPGAVLDEER QTPEQVTITV KINLLPDVVH YLYYQHDGWL PVLFGNDERT LNGHYAVYYA LSMEGAEKCW IVVKALVDAD SREFPSVTPR VPAAVWGERE IRDMYGLIPV GLPDQRRLVL PDDWPEDMHP LRKDAMDYRL RPEPTTDTET YPFINEGNSD ARVIPVGPLH ITSDEPGHFR LFVDGEQIVD ADYRLFYVHR GMEKLAETRM GYNEVTFLSD RVCGICGFAH SVAYTNSVEN ALGIEVPQRA HTIRSILLEV ERLHSHLLNL GLSCHFVGFD TGFMQFFRVR EKSMTMAELL TGSRKTYGLN LIGGVRRDIL KEQRLQTLKL VREMRADVSE LVEMLLATPN MEQRTQGIGI LDRQIARDYS PVGPLIRGSG FARDLRFDHP YADYGNIPKT LFTFTGGDVF SRVMVRVKET FDSLAMLEFA LDNMPDTPLL TEGFSYKPHA FALGFVEAPR GEDVHWSMLG DNQKLFRWRC RAATYANWPV LRYMLRGNTV SDAPLIIGSL DPCYSCTDRV TLVDVRKRQS KTVPYKEIER YGIDRNRSPL K
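The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline. It assumes the amino acid assignments of translation table 11 (which match the standard code), that the first codon is a valid start codon and is therefore read as M even when it is GTG, and that translation stops at the first in-frame stop codon. The names `CODON_TABLE` and `translate_cds` are illustrative.

```python
# Codon table built from the usual TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS; spaces between 10-base blocks are ignored."""
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        # Assumption: the first codon is a start codon, read as Met.
        residues.append("M" if pos == 0 else amino_acid)
    return "".join(residues)


# First two 10-base blocks of the gene sequence above.
print(translate_cds("GTGAACGTTA ATTCATCGTC"))  # MNVNSS
```

The sketch does not check that the first codon really is one of the start codons permitted by table 11, so it should only be applied to sequences already annotated as CDS.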