Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0901 |
Symbol | hcr |
ID | 6143830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 906081 |
End bp | 907049 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641615789 |
Product | HCP oxidoreductase, NADH-dependent |
Protein accession | YP_001742981 |
Protein GI | 170683685 |
COG category | [C] Energy production and conversion |
COG ID | [COG0633] Ferredoxin [COG1018] Flavodoxin reductases (ferredoxin-NADPH reductases) family 1 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.843947 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATGC CAACGAATCA ATGCCCGTGG CGGATGCAGG TTCATCACAT TACGCAAGAA ACGCCGGATG TGTGGACGAT TTCCCTGATT TGCCACGATT ACTACCCATA TCGCGCCGGG CAATATGCAC TGGTCAGCGT GCGTAACTCA GCGGAAACGC TGCGTGCTTA CACCATTTCC TCCACGCCAG GCGTGAGTGA ATACATCACC CTGACTGTGC GGCGGATTGA TGACGGTGTC GGCTCCCGGT GGCTGACGCG CGATGTAAAA CGCGGTGATT ATCTCTGGCT TTCGGACGCG ATGGGGGAAT TTACCTGCGA CGATAAAGCA GAAGATAAAT TCCTGTTGCT GGCGGCAGGC TGCGGCGTTA CGCCGATTAT GTCGATGCGT CGCTGGCTGG CGAAGAACCG TCCACAGGCC GATGTGCAGG TGATCTACAA CGTGCGTACG CCGCAGGATG TGATTTTCGC CGATGAGTGG CGTAACTATC CGGTAACGCT GGTGGCGGAA AATAACGTTA CCGAAGGCTT TATCGCTGGT CGTCTCACTC GCGAACTGCT GGCAGGTGTC CCTGATTTAG CCTCACGTAC CGTGATGACC TGTGGCCCTG CTCCATATAT GGATTGGGTA GAGCAGGAAG TGAAAGCGCT CGGCGTGACG CGTTTCTTTA AAGAGAAATT CTTCACCCCA GTAGCGGAAG CGGCGACCAG CGGTCTGAAA TTCACCAAAC TGCAACCGGC ACGAGAATTT TACGCCCCGG TTGGCACCAC GCTACTGGAG GCGCTGGAAA GCAATAACGT TCCGGTTGTC GCCGCCTGCC GCGCAGGTGT TTGCGGCTGC TGTAAGACGA AAGTGGTTTC CGGTGAATAT ACGGTGAGCA GCACAATGAC GCTGACCGAC GCCGAAATCG CTGAAGGTTA CGTACTGGCC TGCTCCTGCC ATCCGCAGGG GGATTTGGTT CTCGCATAA
|
Protein sequence | MTMPTNQCPW RMQVHHITQE TPDVWTISLI CHDYYPYRAG QYALVSVRNS AETLRAYTIS STPGVSEYIT LTVRRIDDGV GSRWLTRDVK RGDYLWLSDA MGEFTCDDKA EDKFLLLAAG CGVTPIMSMR RWLAKNRPQA DVQVIYNVRT PQDVIFADEW RNYPVTLVAE NNVTEGFIAG RLTRELLAGV PDLASRTVMT CGPAPYMDWV EQEVKALGVT RFFKEKFFTP VAEAATSGLK FTKLQPAREF YAPVGTTLLE ALESNNVPVV AACRAGVCGC CKTKVVSGEY TVSSTMTLTD AEIAEGYVLA CSCHPQGDLV LA
|
| |