Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2906 |
Symbol | |
ID | 5592706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 2909889 |
End bp | 2911160 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640922023 |
Product | pyridine nucleotide-disulphide oxidoreductase family protein |
Protein accession | YP_001459534 |
Protein GI | 157162216 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 78 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGACG ACTGCGACAT TATTATTATT GGTGCCGGTA TTGCAGGCAC CGCTTGCGCG TTACGCTGCG CGCGAGCGGG TTTATCCGTT TTGTTACTGG AACGCGCTGA AATCCCCGGC AGCAAAAATC TTTCCGGCGG GCGGTTATAT ACCCATGCAC TCGCGGAACT CCTCCCTCAG TTTCATCTGA CTGCGCCTCT TGAACGACGC ATCACTCACG AAAGCCTTTC CCTGTTAACG CCCGATGGCG CAACGACGTT TTCCAGCTTA CAGCCCGGCG GTGAATCCTG GAGTGTATTA CGTGCACGGT TCGATCCGTG GCTGGTTGCC GAAGCCGAAA AAGAAGGTGT CGAATGCATC CCCGGAGCGA CGGTGGATGC ACTGTATGAA GAAAACGGCA GGGTGTGTGG TGTCATTTGT GGTGACGATA TTCTCCGTGC CCGTTATGTG GTGCTGGCAG AAGGTGCCAA CAGCGTCCTG GCTGAGCGTC ACGGGTTAGT GACTCGTCCT GCTGGCGAAG CGATGGCGTT GGGGATCAAA GAAGTGCTGT CGCTGGAACC ATCCGCTATT GAAGAACGTT TTCATCTGGA GAATAACGAA GGCGCAGCGT TGCTGTTCAG CGGCAGGATC TGTGATGACT TACCCGGCGG CGCATTTCTT TATACTAATC AACAAACGCT CTCGTTAGGG ATTGTTTGCC CGCTCTCTTC CCTTACGCAA AGTCGTGTTC CGGCAAGCGA GCTGCTGACT CGCTTTAAAG CGCATCCGGC AGTGCGCCCG CTTATCAAAA ACACGGAATC ACTGGAGTAT GGTGCGCATC TGGTGCCAGA AGGTGGCTTG CACAGTATGC CGGTGCAATA CGCCGGTAAC GGCTGGCTGC TGGTGGGCGA TGCGTTGCGC AGTTGCGTCA ATACCGGAAT TTCCGTGCGC GGCATGGATA TGGCGCTGAC TGGCGCGCAG GCGGCGGCAC AAACGCTGAT AAGCGCCTGC CAGCACCGCG AGCCGCAAAA TCTGTTTGCG CTTTATCATC ACAACGTCGA GCGCAGCCTG CTGTGGGATA TTCTACAGCG TTATCAGCAT GTTCCGGCGC TTTTGCAACG CCCTGGCTGG TATCGGGCGT GGCCTGCGTT AATGCAGGAT ATTTCCCGCG ATTTATGGGA TCAGGGTGAT AAACCTGTTC CACCGCTGCG CCAGTTATTC TGGCGTCATT TACGTCGTCA TGGCCTGTGG CATCTGGCGG GCGATGTTAT CAGGAGTGTT CGATGTCTGT AG
|
Protein sequence | MEDDCDIIII GAGIAGTACA LRCARAGLSV LLLERAEIPG SKNLSGGRLY THALAELLPQ FHLTAPLERR ITHESLSLLT PDGATTFSSL QPGGESWSVL RARFDPWLVA EAEKEGVECI PGATVDALYE ENGRVCGVIC GDDILRARYV VLAEGANSVL AERHGLVTRP AGEAMALGIK EVLSLEPSAI EERFHLENNE GAALLFSGRI CDDLPGGAFL YTNQQTLSLG IVCPLSSLTQ SRVPASELLT RFKAHPAVRP LIKNTESLEY GAHLVPEGGL HSMPVQYAGN GWLLVGDALR SCVNTGISVR GMDMALTGAQ AAAQTLISAC QHREPQNLFA LYHHNVERSL LWDILQRYQH VPALLQRPGW YRAWPALMQD ISRDLWDQGD KPVPPLRQLF WRHLRRHGLW HLAGDVIRSV RCL
|
| |