Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0786 |
Symbol | cydB1 |
ID | 5593431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 799895 |
End bp | 801034 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640919960 |
Product | cytochrome d ubiquinol oxidase, subunit II |
Protein accession | YP_001457534 |
Protein GI | 157160216 |
COG category | [C] Energy production and conversion |
COG ID | [COG1294] Cytochrome bd-type quinol oxidase, subunit 2 |
TIGRFAM ID | [TIGR00203] cytochrome d oxidase, subunit II (cydB) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.000000190537 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGATT ATGAAGTATT GCGTTTTATC TGGTGGCTGC TGGTTGGCGT TCTGCTGATT GGTTTTGCAG TCACTGACGG TTTCGACATG GGGGTGGGCA TGCTCACCCG TTTCCTCGGT CGTAACGACA CCGAGCGTCG AATTATGATT AACTCCATTG CACCACACTG GGACGGTAAC CAGGTTTGGC TGATCACCGC GGGCGGCGCA CTCTTTGCTG CCTGGCCGAT GGTCTATGCC GCTGCGTTCT CCGGCTTCTA TGTGGCGATG ATCCTCGTGC TGGCGTCTTT GTTCTTCCGT CCGGTCGGTT TTGACTACCG CTCCAAGATT GAAGAAACCC GCTGGCGTAA CATGTGGGAC TGGGGCATCT TCATTGGTAG CTTCGTTCCG CCGCTGGTAA TTGGTGTAGC GTTCGGTAAC CTGTTGCAGG GCGTACCGTT CAACGTTGAT GAATATCTGC GTCTGTACTA CACTGGTAAC TTCTTCCAGT TGCTTAACCC GTTCGGCCTG CTGGCAGGCG TGGTGAGCGT AGGGATGATC ATTACTCAGG GCGCAACCTA TCTGCAAATG CGTACCGTGG GCGAACTGCA CCTGCGTACC CGTGCAACGG CTCAGGTGGC TGCGCTGGTG ACACTGGTCT GTTTCGCACT GGCTGGCGTA TGGGTGATGT ACGGTATCGA TGGTTATGTC GTGAAATCGA CAATGGACCA TTACGCAGCC TCTAACCCAC TGAATAAAGA AGTGGTTCGT GAAGCTGGCG CATGGCTGGT TAACTTCAAC AACACGCCAA TTCTGTGGGC TATTCCGGCA CTGGGTGTGG TTCTGCCGCT GCTGACCATC CTGACTGCAC GTATGGATAA AGCCGCGTGG GCGTTTGTGT TCTCCTCCCT GACGCTGGCC TGCATCATCC TGACAGCCGG TATCGCAATG TTCCCGTTTG TGATGCCGTC CAGCACCATG ATGAACGCAA GTCTGACAAT GTGGGATGCA ACTTCCAGCC AGCTGACGCT TAACGTCATG ACCTGGGTTG CGGTGGTTCT GGTACCGATC ATTCTGCTCT ACACCGCCTG GTGTTACTGG AAAATGTTCG GTCGTATCAC CAAAGAAGAT ATTGAACGTA ACACCCACTC TCTGTACTAA
|
Protein sequence | MIDYEVLRFI WWLLVGVLLI GFAVTDGFDM GVGMLTRFLG RNDTERRIMI NSIAPHWDGN QVWLITAGGA LFAAWPMVYA AAFSGFYVAM ILVLASLFFR PVGFDYRSKI EETRWRNMWD WGIFIGSFVP PLVIGVAFGN LLQGVPFNVD EYLRLYYTGN FFQLLNPFGL LAGVVSVGMI ITQGATYLQM RTVGELHLRT RATAQVAALV TLVCFALAGV WVMYGIDGYV VKSTMDHYAA SNPLNKEVVR EAGAWLVNFN NTPILWAIPA LGVVLPLLTI LTARMDKAAW AFVFSSLTLA CIILTAGIAM FPFVMPSSTM MNASLTMWDA TSSQLTLNVM TWVAVVLVPI ILLYTAWCYW KMFGRITKED IERNTHSLY
|
| |