Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3082 |
Symbol | |
ID | 5734954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3890993 |
End bp | 3892261 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280226 |
Product | NADH dehydrogenase I, D subunit |
Protein accession | YP_001545848 |
Protein GI | 159899601 |
COG category | [C] Energy production and conversion |
COG ID | [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 |
TIGRFAM ID | [TIGR01962] NADH dehydrogenase I, D subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGATGC ACGCACATGC GTATCCCGAT ATTGAAATTC CAACACCTTC GCAAATTGCG GCGGTGGCAT TAAATAACGA GCCTGATGAA AACGGCGAGC AATCCATGGT GCTCAGCATG GGGCCACAAC ACCCTTCAAC GCACGGGGTG CTACGGCTAG CCGTCGAACT CAATGGCGAA AACGTGGTGC AACTGGCTCC CGACGTGGGC TTTTTGCACA CCGGCATCGA AAAAACCATC GAAACCAAAA CCTACACCAA ATCAATTGTG CTGACCGACC GCATGGATTA CCTTGCGCCG ATGTCGAACA ATATGGTGTA TGTGGCGGCG ATTGAAAAAT TGATGCAGCA AGATGTGCCA GAACGCGCCC AAACCTTGCG GGTGTTGTTG CTGGAACTGA CGCGGATCAA CTCGCACTTG GTTTGGTTGG GCACGCACGC GCTCGACTTG GCCGCGATGA GTGTGTTTTT TTATGCAATG CGTGAACGTG AGTATATTCT CGATATTTTC GAGATGCTGA CTGGCGCACG TATGATGACC AGCTATTTCC GCGTTGGTGG TTTGGCATGG GATATTCCAG CCGACTTTAT TCCAACTGTG GAAGATTTCT TGACCCACTT TTTGCCCAAG GTCGATGAAT ATGAAGATTT GTTGACCAAT AACTTGTTGT GGAAACAACG CACCAAGGGC GTAGGCGTGG TCAATGCTGC TGATGCAATT GCCCTCGGCT TGTCGGGCGC GAACCTACGG GCCAGCGGTG TCGATTGGGA TTTGCGCAAA ACCATGCCCT ACAGCGGCTA CGAAACCTAC CAATTCGATG TACCAGTCGG CCAAGCTGGC GATATTTACG ATCGCTATCG CTGTCGGGTG CAAGAAATGC GCGAAAGCGT CAAGATTGCC CGTCAAGCGA TCGAACGCGT CAAGCAAATG CATGGCCAGC CCTATGTGAC CGAAAATCGT AAGGTTGCGC CACCACCCAA GAGCGAAATT ACCTACAGCA TGGAATCGCT CATTCACCAC TTTAAGCTGT GGACGGAAGG CTTCCGCCCA CCGCGTGGTT CGGCCTATGC GGCAATCGAA TCACCACGAG GTGAAATTGG CTGTTATGTG GTGAGCGATG GCACTCCCAA ACCATGGCGT GTCCACTTCC GCGCACCGTC ATTTATTAAT TTGCAAGCCT TGCCCCACAT CGCCAAAGGC AAATTGATGG CCGACTTGGT GGCATTGATT GCGAGCATCG ATCCGGTGCT TGGTGAAGTG GATCGTTAA
|
Protein sequence | MVMHAHAYPD IEIPTPSQIA AVALNNEPDE NGEQSMVLSM GPQHPSTHGV LRLAVELNGE NVVQLAPDVG FLHTGIEKTI ETKTYTKSIV LTDRMDYLAP MSNNMVYVAA IEKLMQQDVP ERAQTLRVLL LELTRINSHL VWLGTHALDL AAMSVFFYAM REREYILDIF EMLTGARMMT SYFRVGGLAW DIPADFIPTV EDFLTHFLPK VDEYEDLLTN NLLWKQRTKG VGVVNAADAI ALGLSGANLR ASGVDWDLRK TMPYSGYETY QFDVPVGQAG DIYDRYRCRV QEMRESVKIA RQAIERVKQM HGQPYVTENR KVAPPPKSEI TYSMESLIHH FKLWTEGFRP PRGSAYAAIE SPRGEIGCYV VSDGTPKPWR VHFRAPSFIN LQALPHIAKG KLMADLVALI ASIDPVLGEV DR
|
| |