Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1758 |
Symbol | |
ID | 4710515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1931254 |
End bp | 1932267 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639856227 |
Product | respiratory-chain NADH dehydrogenase, subunit 1 |
Protein accession | YP_001003324 |
Protein GI | 121998537 |
COG category | [C] Energy production and conversion |
COG ID | [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.482564 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACGAGA CACTCTTCTG GCAGTTCGCC AAGATCTGGG CCGTCCTGAT CCCGCTGTTC CTGGCCGTGG CCTATTTCAC CTACGTCGAG CGGCGGGTGA TTGGCCACAT GCAGGATCGC CGCGGGCCTA ACCGCGTCGG CCCCCGCGGG TTGCTGCAGC CCATTGCCGA TGCGCTGAAG CTCCTGTTCA AAGAGATCAC CATCCCCACC TACGCCAGCC GGACGCTGTT CCTGATCGCA CCGGCGATGG CCATCATGCC GGCGCTGGCG GCCTGGGCGG TGATCCCCTT CGATGATGGC CTGGTGGTTG CGGACATCAA TGCGGGGCTG CTCTACATCC TGGCGATGAC CTCGCTGGGC GTGTACGGAT TGATCATCGC CGGTTGGGCT TCCAACTCCA AGTACGCCCT GCTCGGTACG CTGCGGGCCT CGGCTCAGGT CGTCTCCTAT GAGATCGCGA TGGGTTTCGC GTTGGTGGGC GTGCTGATCG CTGCCGGGAC TATGAACCTG AGCGGCATCG TCCATGCCCA GGCTGGCCCT TTTTGGGAGT GGTTCTGGCT GCCGTTGTTG CCCCTGTTCC TGATCTACTG GATCTCGGGC GTGGCCGAGA CCAACCGGGC ACCGTTCGAC ATCGCCGAGG GTGAGTCGGA GATCGTCGCC GGGTTCCACG TGGAGTACTC GGGGATGGCC TTTGCGGTCT TTTTCCTGGC CGAGTACGCC AACATGCTGT TGATCTCGTT CCTGGCGGCG ACCCTGTTCC TGGGCGGTTG GCACTCACCC TTCGAGGGGC TGCCGGTCCT CGGGCCGGCC TTTGACTGGG TCCCCGGGAT CGTGTGGCTG TTCGCGAAGG CCGCTTTCTT CGCCTTCTGC TATCTGTGGT TCCGCGCTAC CTTCCCGCGC TACCGCTACG ACCAGCTCAT GCGGCTGGGC TGGAAGGTTC TGATCCCGGG CACCGTGGTG TGGCTGGTGG TGCTGACTGG CCTGGTCTAC GGCGGCGTCG GTCCTTGGTT CTGA
|
Protein sequence | MYETLFWQFA KIWAVLIPLF LAVAYFTYVE RRVIGHMQDR RGPNRVGPRG LLQPIADALK LLFKEITIPT YASRTLFLIA PAMAIMPALA AWAVIPFDDG LVVADINAGL LYILAMTSLG VYGLIIAGWA SNSKYALLGT LRASAQVVSY EIAMGFALVG VLIAAGTMNL SGIVHAQAGP FWEWFWLPLL PLFLIYWISG VAETNRAPFD IAEGESEIVA GFHVEYSGMA FAVFFLAEYA NMLLISFLAA TLFLGGWHSP FEGLPVLGPA FDWVPGIVWL FAKAAFFAFC YLWFRATFPR YRYDQLMRLG WKVLIPGTVV WLVVLTGLVY GGVGPWF
|
| |