Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4456 |
Symbol | |
ID | 5704947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5034758 |
End bp | 5036104 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641273872 |
Product | NADH dehydrogenase subunit H |
Protein accession | YP_001539221 |
Protein GI | 159039968 |
COG category | [C] Energy production and conversion |
COG ID | [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.498468 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCTTCC TCGCCCAGGA ACCGACGCTG GCCGATTTCG GCCGGGATCC GTGGTGGCTG ATCCTGATCA AGGTCGTCTT CGCGTTCGCG TTCGGCCTGG TGGCCACGCT GCTCGGGGTC TGGTTCGAAC GGCGGGTCGT CGGCCGGATG GCGGTACGGC CCGGCCCCAA CCAGCTCGGC CCGTTCGGCC TGCTCCAGAC GCTCGCGGAC GGCGTGAAGA TGGCCTTCAA GGAGGACATC CTCCCGCGGT CCGCGGACAA GGTCGTCTAC TTCTTCGCCC CGGTCATCTC GGTGGTCTGT GCGGTCACCG CGCTGTCGGT GATGCCGTTC GGCCCGATGG TCAGCATCTT CGGGCACCAG ACGCCGTTGC AGGTCACCGA CGTGTCGGTG GCGGTGCTGC TGGTGCTGGC CTGCTCGTCG ATGGCGGTGT ACGGCGTGGT GCTGGCCGGC TGGGCCTCCG GGTCGACCTA CCCACTGCTC GGTGGTCTGC GGTCCAGCGC GCAGCTGATC TCGTATGAGA TCGCGCTGGG GCTCTCCGTC GTGGCGGTGT TCATGCTCTC GGGCACGATG TCGACCAGCG GGATCGTCGC CGCCCAGGGG GAGCGGCCCC AGGTCGAGTT CTTCGGTCTC GACGTCTCGG CTCCCGGCTG GTACGCGATC CTGCTCTTCC CGAGCTTCGT CATCTTCTTC ATCGCCATCG TCGGCGAGAC CAACCGAGCC CCGTTCGACC TGCCCGAGGC GGAGTCCGAG CTGGTCGCCG GCTTCATGAC GGAGTACAGC TCGCTGAAGT TCGCGCTCAT CATGCTCTCC GAGTACGTCG CGATGGTGAC CATGTCGGCG TTCACCGTGA CGTTGTTCCT CGGCGGCTGG CGCGCACCCT GGCCGCTGAG CATCTGGGAC GGGGCAAACT CCGGTTGGTG GCCGATGCTG TGGTTCTTCG GCAAGGTGCT CGCCCTCGTC TTCGTCTTCG TCTGGCTGCG GGGCACCCTG CCCCGGCTGC GCTACGACCA GTTCATGCGC CTCGGCTGGA AGGTCCTGCT CCCGCTCAAC CTGCTGTGGA TCCTGGTGCT GGCCGGGTGG CTGAAGACCC AGGGCTGGGA GCGCGCCGAC CGGCTGATCG CGTACGGGGC CGTCGCCGGG GTGGTGCTGA TCGTCACGCT GATCTGGCCG AGCCGCAAGC CGGCAGCGAA GCCGACGCTG GCCGAGGAGG TCAGCAACCG GCCCTATGGC AGCTTCCCGC TGCCGCCGCT GGACCTTCAG GTACCACCGA GCCCGCGAAC CCAGCGCATC GTTGCCGAGC GGGAGCCGGC CAACCTCACC ACCGGCACGG ATTCCAGGGA GGTGTGA
|
Protein sequence | MTFLAQEPTL ADFGRDPWWL ILIKVVFAFA FGLVATLLGV WFERRVVGRM AVRPGPNQLG PFGLLQTLAD GVKMAFKEDI LPRSADKVVY FFAPVISVVC AVTALSVMPF GPMVSIFGHQ TPLQVTDVSV AVLLVLACSS MAVYGVVLAG WASGSTYPLL GGLRSSAQLI SYEIALGLSV VAVFMLSGTM STSGIVAAQG ERPQVEFFGL DVSAPGWYAI LLFPSFVIFF IAIVGETNRA PFDLPEAESE LVAGFMTEYS SLKFALIMLS EYVAMVTMSA FTVTLFLGGW RAPWPLSIWD GANSGWWPML WFFGKVLALV FVFVWLRGTL PRLRYDQFMR LGWKVLLPLN LLWILVLAGW LKTQGWERAD RLIAYGAVAG VVLIVTLIWP SRKPAAKPTL AEEVSNRPYG SFPLPPLDLQ VPPSPRTQRI VAEREPANLT TGTDSREV
|
| |