Gene Information |
Locus tag | Htur_4161 |
Symbol | |
ID | 8744789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 431945 |
End bp | 433468 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646514710 |
Product | Respiratory-chain NADH dehydrogenase domain 51 kDa subunit |
Protein accession | YP_003405657 |
Protein GI | 284167379 |
COG category | [C] Energy production and conversion |
COG ID | [COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit |
TIGRFAM ID | |
|
|
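The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and do not come from the database.

```python
# Values copied from the Gene Information table of this record.
START_BP = 431945
END_BP = 433468
GENE_LENGTH_BP = 1524
PROTEIN_LENGTH_AA = 507


def span_length(start_bp, end_bp):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp):
    """Number of codons minus one, assuming the stop codon is
    included in the gene length and the gene is not spliced."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions above are right.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 433468 - 431945 + 1 gives 1524 bp, and 1524 / 3 - 1 gives 507 aa, which matches the table.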
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.174891 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAATG AGATAGGAGC CGTTCGTCGA TCGCCGGTCG TTCGCGTCTC GGCAGCGGTC GCAGCGGATC GAGGTGACCG CGTTTACGCC GCCGCTCGCG ACGCCGCGGC TTCCGTGTCG GTCGTCCGAA CCGGACCCAC CGGCGTCGAC GAACTGGAAC CGCTGGTCCT CGCGACCGAC GACGGGCGGA CCGCGTTCTT TCGCTCGCCG TCCCCGTCGA CGACTCGAGA TCTCGTCACG GAATTTGAAT CAAATGGACT CCCGATGTCC CGCGCCGACG CCATCGTCGA CCACGATCCG GACGCGACGT CGCTTCCGGT TCCGGAAACT GGCCCGCTCG CTGCGGGCCG ACGGCTCGTC CTCGGTCCGT GCGGCTGGGT CAATCCGCTC GATCCGGCCG ACTACACACT GTGTTCGACC GAACGCGACG CCAGTGTGGC CGTGGATATG GGGATTCTCG CCCGTGGGCG GGGCGACGCC ATCGCCGACG AACCGGCCAT TGACGCTTGG GAACGGGCCC GCGAGACCGA CGGCGATCCA CTCATCGTCG TCAACGCGAA CGAGCCCGAC GACCGACAGC AGGCCGACCG GACGCTGCTC GCCGGGGCGC CGATCGCAGT CCTGGACGGC GTCGCGGCGG TCGCAGAGTA CGTCGGCGCC GAGGACGCGG TCGTCTATCT GAACGAACAC GAGACTACCC TCCAGCGACA CGTTCGACAG GCCGCAAACG CCATCGAAGA CGAACTGCCG GTCGTGCCGG ACGTCGTCGT CGGCCCTGAC GAGTACCGCG CCGGCGCGCC GACGGCCGCG CTCGAGGCCA TGGAGGGCGC GGACCGGATC GAACCCCGCC TGCAACCGCC GACGCCGGCC GAGTACGGCC TTTACGGCCG TCCGACGATC GTGCACACGC CGCGGACGTT CGCACAGGTC CGGCAGGCCG TTCACGCCCC GGAGTCGATC GACGCCGACG CCCCGGATCC GGGAACGCGG CTCGTGACCG TGACCGGCGA CGTCGAGACG CCGGCGATCG TCGAACTGGA CTCGAGCGCG ACGCTCGAGA CGACCCTCGA TGCCGTTTCG ATGGAGGGAT CGCTCAAGAT GGCCTGCGTC GGTGGTGTCT TCGGCGGGTT CACGACCGAT CTGGGCGTCG CCCCCACTGC ACGGTCGCTC ACCGCGGCGG ATCTCGGAAC CGACGGCGTC GTCGAACTGC TGAGCGACAG GCGCTGTGCG GTCGCGACCG CCGGCGAGCG CGCGCAGTTC GCTTCGGAGG CCAACAGCGG CCGGTGCGTC CCCGGACGGG AGGGGACGAA ACAGCTCACG GAACTGCTTC GCGACGTCTA CGCGGGCGCG TTCAGAAGCG ACGGAATTCG CGAACTGGGA CGCGTCATGG CTCGCTCGAG CAACTGCCAG ATCGGCGCCC ACGCGCCGCG GCCCGTAACC ACGGCCATGG ACGAATTCGA ATCCGAGTTT CGCGCCCACG CCAACGGGCG CTGTCCGAGC GGAACGTGTA CTGATCACTT ATGA
|
Protein sequence | MTNEIGAVRR SPVVRVSAAV AADRGDRVYA AARDAAASVS VVRTGPTGVD ELEPLVLATD DGRTAFFRSP SPSTTRDLVT EFESNGLPMS RADAIVDHDP DATSLPVPET GPLAAGRRLV LGPCGWVNPL DPADYTLCST ERDASVAVDM GILARGRGDA IADEPAIDAW ERARETDGDP LIVVNANEPD DRQQADRTLL AGAPIAVLDG VAAVAEYVGA EDAVVYLNEH ETTLQRHVRQ AANAIEDELP VVPDVVVGPD EYRAGAPTAA LEAMEGADRI EPRLQPPTPA EYGLYGRPTI VHTPRTFAQV RQAVHAPESI DADAPDPGTR LVTVTGDVET PAIVELDSSA TLETTLDAVS MEGSLKMACV GGVFGGFTTD LGVAPTARSL TAADLGTDGV VELLSDRRCA VATAGERAQF ASEANSGRCV PGREGTKQLT ELLRDVYAGA FRSDGIRELG RVMARSSNCQ IGAHAPRPVT TAMDEFESEF RAHANGRCPS GTCTDHL
|
| |
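The sequences above are displayed in space-separated blocks of ten characters, so the spaces need to be stripped before the text is compared against the Gene Length, Protein Length, or GC content fields. The following is a minimal sketch of such helpers. The function names are hypothetical, and the CDS check is a simplification that only accepts ATG as a start codon, although translation table 11 also permits alternative starts.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw):
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_cds(raw):
    """Simplified check: length divisible by 3, starts with ATG,
    and ends with a stop codon. Alternative start codons are ignored."""
    seq = clean_sequence(raw)
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    # First block of the gene sequence in this record, used as a small sample.
    first_block = "ATGACGAATG"
    print(len(clean_sequence(first_block)))
    print(gc_percent(first_block))
```

To check the whole record, paste the full Gene sequence value into `clean_sequence` and compare `len(...)` with Gene Length and `gc_percent(...)` with the GC content field.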