Gene Information |
Locus tag | Pars_2120 |
Symbol | |
ID | 5054815 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1895289 |
End bp | 1896512 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640469672 |
Product | NADH dehydrogenase subunit D |
Protein accession | YP_001154318 |
Protein GI | 145592316 |
COG category | [C] Energy production and conversion |
COG ID | [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 |
TIGRFAM ID | |
|
|
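The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1895289, 1896512)
print(length)                  # 1224
print(protein_length(length))  # 407
```

Both values agree with the Gene Length and Protein Length rows of the record.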
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATATTC CAACTCATGT GAGGACTGAA CAATACGGCC TTGTTCTCAA AGAGGAACAA CTGGAGGGGA AAAGGCGCAT TCTCGACATT TTTTGGGGCC CGCAACACCC CTCTTCTGGA CACACAAGAT TTATCGTAGA GGTAGATGGG GATATAGTAG TAAACGTAAC GCCAGATCCT GGCTACGTAC ATAGGACAAT GGAGAAGCTC GGCGAGACGA GGCACTGGAT TCAAAACATA CCCCTGTTCG AGCGGCTTTC GCTACCAGAT GCTATAAACG TGACTTGGGC CTACGCCATG GCGGTGGAGA GGGTAGCCAA GCTCGACGTG TCGCCTAGGG CGCAGTACCT CAGGGTAATT ATGGCCGAGC TAAGCCGCAT CAGTACACAC CTCTACGACT TGGGGCTTCA CGCCATTATG ATCGGTAGCA GCACAGGTTT TATGTGGGGG TTCGGCCTCC GCGAGTTACT CGTCCAGCTC TGGGCAATGG TCTCAGGCTC CCGGACGACG CCGACCTGGG TACTGCCAGG CGGCGTGCGC ACGGCGCCGC CCGACGCCTT CTACGAGCAG ACCAAGGGGT TTTTAGACTA CCTAGAGAAA AAGATCGACG AGTTTGTTAG GCTAGTCGTG AAAAACCCCG TCGGCTACTA CCGCCTCAAG GACGTGGGCT ACCTCAGCAA GGAAGACGCG GCAAGATTGA TGGCCACCGG GCCCGGCGCC AGGGGGTCCG GCATCGACTG GGACGCCAGG CGGGTTTACA AATACGGCAT CTACGACGAG TTTGAGTGGG ACGTATGCGT AGAGGACGCC GGCGACTCCC TTGCGAGGAC TATGGTGAGG ATCTGTGAAA TACAGCAGAG CGCCAAGATT ATTAGGCAGG CGCTTGATAG GGTGCCTAAA GACGGCCCAC TAGTCGGCGA GGCTGTGCTC CACAGAATAC CGCCTAAACA GAGAGAAAAG GCAAATGAGA TTATACGACT TGGCGCCCTC TACACCACAA TGCTCCCACA AGGCGAGGGG GTAGGCGTAA CTGAGGGCGG TCGTGGGAGG TACTTCTTCC ACGTATTCGG CGATGGGACT GAGAAGCCGT ACAGAGTTAG AATCTCAACG CCGTCTTGGC AAAACCTCAG GGCAATGATA AGGGCCTTTA TCGGGGCAAG GCTAATGGAC CTGCCAGCAA TATACGGCTC CTTTGGCTAC TTCCCGCCTG AACAAGACAG ATAA
|
Protein sequence | MYIPTHVRTE QYGLVLKEEQ LEGKRRILDI FWGPQHPSSG HTRFIVEVDG DIVVNVTPDP GYVHRTMEKL GETRHWIQNI PLFERLSLPD AINVTWAYAM AVERVAKLDV SPRAQYLRVI MAELSRISTH LYDLGLHAIM IGSSTGFMWG FGLRELLVQL WAMVSGSRTT PTWVLPGGVR TAPPDAFYEQ TKGFLDYLEK KIDEFVRLVV KNPVGYYRLK DVGYLSKEDA ARLMATGPGA RGSGIDWDAR RVYKYGIYDE FEWDVCVEDA GDSLARTMVR ICEIQQSAKI IRQALDRVPK DGPLVGEAVL HRIPPKQREK ANEIIRLGAL YTTMLPQGEG VGVTEGGRGR YFFHVFGDGT EKPYRVRIST PSWQNLRAMI RAFIGARLMD LPAIYGSFGY FPPEQDR
|
| |
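The gene and protein sequences above can be compared by translating the nucleotide sequence. This is a minimal sketch in plain Python, not the database's own pipeline. It assumes the gene sequence is shown in coding orientation (it starts with ATG even though the gene lies on the minus strand), so no reverse complement is taken. The codon assignments are those of NCBI translation table 11, which are the same as the standard code; table 11 differs only in its permitted start codons, which do not matter here because this gene starts with ATG. The names `translate` and `gc_fraction` are illustrative.

```python
from itertools import product

# NCBI codon ordering: bases in T, C, A, G order, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 nucleotides of the gene sequence as listed in the record.
print(translate("ATGTATATTC CAACTCATGT GAGGACTGAA"))  # MYIPTHVRTE
# Last 15 nucleotides, including the TAA stop codon.
print(translate("GAACAAGACAGATAA"))                   # EQDR
```

The two fragments reproduce the first ten and last four residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content row.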