Gene Pars_2120 details


Gene Information

Locus tag: Pars_2120
Symbol:
ID: 5054815
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1895289
End bp: 1896512
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 55%
IMG OID: 640469672
Product: NADH dehydrogenase subunit D
Protein accession: YP_001154318
Protein GI: 145592316
COG category: [C] Energy production and conversion
COG ID: [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7
TIGRFAM ID:
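
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the gene record.
start_bp = 1895289
end_bp = 1896512
gene_length_bp = 1224
protein_length_aa = 407

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codons = span // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(span % 3 == 0)                                 # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the 1224 bp span corresponds to 408 codons, which is 407 residues plus one stop codon.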


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATATTC CAACTCATGT GAGGACTGAA CAATACGGCC TTGTTCTCAA AGAGGAACAA 
CTGGAGGGGA AAAGGCGCAT TCTCGACATT TTTTGGGGCC CGCAACACCC CTCTTCTGGA
CACACAAGAT TTATCGTAGA GGTAGATGGG GATATAGTAG TAAACGTAAC GCCAGATCCT
GGCTACGTAC ATAGGACAAT GGAGAAGCTC GGCGAGACGA GGCACTGGAT TCAAAACATA
CCCCTGTTCG AGCGGCTTTC GCTACCAGAT GCTATAAACG TGACTTGGGC CTACGCCATG
GCGGTGGAGA GGGTAGCCAA GCTCGACGTG TCGCCTAGGG CGCAGTACCT CAGGGTAATT
ATGGCCGAGC TAAGCCGCAT CAGTACACAC CTCTACGACT TGGGGCTTCA CGCCATTATG
ATCGGTAGCA GCACAGGTTT TATGTGGGGG TTCGGCCTCC GCGAGTTACT CGTCCAGCTC
TGGGCAATGG TCTCAGGCTC CCGGACGACG CCGACCTGGG TACTGCCAGG CGGCGTGCGC
ACGGCGCCGC CCGACGCCTT CTACGAGCAG ACCAAGGGGT TTTTAGACTA CCTAGAGAAA
AAGATCGACG AGTTTGTTAG GCTAGTCGTG AAAAACCCCG TCGGCTACTA CCGCCTCAAG
GACGTGGGCT ACCTCAGCAA GGAAGACGCG GCAAGATTGA TGGCCACCGG GCCCGGCGCC
AGGGGGTCCG GCATCGACTG GGACGCCAGG CGGGTTTACA AATACGGCAT CTACGACGAG
TTTGAGTGGG ACGTATGCGT AGAGGACGCC GGCGACTCCC TTGCGAGGAC TATGGTGAGG
ATCTGTGAAA TACAGCAGAG CGCCAAGATT ATTAGGCAGG CGCTTGATAG GGTGCCTAAA
GACGGCCCAC TAGTCGGCGA GGCTGTGCTC CACAGAATAC CGCCTAAACA GAGAGAAAAG
GCAAATGAGA TTATACGACT TGGCGCCCTC TACACCACAA TGCTCCCACA AGGCGAGGGG
GTAGGCGTAA CTGAGGGCGG TCGTGGGAGG TACTTCTTCC ACGTATTCGG CGATGGGACT
GAGAAGCCGT ACAGAGTTAG AATCTCAACG CCGTCTTGGC AAAACCTCAG GGCAATGATA
AGGGCCTTTA TCGGGGCAAG GCTAATGGAC CTGCCAGCAA TATACGGCTC CTTTGGCTAC
TTCCCGCCTG AACAAGACAG ATAA
 
Protein sequence
MYIPTHVRTE QYGLVLKEEQ LEGKRRILDI FWGPQHPSSG HTRFIVEVDG DIVVNVTPDP 
GYVHRTMEKL GETRHWIQNI PLFERLSLPD AINVTWAYAM AVERVAKLDV SPRAQYLRVI
MAELSRISTH LYDLGLHAIM IGSSTGFMWG FGLRELLVQL WAMVSGSRTT PTWVLPGGVR
TAPPDAFYEQ TKGFLDYLEK KIDEFVRLVV KNPVGYYRLK DVGYLSKEDA ARLMATGPGA
RGSGIDWDAR RVYKYGIYDE FEWDVCVEDA GDSLARTMVR ICEIQQSAKI IRQALDRVPK
DGPLVGEAVL HRIPPKQREK ANEIIRLGAL YTTMLPQGEG VGVTEGGRGR YFFHVFGDGT
EKPYRVRIST PSWQNLRAMI RAFIGARLMD LPAIYGSFGY FPPEQDR
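
The sequences are displayed in blocks of ten characters, so the spaces and line breaks need to be removed before any processing. As a minimal sketch, the Python below strips that display whitespace and translates the first thirty bases of the gene sequence so the result can be compared with the start of the protein sequence. The helper names `clean_sequence` and `translate` are hypothetical, and the codon table is the standard genetic code, whose amino acid assignments translation table 11 shares.

```python
# Standard codon table in TCAG order. NCBI translation table 11 assigns
# the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate complete codons from the first base; '*' marks a stop."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three ten-base blocks of the gene sequence.
first_blocks = "ATGTATATTC CAACTCATGT GAGGACTGAA"
print(translate(clean_sequence(first_blocks)))  # MYIPTHVRTE
```

The output matches the first ten residues of the protein sequence. This sketch raises a `KeyError` on ambiguous bases such as `N`, which a more careful implementation would need to handle.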