Gene Pars_1733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1733 
Symbol 
ID5054891 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1560189 
End bp1561157 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content47% 
IMG OID640469276 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_001153936 
Protein GI145591934 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.856595 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.000426815 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGCCACCTC TCTTGGAGTT ACGAAATGTG AAGATGTACT ACAACACAAC AAGGGGAACT 
GTGAAAGCCG TTGATGGGAT TTCTTTTAAG CTGGAAAAAG GAGAGGCCAT GGCCTTGGTC
GGAGAGAGCG GAAGCGGAAA AAGCTCGCTC GCTTTTACGA TAATAAGGCT GTTGCCTAGG
AACGTAGCGG AGTCAGGTGG CGAGATCTTG TTTTATGACG AAGAACTTGG AGTAGTAGAT
CTGATGAAGA TGTCTGAAAG CGAGATTAGA AGAAAGATTA GGTGGAAGAA GATATCCATG
GTGTTTCAAG CTTCTATGAA CGCGCTAAAC CCCATATTAA GAATACAAGA TCAGATGATT
GAGCCGCTTG TGCTTCACCT AGGTATGTCT AAAGAAAGCG CGGTAAAAAT CGCCGAGGAG
GCTCTCAGAT CAGTGGGCTT ATCTCGAGAT GTCCTGTCTA GATACCCCTT CGAACTATCG
GGCGGTATGA AACAGAGAGT GGTCATAGCT ATGGCAATAA TGATGAGGCC CAGGCTAGTT
ATCTTAGACG AGCCGACGTC AGCTCTGGAT GTCATTACCC AGGCTAATAT TATGAATTTG
TTAAAGGAGC TTAAGGCCAA GTTCGACTTA TCATATATCT TAATTACTCA CGACATAGCA
CTCGCCTCCG AGATAGCCGA TAAAATAGGC GTTATGTACG CAGGTAAGCT GGTGGAGGTA
GCCCCCGCAG ATCTCTTCTT TAGGTGGCCT AAACACCCGT ACTCTCAGAA ATTACTAGCC
GCAATGCCGA CGTTGAGAGA GGACAAGAAA ATTGAGCACA TACCTGGAGA TGTCCCAAGT
CTCATTAATC CTCCGCCTGG CTGCCGCTTC CACCCCAGAT GCCCCTACGC CATAAAAGGC
AAATGCGAAA AAGAAGAACC GGCAGTGAAA GACGTGGAGG GCAGTCTAGT GGCCTGCTGG
CTGTACTAG
 
Protein sequence
MPPLLELRNV KMYYNTTRGT VKAVDGISFK LEKGEAMALV GESGSGKSSL AFTIIRLLPR 
NVAESGGEIL FYDEELGVVD LMKMSESEIR RKIRWKKISM VFQASMNALN PILRIQDQMI
EPLVLHLGMS KESAVKIAEE ALRSVGLSRD VLSRYPFELS GGMKQRVVIA MAIMMRPRLV
ILDEPTSALD VITQANIMNL LKELKAKFDL SYILITHDIA LASEIADKIG VMYAGKLVEV
APADLFFRWP KHPYSQKLLA AMPTLREDKK IEHIPGDVPS LINPPPGCRF HPRCPYAIKG
KCEKEEPAVK DVEGSLVACW LY