Gene Pars_0962 details


Gene Information

Locus tag: Pars_0962
Symbol:
ID: 5055467
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 853388
End bp: 854356
Gene Length: 969 bp
Protein Length: 322 aa
Translation table: 11
GC content: 51%
IMG OID: 640468518
Product: oligopeptide/dipeptide ABC transporter, ATPase subunit
Protein accession: YP_001153194
Protein GI: 145591192
COG category: [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component
TIGRFAM ID: [TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain
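
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the final codon of the unspliced CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose last codon is a stop.

    Assumption: the CDS is complete, so its length is a multiple of 3
    and exactly one codon (the terminal stop) is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(853388, 854356)
print(length_bp)                  # 969
print(protein_length(length_bp))  # 322
```

Both values agree with the Gene Length and Protein Length fields of this record.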


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.000380465
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAATCC TCGAAGTTAG GAATCTGACA GTATACTTCT ACACCTACGC CGGCGTGGTG 
AGAGCCGTGG AGAACGTGTC CTTTGACCTG TACAAAGGGG AGACTTTGGC CATCGTTGGT
GAGACTGGTA GCGGCAAGAG CGTGACTACT AGGGCGATAA CGAGACTCGT GTCGCCGCCA
GGAAAAATAG TATCAGGATC AGTTATTTAT AGGAGAGACG GAGAGGAGCT AGATCTTCTT
AAGCTACCAG ATGAGGAGCT GAGGAAAATA AGGGGGTCGG AGATAGCCTA CGTCTTTCAA
GACCCCTCTT CTGCTCTTGA CCCGCTGTAC ACAGTCGGCT ACCAGATATC GGAGACTGTG
GCGGCCCACA GAGGAGGTAA GATAAAGCAA TACTTGGGAG AAGCTGTGGA ATTGCTTAGA
AGGGTCCTCA TCCCCGATCC TGAGAGTAGG TCAAAGGCGT ACCCCCACCA GCTTTCTGGA
GGCATGAAGC AGAGGTCCGT AATTGCCATG GCTATTAGTA ATAGGCCGAA GATATTAATT
GCCGATGAGC CCACCACAGC CGTCGACGTC ACTGTGCAGG CCCAGTTGCT TCACTTATTT
AAGAAGCTGA AAGAGGAGAT CGGCATGTCT ATTATTTTCA TAACCCACAA TATGGGCCTC
GTCGCTGAGC ACGCAGATAG GGTTATCGTT ATGTACGGCG GAAAAATAGT CGAGGAAGGA
CCTGTAGATG AGGTATTCGA AAACCCGAGA CACCCCTATA CCCAGGGCCT TCTAAGAGCC
GTGATAAACC CCATCAAGAC TCAGGAACGG CTAGAGCCTG TGCCGGGCAC AATACCCAAC
CTCATAAATC CGCCTGCCGG TTGCCGCTTC CACCCCCGCT GCCCCTACTT CATAAAAGGA
AAATGCGACG TGGAAGAGCC GCCCCTTGTA GGCGACAGAC ACAAGGTAGC TTGCTGGTTG
TACGTATGA
 
Protein sequence
MKILEVRNLT VYFYTYAGVV RAVENVSFDL YKGETLAIVG ETGSGKSVTT RAITRLVSPP 
GKIVSGSVIY RRDGEELDLL KLPDEELRKI RGSEIAYVFQ DPSSALDPLY TVGYQISETV
AAHRGGKIKQ YLGEAVELLR RVLIPDPESR SKAYPHQLSG GMKQRSVIAM AISNRPKILI
ADEPTTAVDV TVQAQLLHLF KKLKEEIGMS IIFITHNMGL VAEHADRVIV MYGGKIVEEG
PVDEVFENPR HPYTQGLLRA VINPIKTQER LEPVPGTIPN LINPPAGCRF HPRCPYFIKG
KCDVEEPPLV GDRHKVACWL YV
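
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the method used by the database: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), and it ignores alternative start codons because this gene begins with ATG. The names `clean`, `translate`, and `gc_fraction` are hypothetical helpers.

```python
from itertools import product

# Codon table in TCAG order; codon assignments are shared by the standard
# code and translation table 11 (an assumption relied on here).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from this record.
first_line = (
    "ATGAAAATCC TCGAAGTTAG GAATCTGACA GTATACTTCT ACACCTACGC CGGCGTGGTG"
)
print(translate(first_line))  # MKILEVRNLTVYFYTYAGVV
```

The output matches the first 20 residues of the protein sequence. Passing the complete gene sequence to `translate` and `gc_fraction` would allow the full Protein sequence and GC content fields to be checked in the same way.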