Gene Pars_1494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1494 
Symbol 
ID5055386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1351403 
End bp1352983 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content53% 
IMG OID640469036 
Productextracellular solute-binding protein 
Protein accessionYP_001153702 
Protein GI145591700 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.69615 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTTGTAA TACTAGCAGT GTTGGCTGCT ATTTTGTTCA CCTCTCAGCC GCCTCCGCAG 
ACGCCTACCC CAACCCCACC TGCCACATCT TCACCAACCA CGCCTCAAAC CACCACGACC
CCCGCTGAGA TTACTCTCAC CATAGGCGTT ACAGATAAGG TGACCGACTT AGACCCGGCG
AATGCCTACG ACTTCTTCAC GTGGGAGGTC TTGTACAACA CAATGGCAGG CCTCGTACGG
TATAAACCGG GGACTACGGA GATCGAGCCC GACCTTGCGG TGAGTTGGAC CACGTCGGAG
GGGGGCAGGG TTTGGACATT TAAGCTGAGA CCTGGCTTGA AATTCTGCGA CGGCACCCCG
CTTACGGCGC AAGACGTCAA GAGATCAATT GAACGCGCCA TGAAGATAAA CGGCGACCCC
GCGTGGTTAG TGACCGATTT TGTTGAGAAG GTCGAAGCGC CAGACGACGC CACTGTTGTT
TTCTACCTTA AAAAGCCCGT GTCGTATTTC TTAGCCCTCG TAGCGACCCC GCCATATTTC
CCAGTCCATC CAAAATACGC ACCTGACAAG GTAGACTCTG ACCAGACAGC CGGTGGGGCG
GGTCCCTACT GTATAAAGAA TTTTGTCAGA GACCAGCAGA TCGTGCTTGA GGCTAACCCC
TACTACTACG GAGGCAAGCC CCAAGTCTCC AAGGTGGTGA TTAGGTTTTA CAAAGACGCC
ACGACGCTGA GACTCGCCCT AGAGAGGGGC GAAATAGATC TGGCTTGGAG AACGCTTAAT
CCGCCCGACT TGGAAGCCCT AAAAGCCTCC GGCAAGTACA AAGTTGTCGA AGTTCCCGGC
TCTTTCATTA GATACATCGT CCTCAACCTG AACATGCCAG AGCTAAAAGA CGTGAACGTC
AGGCGCGCCC TTGCCGCGGC GGTTTGTAGA AAAGATATCG CAACCGTGGT TTTCCACGGA
ACTGTAACGC CGCTGTTTAC GCTCGTGCCT GAGGGAATGT GGTCTTCCTA CCCCGCTTTT
AAGGAGAAAT ACGGCGACTG CAACACCGAC CTTGCTAAAC AACTCCTGCA ACAAGCCGGC
TTCAGCCCCA GCAAGAAGCT CAATATCGAG CTGTGGTACA CGCCTACGCA TTACGGCGAC
ACCGAGAAGG ACCTAGCTGC CATGTTGAAA GATCAGTGGG AGGCCACAGG CATAATCTCG
GTTAGCGTAA AGTCTGCGGA GTGGGCCACA TACGTACAGC AGCTCAGAAG CGGGGCGATG
ATGGTGTCGT TGCTAGGCTG GTACCCCGAC TACATAGACC CAGACGACTA CACCACGCCG
TTTTTAAGAA GCGGGTCTAA TAAATGGCTT GGCAACGGGT ACAGCAATCC AACAATGGAC
GATATTTTAG ACAAAGCCGC CCTCGAGCTT GACCAGACTA AGAGGGCTCA GCTGTACAAA
GAAGCTCAGC TACTCTTAGC CGACGACGTG CCTATAATCC CGCTGATACA AGGCAAGCTG
TTTATAGTCA CAAAGCCGAA TATACAAGTG GTAGTAGACC CCACGATGAT ACTTAGATAC
TGGGCCATAA GGGTTTCTTA A
 
Protein sequence
MVVILAVLAA ILFTSQPPPQ TPTPTPPATS SPTTPQTTTT PAEITLTIGV TDKVTDLDPA 
NAYDFFTWEV LYNTMAGLVR YKPGTTEIEP DLAVSWTTSE GGRVWTFKLR PGLKFCDGTP
LTAQDVKRSI ERAMKINGDP AWLVTDFVEK VEAPDDATVV FYLKKPVSYF LALVATPPYF
PVHPKYAPDK VDSDQTAGGA GPYCIKNFVR DQQIVLEANP YYYGGKPQVS KVVIRFYKDA
TTLRLALERG EIDLAWRTLN PPDLEALKAS GKYKVVEVPG SFIRYIVLNL NMPELKDVNV
RRALAAAVCR KDIATVVFHG TVTPLFTLVP EGMWSSYPAF KEKYGDCNTD LAKQLLQQAG
FSPSKKLNIE LWYTPTHYGD TEKDLAAMLK DQWEATGIIS VSVKSAEWAT YVQQLRSGAM
MVSLLGWYPD YIDPDDYTTP FLRSGSNKWL GNGYSNPTMD DILDKAALEL DQTKRAQLYK
EAQLLLADDV PIIPLIQGKL FIVTKPNIQV VVDPTMILRY WAIRVS