Gene Pars_0149 details


Gene Information

Locus tag: Pars_0149
Symbol:
ID: 5056072
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 139264
End bp: 140406
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 56%
IMG OID: 640467728
Product: extracellular solute-binding protein
Protein accession: YP_001152416
Protein GI: 145590414
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0687] Spermidine/putrescine-binding periplasmic protein
TIGRFAM ID:
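
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon (both assumptions are consistent with the values in this record). The function names `gene_length` and `protein_length` are illustrative, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but encodes no amino acid.
    return codons - 1 if includes_stop else codons


length_bp = gene_length(139264, 140406)
print(length_bp)                  # 1143
print(protein_length(length_bp))  # 380
```

Both results agree with the Gene Length (1143 bp) and Protein Length (380 aa) listed in the record.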


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.489376
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCACTA GGAGGACACT CCTCGTGGCG GTCGGGGCGG CTGCGGTCCT GGCAGTAGCC 
GGGGGGTATA TCTACCTGGC ACAGCAACAG GGCGGGGCGG TGCCCACTCA GACAACGCCG
CGGACAACGA CCGCGGCGGG GAGGAAGCTG GCTGTGTATA ACTACTCGTA CTACATCGAT
AAGGCGCTTC TCGACGATTT TAAAAAGGAG TACGGCATTG AGGTGATATA CCAAGAGTAT
GAAAGCGGCG AGGAGGCGTA CGCCGCGTTG TTGAGGGGAG GCGGCGGATA CGACTTAATA
GTAGTTCCCG ATACGTACAT CAAGGAGGTG ATAGGGAAGG GCTATGTGAG GAAGATCGAC
CACGCCAAGC TTTCCAACTT CGCCAACGTA GACCCTGTCT TCTTCCAGAA CCCCAACGAC
CCCGGCCTCC AGTACTCGGT GCCGTACGCC TATGGAACCA CCGGCATTGC GGTGAACTAC
TACGACATGA AAGCCGACGT TGGGAAAATA GAGAGCTGGG GCGACCTCTT TGACGAGACC
AAGCTGGAGA AGGTCAAGGG GAGGATAGCC ATGTTGGAGG AGTTCGTGGA GCCCATAATG
GCGGCGAAAT ACGCGCTGGG CATAGACCCA GACGACTGGA GCGACGACGC AGTCAACAAG
ATAGTAGACC TCCTAAAGCG GCAAAAGGAG TACATCAGGG GGTACATGGG CATTAGCCAG
ATTGTCCCCG TAATAGCCGC CGGGGAATTG TGGATATCCC AGATCTGGTC TGGCGACGCT
CAATACGCCA AGGAGGAGTT TATAAAGAGG GCGGGGGAGG CAAACGCCGA CAAGTTCCAG
TACGTATTGC CGAAGCCTAT GACCCACCGC TGGGTGGACT TCATGGTCAT CCCCCGCGAC
GCTAAAAACG TCGAGGAGGC CTACCTCTTC ATGGACTTCT TACTGAGGCC TGAGAACTCG
GCGAAGATAG TCGAGGCTAC GTACTACCCC ACCTCCTTGA AGAAGCAGCT ACTTGAGAAG
TACGTGGACC CCAACCTGCT CAACGCCATA ACCCCGCCGG AGACAGCCAA GGTCATATAC
CTAAACTACA CCGAGCAGAT GCTTAGGGCA ATTGAGAGGA TTAGCTACGC GGTTAAGGGC
TGA
 
Protein sequence
MVTRRTLLVA VGAAAVLAVA GGYIYLAQQQ GGAVPTQTTP RTTTAAGRKL AVYNYSYYID 
KALLDDFKKE YGIEVIYQEY ESGEEAYAAL LRGGGGYDLI VVPDTYIKEV IGKGYVRKID
HAKLSNFANV DPVFFQNPND PGLQYSVPYA YGTTGIAVNY YDMKADVGKI ESWGDLFDET
KLEKVKGRIA MLEEFVEPIM AAKYALGIDP DDWSDDAVNK IVDLLKRQKE YIRGYMGISQ
IVPVIAAGEL WISQIWSGDA QYAKEEFIKR AGEANADKFQ YVLPKPMTHR WVDFMVIPRD
AKNVEEAYLF MDFLLRPENS AKIVEATYYP TSLKKQLLEK YVDPNLLNAI TPPETAKVIY
LNYTEQMLRA IERISYAVKG
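
The gene and protein sequences above are related by translation, and the GC content field is derived from the gene sequence. The following is a minimal sketch of both calculations, assuming the sequence is pasted with the space-separated blocks of ten shown on this page. It uses the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons, not in the amino acids assigned). The helper names `clean`, `gc_fraction` and `translate` are illustrative only.

```python
# Standard codon assignments, listed in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
prefix = "ATGGTCACTA GGAGGACACT CCTCGTGGCG"
print(translate(prefix))  # MVTRRTLLVA
print(gc_fraction(prefix))
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to the same functions should reproduce the complete protein and a GC fraction comparable to the listed 56%.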