Gene Pars_0998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0998 
Symbol 
ID5054336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp890317 
End bp891312 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content52% 
IMG OID640468555 
Productperiplasmic solute binding protein 
Protein accessionYP_001153230 
Protein GI145591228 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.0851489 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCACGGCG TCGGAATTAG TCCCGCGTAT ATTAACCGTC GTTGTGCACA AAATTTAAAA 
ATGTACATAG CCATAATGCT CATGCAGTGG AGCAAGGCCC TTTTGTGGTC GGTGTTGGTC
GTAGCGTTAG CGGTGGCGGC TATAGTAACC ATCTTAGCTT CTCAGCCACA GACTCAGCCA
CCTGTTGTCT CTCAGACCCC GGTAAGGCCC AAGATTATAG TCAGCTTTCC TGCATATGAT
AAGATTTTAG CCCAAGCATT TCCAGAGGCT GAAGTCGTCC TCTTGACAAA AGGGATCTCC
GACCCCCACG AATACCAGCT AACTCCCCAA GATCTTCAAT TGCTAAGAAG CCTTACAGAT
AAGGACGTCG TGGTGGATAC TATGCACGCA TCCTTTGAGC TGAAGATAGC GGAGATGGCC
CAGAGGGGCG AGATAAAGGC AAAGGTGATA AAGACGCCCG ACTTCGAGAC GTACCTTACC
TGGGACGGGA AAGAGGTTAA GCTAACCAGC TACGGACAAG AGCAGGGAGG CGTGAACATG
CACAGCCACG GGCTCTACCC CCCAGACGTG TTGAAGCTAA TTGACGCTGT GTCCAGTGCC
TCAGGGCTTA CTCCAAACGC CACATTTGTC AATGGGCTTA GGCAGCTACA GGACAAGTAC
GCCGGGAAAC TCAGCGGCAA GGCTGTTGCA CTGACCCCCG CGGCCCAGTA CATATTATAC
TGGCTAGGCT ACAGAGACAT CGCGGTGTTT ATAAAAGAGC CCGGGGTGCC TCCTTCACAA
GAAGACGTAG CAAAGGCGCT CCAATACGCG AAAGAAGGAG CCCCAGTGCT GGCAGCTGTG
GTAAGTGGTG AGGCTCTACG TGTCGTTGAT ATGTTCAAGA ATAAGGCAGA GGAGGCAGGC
ATCAACGCGA AGGTTATAGT AGCGGACTTC TCAAAGGGCT ACCTAGAAGT TCTTAGAGAA
GTGACAGAGC AGATAGCCAG GTCACAAGGG GGTTAA
 
Protein sequence
MHGVGISPAY INRRCAQNLK MYIAIMLMQW SKALLWSVLV VALAVAAIVT ILASQPQTQP 
PVVSQTPVRP KIIVSFPAYD KILAQAFPEA EVVLLTKGIS DPHEYQLTPQ DLQLLRSLTD
KDVVVDTMHA SFELKIAEMA QRGEIKAKVI KTPDFETYLT WDGKEVKLTS YGQEQGGVNM
HSHGLYPPDV LKLIDAVSSA SGLTPNATFV NGLRQLQDKY AGKLSGKAVA LTPAAQYILY
WLGYRDIAVF IKEPGVPPSQ EDVAKALQYA KEGAPVLAAV VSGEALRVVD MFKNKAEEAG
INAKVIVADF SKGYLEVLRE VTEQIARSQG G