Gene Pars_0197 details


Gene Information

Locus tag: Pars_0197
Symbol:
ID: 5055204
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 175951
End bp: 177828
Gene Length: 1878 bp
Protein Length: 625 aa
Translation table: 11
GC content: 58%
IMG OID: 640467776
Product: hypothetical protein
Protein accession: YP_001152464
Protein GI: 145590462
COG category:
COG ID:
TIGRFAM ID:
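
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive and that the gene length includes one terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 175951
end_bp = 177828
gene_length_bp = 1878
protein_length_aa = 625

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes one stop codon, which is not
# counted in the protein length.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the fields agree: 177828 - 175951 + 1 = 1878 bp, and 1878 / 3 = 626 codons, which leaves 625 residues once the stop codon is set aside.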


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.317816
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAAAC TCCTAATCAC GTTAGCACTG GCAACACTAG CTTTGGCGGC CACCACTGTG 
GTAATACCCC CAATGGCTAA GCTTGAGCAA ATTGCTTACA AAGTTGAGGA GACTACCTTG
AAGATACAGG GAGCTGGCTT CGCCACGCTT GCCAAGCCGT ACGTCACTCC TGGTGAGGGC
TACGTCTACG CCGGCATGAG GATTGAGTTC CTGGGCGCCT ACCCCTCTAT CCAGGTCGGG
GCAGACGGCC AGCTCAGCAA GACCTTCGAT CAGAACGGCT TCGTGTCGAC CGTCTACGTC
GGCCCCGACG CCTCTAAGGT GACGCTCGTC AACACGGCCA AGGAGCCCGT CGAGGTGAAG
GTGAGGATCA CATACACCTA CGTCAAGGCC TCCTACATCT CGCTGAGCGG CGATGCTGTG
GTGGAGGTAA ACGTGCCTGA CGGCAAGCTG GCCCAGGGCT TCAACGCAAT GGCGAGGCTC
ACCATAGAGC CCTATGCCCC CTTCGTGGTG AAGGCGGTGG AGAGGCCAGA CGGCACCCCG
GCCACAGTGT ACAGGGTGGA GCCCAAGGTT GTTGAGATAA ACACCCCGGG CAAGTACAAG
ATAACGATCA CCCAGGGCGC CGCCCTTCCG GCGGCGATGC TGGTGAAGAG CCTCTCTAAG
CAGACGGCAA CCGTCACAGC CGGCGGCGAG TTTGCAGTGA CCGGGGCAGA GGTGGGAGTC
CCCCAGGGCT GGAAGTTGCT GGGCTATGCG GTGTTTGCCT ACACCGCTGA CGCCAACTTA
ATAGGCAAGG AGGTCACTGG CGATATAAAG ATAGACGGCG GCTTGGTGGA CACTATCACT
GACGTGAACC AAAACATCAT CGTGAGGAGT GTCAGCTATC TGGTGCCTCC TGTCTGGAAC
TTCAATATTA GGTACAAGAT AGCGCTCGTA TACGGCGAGC AGTTCAAGGT CTCCACCACT
CTGCCCAGCA CAGTTAATGT GATTTACATC CCGCTGGTGT ACAGAGAGGC ACAAGTCAAG
TGGTTGCCCG ACCGCGCCCT GGTTAACGTC ACTGATGTTG ACGTGGCGGA CGGCCAGTGG
ACAGCTGTGG TGTTGCAGCT ACCCGAGCTG GCCAAGATAG TGTCGATACG CACTCCGGGC
AACGCGATGA TCTCCAACGC CACCGACGTC AGGCTGGTGT GGGGCGGCGG CCTTAGGGCA
GTCTCAATCT CGCCAGACGG GAGGCAGGCA TACATAATCG CCCAACTCGG CGACACGAAG
GAGACCGGCA TGTACACTTT CATGATAAAC TGGAAGCCCA TGCGGATCCC CGTCATTGAC
ACTAAGGGCA GAGCCGTGGG CGACCTCTCA GCCTCTGCTG ACAAGTTTGA CGCCTCCGCC
TCAGTTGGAT ACGTCGAGGT GAAGGTGTAC AAGCCCGAGC CCTTTGCCCT CGACATAAGC
TACAAGGGCA TCCCGGCGGC CCACGTGGAG GTGAACTCCC TGGTGGAGAA GCCACAGGCC
GTGACCCTCG GCATATACAC AGTCAAGGTT GTGGTGGTCG GGGCGTTGAA CCAGCCCATA
GCCCAAGCCT CCGTGTCACT TGAGGGCTTC CCGGCCTCTG GAAAGACGGA CGGAGCTGGG
TCGCTGGTGT TCCAAGATGT GTTGGAGGGA ACTTACAAGA TAAATGTTGA CATCGGTGGG
AGGGTCAAGG TAAGCGAGGT CATAGAGGTG AGGGGCGACA CCGAGAAGAT AGTCAAGACG
CCTGTGGTTG CGATAGTGGG CGGCGTGCCG ATAACCACTC TCGATGCCAT AGCCACGGCA
GGTGGCTTGT CCGCGGCTGG GCTATACTTC GCGTTGACGA GAAGGAAGGA GTCAGTCGCC
GAGGTAGAAC AGATATAA
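
A GC content figure such as the one reported in the record can be recomputed from a sequence printed in this 10-base-group layout. This is a minimal sketch; `gc_percent` is a hypothetical helper name, and the rounding the source database applies is not known, so no value for the full gene is asserted here.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces and newlines used for layout) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# Usage on the first 10-base group of the gene sequence above.
first_group = "ATGAATAAAC"
print(gc_percent(first_group))
```

To check the whole gene, pass the full sequence block as one string; the embedded spaces and line breaks are dropped before counting.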
 
Protein sequence
MNKLLITLAL ATLALAATTV VIPPMAKLEQ IAYKVEETTL KIQGAGFATL AKPYVTPGEG 
YVYAGMRIEF LGAYPSIQVG ADGQLSKTFD QNGFVSTVYV GPDASKVTLV NTAKEPVEVK
VRITYTYVKA SYISLSGDAV VEVNVPDGKL AQGFNAMARL TIEPYAPFVV KAVERPDGTP
ATVYRVEPKV VEINTPGKYK ITITQGAALP AAMLVKSLSK QTATVTAGGE FAVTGAEVGV
PQGWKLLGYA VFAYTADANL IGKEVTGDIK IDGGLVDTIT DVNQNIIVRS VSYLVPPVWN
FNIRYKIALV YGEQFKVSTT LPSTVNVIYI PLVYREAQVK WLPDRALVNV TDVDVADGQW
TAVVLQLPEL AKIVSIRTPG NAMISNATDV RLVWGGGLRA VSISPDGRQA YIIAQLGDTK
ETGMYTFMIN WKPMRIPVID TKGRAVGDLS ASADKFDASA SVGYVEVKVY KPEPFALDIS
YKGIPAAHVE VNSLVEKPQA VTLGIYTVKV VVVGALNQPI AQASVSLEGF PASGKTDGAG
SLVFQDVLEG TYKINVDIGG RVKVSEVIEV RGDTEKIVKT PVVAIVGGVP ITTLDAIATA
GGLSAAGLYF ALTRRKESVA EVEQI
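
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first codons. This is a minimal sketch under stated assumptions: `translate` is a hypothetical helper, and the codon table is built from the standard genetic code, whose amino-acid assignments are shared by translation table 11 (table 11 differs in the set of permitted start codons, which this sketch does not model).

```python
from itertools import product

# Standard codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace used for layout is ignored; a trailing partial codon is dropped.
    """
    dna = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups and the last two groups of the gene sequence.
print(translate("ATGAATAAAC TCCTAATCAC GTTAGCACTG"))  # MNKLLITLAL
print(translate("GAGGTAGAAC AGATATAA"))               # EVEQI
```

The first call reproduces the opening residues of the protein sequence, and the second reproduces its final five residues, with the terminal TAA read as the stop codon.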