Gene Pars_1923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1923 
Symbol 
ID5055140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1727924 
End bp1730215 
Gene Length2292 bp 
Protein Length763 aa 
Translation table11 
GC content51% 
IMG OID640469469 
Producthypothetical protein 
Protein accessionYP_001154122 
Protein GI145592120 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.470327 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGGCGT TGCTTTTGCT TATACTCATA GCCACCTGGG CGTACTCCGG CATTGTGACC 
TACTGGCAAG GCGTTTTTCT AGAACTTGGG GAGGCGTCGA TAGATGGGCG AGGCGGCGCA
GGAATCGTGT TGCCAAACGC TGTAGAGCTC TGGGCGAGGC AACTGGTTTT CTACATAAAC
GCCACGAGGG TAGCGCAGGC GTATTACAAT ATGTTCACCG GCGAGTTGCT GTATGTCAGC
TCGCCTACGA TACATATTTA TTTTACTGTA AGACTTGGAA CGGCCCCCGT TGTGGCTAGC
TGGAAGGAGC AGGGCAACGA CGTTTTCTCC ACTGGAGGCC TAGAGCGGGT AGCGAAGAGA
TACACGGCTG AGCCCGGGCT GTCTAGGGTG TACCTAGTTC AAATTAAAAT CTACGCAGGC
GCCAGGGCAG GAACGTATAC AATATACGAA AAAACACGAG GAGCTGTGCC CTGTGAAGAA
CAAGCCCCTG TACAACGGAG TACGTCTGTG GAGGTGGTTG TGTACAACAT TTCTCCTAGT
ATGATAGGCA AAGAGGCTGC GCCAGTGAAC GTTGGCGTCG TGGTGCACCG TGAGGTCATT
TACAGGAGCT ATTTCGCGTC GAGATACCCC GACTACGTCG TGGCGAGGAC TTATCCCAAC
GGCACGTTGA GAATTACCCT AGTCAATGTT TATGTGTGTA GAGACGGGGT TGGCGAGTTG
GCAAATGTAA CTCGCAGTAT CGATATTGGT GGAGATCCAT GGATATGGAT AAGCCCTAAG
TACCCCACAG CTGCCTACAT TGCCGCCGTT AGCGCCTACT TTGATCAAAT TATTAGCAAA
ATGCCCTGGA TGTTCCATCC CATTATTTAC CTCACGTCCG ACGTCATTTT CGGTGAGAAG
AACCTGATTC TTTACAACTG CGAAAATTTC GGCTATGTTT GCGGCGCCTT GGAGTTTTCC
ATTTTTATTG GCAACGAAAC ATTAGCGCTC GCCCCCGTCG CATATCCCTT GTTCGTCGGC
GTTGGCGCGA TGGGAGAGGC TCGCGGGGGC GCCTACACCG CCGTCCCTGG GAGATTTGTT
GAGGCATGTA AACAACACAG TTACAGCAGA GAGGGGTGCG CATACGGCCA GCCCGCAGAA
CCTCTGTGGG ATCTGAGGGG CAAAAACTCG ACAATATGTA TTGACGGCAA CTGCATTAAG
GCGCCCGCTG GGCTGTTATT CGCGGCGTCG CCTAGCTACG TTGTTAAAGG CGAGTTTTTA
CCAACGCCTT TTGGCTACGC AGTTGCTGGT AAGGTTGACG TGCGTAGGCT TTACTCTGTG
TCCGTCGTGC TCCCCAACGG AACGTTGACG TTTAGAGTTC CTTGGGGGGA GACGTTTGTG
TATACGCCTC CGCGGGTAAT AGAGCTCAGC AACAGGACGA GGTACGTGGG GTCTGCAGAT
GTGGAAATAA CGGTAATGCA AGACCTAGTC TTGTTTGTAA ACTACACCAG ATTATACAGC
GTAACAGTAA TAACACCGTT TGGTATAAAC GAGACTTGGG TTCCTGAAGG CGCCGTTGTC
GAGATACCCG AAGTCTCTTG GGATTTAGGC AATGGTACAA CGATTTTTCA AAAGCCTGTG
AGTATAACTG CGGATCGGCC ATATATCATA AAACCGAATT ACACAGTATA TTACCGCATT
GATGTGTATA TGCCAAATGG AACTGTTAGC TACTGGGTTA AAAGGGGAGA GGTGTTTATC
TACAACCCGC CGCCAGTTAT AGACTTTGGC AACAACACAA GGCTTCTGGG GGCAGAGCCT
TGTGAGCTGA GGGCAGACAG GCCAACTAAC TGTACTGCTA GATATACCAA ACGTCAGTTC
TATGTAGTTG TCTATATGTT AAACAAGACT TGGGAGGACT GGGTCGATGA GGGTGCCCTA
GTAGAAGACT TAAATGTAGA CACTGGAGTT GTTAACGCCG GTTTGGGGGC GAAGGCATAT
TACAAGGCCG TATACGTGGC AGAACCGCTT GTAGTAAAAA CGCCGGGGAA ACACAAGCTC
AGATACCGCG TGGAGGGCTG GGTTTCCGTG GGCGATGCAC TCGGCGTGCC TATACCCATG
GCTGAGGTGA AGCTGTGCAA TAGCATTGTG AAAACTGACT GGAGCGGTGT GGTCAATATC
ACCGCCGTTT CCGAGTCGCT GTGTCGCCCA GAGGTGCGCG CCCCGCCGTT TTCTCCATAT
GTAGTCATAG CCTTTGTTGC GTCCGCCGTA GCTGGTGTGG CGTACTTTTA CAAGAGACAG
AGAGGGAGAT AG
 
Protein sequence
MRALLLLILI ATWAYSGIVT YWQGVFLELG EASIDGRGGA GIVLPNAVEL WARQLVFYIN 
ATRVAQAYYN MFTGELLYVS SPTIHIYFTV RLGTAPVVAS WKEQGNDVFS TGGLERVAKR
YTAEPGLSRV YLVQIKIYAG ARAGTYTIYE KTRGAVPCEE QAPVQRSTSV EVVVYNISPS
MIGKEAAPVN VGVVVHREVI YRSYFASRYP DYVVARTYPN GTLRITLVNV YVCRDGVGEL
ANVTRSIDIG GDPWIWISPK YPTAAYIAAV SAYFDQIISK MPWMFHPIIY LTSDVIFGEK
NLILYNCENF GYVCGALEFS IFIGNETLAL APVAYPLFVG VGAMGEARGG AYTAVPGRFV
EACKQHSYSR EGCAYGQPAE PLWDLRGKNS TICIDGNCIK APAGLLFAAS PSYVVKGEFL
PTPFGYAVAG KVDVRRLYSV SVVLPNGTLT FRVPWGETFV YTPPRVIELS NRTRYVGSAD
VEITVMQDLV LFVNYTRLYS VTVITPFGIN ETWVPEGAVV EIPEVSWDLG NGTTIFQKPV
SITADRPYII KPNYTVYYRI DVYMPNGTVS YWVKRGEVFI YNPPPVIDFG NNTRLLGAEP
CELRADRPTN CTARYTKRQF YVVVYMLNKT WEDWVDEGAL VEDLNVDTGV VNAGLGAKAY
YKAVYVAEPL VVKTPGKHKL RYRVEGWVSV GDALGVPIPM AEVKLCNSIV KTDWSGVVNI
TAVSESLCRP EVRAPPFSPY VVIAFVASAV AGVAYFYKRQ RGR