Gene Pars_2060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2060 
Symbol 
ID5056318 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1841959 
End bp1843743 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content62% 
IMG OID640469609 
Producthypothetical protein 
Protein accessionYP_001154258 
Protein GI145592256 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0175] 3'-phosphoadenosine 5'-phosphosulfate sulfotransferase (PAPS reductase)/FAD synthetase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.124935 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCTGGT GGTGCGAGGA CCTCAATCTT CCCGTGTTCG AGCCGAAAAA CGTAGCCGGG 
AGATGCGAGA GGTTTGTGGA GGTCAAAATG ACACAGCCTG CCGACCCCAG GCCGGCGTTT
CCCGCCGACA TCGACATTAC GAGAGGGGCC ATCGCCGACG AGTTGGGCGA CTGGGAGCTG
GCGGAGGCGC TGGTGCCCAT GGACGAGGTA GTCTTGCTGA ACAAGATACC GGGGTACGCC
GACCAGGCAG ACGAAGTAAT CGTCAGGGGG AGGCTGGTAG GCCACAGGTT CTACGACGTG
TTTGAGGGGC GGTGGAGGTT TAGGCCGCTG TACGAAGGCG TAGCCACAAT ACTCCACGAG
AGGAGGGGCT ACTGGGCCGT GGTGGACATG GCCGAGCTCC CGCAGGGCTA CGACATACAC
CCGGACAAAA TAGTGGAGGG GAGGCTCCCA GAGGAGAGGT ACCGCCACGT GGCCGTCTCT
ACAGCCGATG GCAAGACCCA CGGCGTCGCC AAGCTCTTCA GAGGGAGAAG ACTCCACGTG
GTTAAGTCGT GGCGGGCCAA GCCGCCGTTG CTACCCGGGC GCCCCTCCAC TCTGGCCGAG
GCAGCTGAGC TCAACAGAGA ACACATCGAG CGCCGCGCCC AGGAGGCGGT GGAGTTCATA
AAAGCGGTGG CCGAGAAATA CAAGAAGCCG GTAGTGGTTT CCTACTCCGG CGGTAAGGAC
AGCCTAGTGG CGCTGGATCT AACGGCGAGG AGCGGCTTGA AGTTCTACGT CTACTTCAAC
GACACGGGGC TAGAGCCGCC GGAGACTTAC GAAAACTTAA AGGCTGTGGA GGAGAGGTAC
GGCGTCGAAG TCATTGTGGG AGCCGCTGGA CAGCGGTTCT GGGAGGCCAT GGAGAAGTTC
GGCCCCCCCG CCAGGGACTA CAGGTGGTGT TGCAAGGTGA TCAAGCTGGG GCCGACCACT
GAGGCCCTTA AGTCGAGATT CCCCCAGGGC TACATAAGCG TGGTGGGGCA GAGGGGGGCT
GAGTCCTTTG TAAGGGCAAA GACGCCGAGG GTCTCGCCTA GCAAGTGGGT GGCAGGCTCG
GTGGTGGCGG CGCCTTTGCA GGAGTGGACA GCGCTGGAGG TGTGGCTCTA CATCTTTCTG
CACAAGCTCC CCTACAACCG GGCCTACGAG AGGGGCTTCG ACAGGCTGGG TTGCGTTGTC
TGCCCCGCCA ACGAGATGGC GGAGCTGGTC CTCGTGAAGG AGGCCTACCC AGAGATCTAC
GGCAAGATGG AGGTAGCACT GAGGAGGTGG CACACCGAGG AGGAGGTCAA GTGGGGCTTG
TGGAGGTGGC GCGGCAAGAT TCCAGGCGAC GTCGCGAGGT GGGTGAAGAG GGAGGAGGGG
GCCCCGCTCC CCGTCCGCAT TACCGCCAAG GGCCAGTCGC TGGAGCTTGA AATAGACGCC
GAGCCTAACG CCGAGACGAT GAGGGAGCTT TTAAAGATGG TGGGGAGGCC CGAGGGGGAT
CTTCTACGCA CTAAGAAGGG GCTGGTGGAA ATAAGGGGGG CCGGGGGGCG TTGGTCTATC
CGGGCGCCCG ACGGGAAGAC CGCCCTAGAC GTGGCGGCTC TCGTCGTGAG GTCCGCCATC
TGCGGCGACT GCGACCTCTG CGTCCACTGG TGCCCCACTG GTGCGTTGAG GAGGATCGGC
CCCGGCCGCT CTTTTAAGGT GGATGAGGGG AGGTGCATAG GCTGTCTGTT GTGTAGCTCG
GCGTGTCCCG CCGCACAGTA CCTAGTGTAT AGAAATGAGA CCTAG
 
Protein sequence
MLWWCEDLNL PVFEPKNVAG RCERFVEVKM TQPADPRPAF PADIDITRGA IADELGDWEL 
AEALVPMDEV VLLNKIPGYA DQADEVIVRG RLVGHRFYDV FEGRWRFRPL YEGVATILHE
RRGYWAVVDM AELPQGYDIH PDKIVEGRLP EERYRHVAVS TADGKTHGVA KLFRGRRLHV
VKSWRAKPPL LPGRPSTLAE AAELNREHIE RRAQEAVEFI KAVAEKYKKP VVVSYSGGKD
SLVALDLTAR SGLKFYVYFN DTGLEPPETY ENLKAVEERY GVEVIVGAAG QRFWEAMEKF
GPPARDYRWC CKVIKLGPTT EALKSRFPQG YISVVGQRGA ESFVRAKTPR VSPSKWVAGS
VVAAPLQEWT ALEVWLYIFL HKLPYNRAYE RGFDRLGCVV CPANEMAELV LVKEAYPEIY
GKMEVALRRW HTEEEVKWGL WRWRGKIPGD VARWVKREEG APLPVRITAK GQSLELEIDA
EPNAETMREL LKMVGRPEGD LLRTKKGLVE IRGAGGRWSI RAPDGKTALD VAALVVRSAI
CGDCDLCVHW CPTGALRRIG PGRSFKVDEG RCIGCLLCSS ACPAAQYLVY RNET