Gene Pars_1809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1809 
Symbol 
ID5056415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1623483 
End bp1625219 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content52% 
IMG OID640469355 
Productradical SAM domain-containing protein 
Protein accessionYP_001154012 
Protein GI145592010 
COG category[R] General function prediction only 
COG ID[COG1964] Predicted Fe-S oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.335774 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.27889 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGC AAGTTGCCGA GCTGGCTACA GTCCGAGAAG AGAAGCCAGC CAGACTTGTG 
GATTTGTCTA AGTACAACCT ACCTGAAAGG TACAAGACCA CTCTCCTCAA CGAGAAGTGG
CAGTCTACGT TGAAGAGGCA GTACGGACTG CCAGCCCACA CCAGGGTGGT TAAAGCAACT
CTCTCCCTTT GCCCAGTCTG CAACGCCAGA ATACCCGCCG TAGTGTATGA AGAAAACGGC
GCGATATGGC TGAAGAAGAG CTGTCCACAC CACGGCGTCT TTGAAGACCT CTACTGGGGC
GACGCAGAGA TGTACTACTA CTTCCTCCAG TGGGACCGCC CTGAGTACAT CGCCAAGGGC
CTCGCCAACC CATACACGGA CCTGGAGTTT TACAAAGACA TGGGCTCATG CCCCGAGGGC
TGTGGCCTCT GCCCAGTCCA CAAGTCCAAC ACAGTACTCG CCATCATAGA CGTCACTAAT
AGGTGCAATA TGGCCTGTCC CGTCTGCTTC GCAAACGCCG GTGCCGCAGG CTATGTCTAC
GAGCCCACGC TAGAGCAGAT CGAGTATATG CTCAGGACAC TGAGAGCGCA GAAGCCGTGG
CCGCCCAACG CCATTCAGAT ATCGGGAGGG GAGCCCACTC TCAGAGACGA CCTGCCCGAA
ATAGTGAGGA TGGCCAAGAA GCTCGGCTTC ACCCACATAG AGATAAACAC CAACGGCATA
AGACTTGCTA ATGATATTGA GTACTACAAG GCGCTTCTTG ACGCTGGGAT CTCTACTCTG
TACCTGCAGT TCGACACAAT AGACGAGAAG AACGAGGGCG TCTGGAGGCA CAGGCTCTAC
CACCCCAAGG CCTACAGGAT CATAAAGGAG AAGGTCATCG AAAACGCCAG GAAGCTAGGC
CACCGCTCCA TCGTCCTAGT GGTCACCCTC GCTAAGAACT ACAACGACCA AGACCTAGGC
AAAATAGTCG ACTTCGCCGT TCAGAACAGA GACGTGGTGA GGTGGGTAAA CATACAGCCA
GTGAGCTTTT CAGGCAGGGC CAAGCTATAC AGCAAGGAAG AGCTTAGAAA GTACAGAATA
ACAATACCTG ACACCATAAT AGAGATAGAG AAGCAGACAG GCGGCGTCAT TAGCAGGTGG
GACTGGCGCC CCACCAATTG GCCAGTGGCA CTAGCCAAAA TGGTAGAGGC GCTGACCGAT
TCGCCCAAAC CGCTCTTCTC AATGAACCCC ATGTGCGGCG CCGCCACCTT TATCTACTAC
GACGAAGACG AGAAGCGGAT ATACCCCATC ACCAAGCTGG TGGACGTAGA CGCTTTCGAA
AAAGGCGCCT GGGACATATA CTACACCGCA GTGAAGGGCG GCATCCACAA ACAGACAGCC
AAGGTAAAAG CCCTGAAGTT GTTAAAAGCA GTTAAGCACA AGAAGGTGAA AGACTTGATA
TACGACTTCT TGGTGAGGAA GGACTACGAC TCCCTAGGCC GCTTCTTCTT CAACGTTGTA
GGCATCGGAA TAATGCACTT CATGGATACC ATGAACTACG ACGTAGAGAG AGTACAGCGC
TGTGACATAC ACTACGCCAC GCCCGACGGC AGAGTATTCC CATTCTGTAC ATATAATGTA
GTAGGCCACC GCGAGAAGGT AGAAAGCTCA TTCAAAGTAG ATCCGAAAAC CTGGGTTAAA
GTCACAGGAC TCTCGCTAAC AGGCTGGAAC AAGCAAAAAT TCGCCGAATT TAAATAG
 
Protein sequence
MSKQVAELAT VREEKPARLV DLSKYNLPER YKTTLLNEKW QSTLKRQYGL PAHTRVVKAT 
LSLCPVCNAR IPAVVYEENG AIWLKKSCPH HGVFEDLYWG DAEMYYYFLQ WDRPEYIAKG
LANPYTDLEF YKDMGSCPEG CGLCPVHKSN TVLAIIDVTN RCNMACPVCF ANAGAAGYVY
EPTLEQIEYM LRTLRAQKPW PPNAIQISGG EPTLRDDLPE IVRMAKKLGF THIEINTNGI
RLANDIEYYK ALLDAGISTL YLQFDTIDEK NEGVWRHRLY HPKAYRIIKE KVIENARKLG
HRSIVLVVTL AKNYNDQDLG KIVDFAVQNR DVVRWVNIQP VSFSGRAKLY SKEELRKYRI
TIPDTIIEIE KQTGGVISRW DWRPTNWPVA LAKMVEALTD SPKPLFSMNP MCGAATFIYY
DEDEKRIYPI TKLVDVDAFE KGAWDIYYTA VKGGIHKQTA KVKALKLLKA VKHKKVKDLI
YDFLVRKDYD SLGRFFFNVV GIGIMHFMDT MNYDVERVQR CDIHYATPDG RVFPFCTYNV
VGHREKVESS FKVDPKTWVK VTGLSLTGWN KQKFAEFK