Gene Pars_2334 details

Gene Information

Locus tag: Pars_2334
Symbol:
ID: 5054443
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 2087323
End bp: 2089080
Gene Length: 1758 bp
Protein Length: 585 aa
Translation table: 11
GC content: 52%
IMG OID: 640469886
Product: hypothetical protein
Protein accession: YP_001154530
Protein GI: 145592528
COG category: [S] Function unknown
COG ID: [COG2433] Uncharacterized conserved protein
TIGRFAM ID:
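The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The sketch below assumes 1-based, inclusive coordinates and a final stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


length = gene_length(2087323, 2089080)
print(length, protein_length(length))  # prints: 1758 585
```

Both values match the Gene Length and Protein Length fields of this record.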


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0927872
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.00309238
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGCTATCC TGGGCATTGA CATTGCGCCC GGCGGCTTGT TTGCCTACGC CGTCGTGGAT 
AACGACGTCG TGGTGGAGAA GGGCACCGCT AGCGCCAGAG ATCTAGCATC CGTTTTTAAG
AAGTATAGAA TTCAAAAGCT GGCCTTGGAC AATCTGGGGG AACTGTTTCA ATACGGCAGA
TCTGTGATAA GACTTCTCGG TAAGCTTCCA TATGACGTAA ACGTCGTCGA GGTCACTAGG
GTTGGAGAGG GGTATGTTAG AACAGAAGAC TTGGTGAGGC AACATCTCGG GGTAGTGAAG
GGGAGGCTGG ATCCGCAGGA GACGGCGATA TACCTAGCCA TGTTAGCCGG ACGCGGAGTG
GGGACACCTG TAAAACTCTT CGAGGAGGAG ACCGTGGTGC TTGTCTACAG GCGCATTTCG
ACGACCCCCG GCGGTATGAG CAGGAACAGG TACATGAGAA ACATAAGCCA CAGGATAAGA
GATATAGCGG CAAGAATTGA GGCAAAGCTA AAAGAGGCCA AGTTAGACTA CGACTTATTC
CTAAAAGAGG AGTCCGGCGA AGTCACCTCT GCCAAGTTCA TAGTTTATGC CAGCAAGGAG
GTTGTTAGGA GGTATGTAAA GACCATGCGC AGTATCGACG TTGCAGTATC TATATACTCC
GCTCCGGCTA AGAAAGGCGG AGTCCCCACC CACGGGCGCT ATCTAATCGT TGGAGTGGAT
CCAGGTATAG TGACAGGTGT CGCAGTGCTG ACGCTAGACG GCGAAGTCCT CGACACCTTG
GCTAGAAGAG GGTTCTCGCG GGGCGATGTG CTCAGGTACG TACACCAGTG GGGGGTGCCT
GTGGTTGTTG CCACGGACGT AGCCGACCCC CCCGAATACG TGAAACGGTT GGCGTCTATG
TGCGGCGCAG TGCTCTATGT GCCAAGCAGA GACCTCACGT CGGAGGAAAA GGCAGAGGTG
TTAGAAAAAG TGGGCTGGAG GGCTAAGACA ACTCACGAGA GAGACGCCTT GGCGGCCGCG
TTTAAGGCAT ATCAGGATTA TAAGCCGAAG TTTGAGAAAA TCGAAAAGGA ATTCGGAGGT
ATACTAAAGC CCGACCAGCT TGAATATGCC AGGGCCCTCG TGGCCAAGGG CTACTCCATA
GCCCAAGCCG TCTCCGAGGC CTTGAAGAGA CGTGAGGAGA AGGAGACCAA AGTTATCTAC
GTAACTGTGG AAAAGCCCTG CGGTTCAAGA GACGAAGCTC TCACAGCTCG TATAAAAGCC
CTCGAGTATG AAAACATGGA GTTGCAGAAA GAGCTTGAAA ATCTAAGGCG GGAATATGCG
CAGCTAAAAA GAGCGTTTGA GGATGCTAAG TGGCGAGATA TGAAATACAG AGAGCTCCAG
AACAGAATAG AGGCGCTTAC AGCGGCGCTG ACGCAGAAAG AGGATGAGAT AAACGCTTTG
AAAAACTTGT TTCTGGAAAT ACTCAAAGCT TTCGGGACTC GGTATAAGCT ACTCCACCTA
TCAGAGACTG TGGAGTGCAG AGGCGGCGAG GTTGTTGGCA CCGTCTGCAG AAATACAGAA
ACTGTAGACG ACGCCGTGGC GCGAAAAACC TTAGGAGTCC CCTTGAGGCT TGTTGCAAAG
TTGCAACTGG GAGAGTACTA CGTGATCGAT ATTGACGCCC TCAAGAGACT TACAGACGAA
ATAAAGCGGC GCATCGAAGA GAGGCGGGAA ATCGACCTGA GAAAAATCGT GGAGCAGTAC
CGCCGAGGGC TAGTATAG
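
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. As a minimal sketch, the helpers below (hypothetical names, not from any library) clean such a block and compute the G+C fraction. Applied to the full 1758-base sequence, the result could be compared with the reported GC content of 52%; only the first three blocks are used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
prefix = clean_sequence("GTGGCTATCC TGGGCATTGA CATTGCGCCC")
print(len(prefix), gc_fraction(prefix))
```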
 
Protein sequence
MAILGIDIAP GGLFAYAVVD NDVVVEKGTA SARDLASVFK KYRIQKLALD NLGELFQYGR 
SVIRLLGKLP YDVNVVEVTR VGEGYVRTED LVRQHLGVVK GRLDPQETAI YLAMLAGRGV
GTPVKLFEEE TVVLVYRRIS TTPGGMSRNR YMRNISHRIR DIAARIEAKL KEAKLDYDLF
LKEESGEVTS AKFIVYASKE VVRRYVKTMR SIDVAVSIYS APAKKGGVPT HGRYLIVGVD
PGIVTGVAVL TLDGEVLDTL ARRGFSRGDV LRYVHQWGVP VVVATDVADP PEYVKRLASM
CGAVLYVPSR DLTSEEKAEV LEKVGWRAKT THERDALAAA FKAYQDYKPK FEKIEKEFGG
ILKPDQLEYA RALVAKGYSI AQAVSEALKR REEKETKVIY VTVEKPCGSR DEALTARIKA
LEYENMELQK ELENLRREYA QLKRAFEDAK WRDMKYRELQ NRIEALTAAL TQKEDEINAL
KNLFLEILKA FGTRYKLLHL SETVECRGGE VVGTVCRNTE TVDDAVARKT LGVPLRLVAK
LQLGEYYVID IDALKRLTDE IKRRIEERRE IDLRKIVEQY RRGLV
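
The protein sequence follows from the gene sequence under translation table 11. The sketch below is a simplified translator, not a reference implementation. It assumes that table 11 uses the standard codon assignments, that the listed alternative start codons are read as methionine at the first position (which explains why the initial GTG appears as M), and that translation stops at the first stop codon. All names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons assumed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(cds: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("GTGGCTATCC TGGGCATTGA CATTGCGCCC"))  # prints: MAILGIDIAP
```

The output matches the first block of the protein sequence above.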