Gene Pars_1920 details


Gene Information

Locus tag: Pars_1920
Symbol:
ID: 5055221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1725066
End bp: 1726187
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 56%
IMG OID: 640469466
Product: hypothetical protein
Protein accession: YP_001154119
Protein GI: 145592117
COG category: [S] Function unknown
COG ID: [COG1679] Uncharacterized conserved protein
TIGRFAM ID:
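
The coordinate and length fields above are consistent with one another, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    length = gene_length(1725066, 1726187)
    print(length, "bp")                    # compare with the Gene Length field
    print(protein_length(length), "aa")    # compare with the Protein Length field
```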


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCTAGAG AATACGCCCG CGAGGTTGTG TTGAAAATCG CGGACGCTGT GTCTGGCGGC 
GAGGTTGTGC CTGTGGAGAC AGCTCATGTG TCAGGAGTGT CGTTCCTCAC GGTGGGAGAA
TATGGAGTGG AATTTCTCGA ACACCTAGCC GCCTCGGGCG CGCGTGTTTC TGTATTTACG
ACCTCTAACC CAGCCGCCGT TGATTTAGCC GGCGTGTTGG GAGTGGACGA GGCGGTGGCG
AAGGGGCAGG AGAGGATTAC CAAGGCGCTG AGGGCCATGG GGGTGAATAC TTTTTTCTCA
TGTACGCCTT ATGAGTTTGT CATTACACGT CAACGTACTT TCCACGCCTG GGCTGAGTCC
AACGCGATTA CTTACATCAA CACGTTTAGA GACGCTTGGT CTGACAAAAA CCCCGGCCCC
CTGGCCCTGT TAGGAGCGAT AGCCGGCTTC GTGCCGAAAA CTCCTCTGTA CACCCTGGAG
GGCAGACGGC CTACGGTGCT TGTGGAGGTG GAGGCCGGCC CACTAGGCCC TCTAGAGGCC
GGAGCCGTGG GGGCTTTAAT GGGGGAGCAA ATAGGCTCAG GCGTGCCATA TGTGAGGGGG
CTTTCTTTAA CCGGCGAGGG GGCTAGGCGA GAGTTCGCGG CGGCCCTCTC CACGTACTCC
GCCATGGTCT TCGCAGTGGT GGAGGGCGTC ACTCCTAATT GGAAGGAGTA CCTAGAAATT
GCCGATTTTA GGGAAAAGAT AAGGATATCC CAAGGCGACG TTGCTAAGTT TTTGAGGAAC
GACGAGACCC CTGATGTGGT CTACTTCGGT TGCCCCTTTG CCGACGTCGA CTCTGTATTG
TGGGTTTTGG CAGAGGTCAA GAAGAGGGGG GTCCCCAAAA GACCTATCTA CATTTCCACG
TCTCCTGGCG TTTACGGGAT TTTGGGGAGG CTGGTGGAAG AGGCCGAGAG GTATAATGTG
CATATATTTA CGGGCTCTTG TCTAGTGGTT TCTCCTCACA CCCGCAAGTT TAGGACAATC
GCCACTGACT CCCTAAAGGC TGTCTACTAC ATCCCGAGAC TCCACGGCGT TGGGGTAGTG
CCGTGTAGAA GGGAGAGATG TCTCGACTTG GCATATGCTT AA
 
Protein sequence
MSREYAREVV LKIADAVSGG EVVPVETAHV SGVSFLTVGE YGVEFLEHLA ASGARVSVFT 
TSNPAAVDLA GVLGVDEAVA KGQERITKAL RAMGVNTFFS CTPYEFVITR QRTFHAWAES
NAITYINTFR DAWSDKNPGP LALLGAIAGF VPKTPLYTLE GRRPTVLVEV EAGPLGPLEA
GAVGALMGEQ IGSGVPYVRG LSLTGEGARR EFAAALSTYS AMVFAVVEGV TPNWKEYLEI
ADFREKIRIS QGDVAKFLRN DETPDVVYFG CPFADVDSVL WVLAEVKKRG VPKRPIYIST
SPGVYGILGR LVEEAERYNV HIFTGSCLVV SPHTRKFRTI ATDSLKAVYY IPRLHGVGVV
PCRRERCLDL AYA
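
The relationship between the two sequences can be reproduced with a minimal sketch, under a few stated assumptions. The codon-to-amino-acid assignments used here are those of the standard genetic code, which translation table 11 shares. An initiator codon is read as methionine, and only ATG, GTG and TTG are treated as initiators, which is a subset chosen for illustration. Whitespace between the 10-base blocks is ignored. The helper names `clean`, `translate` and `gc_content` are hypothetical.

```python
BASES = "TCAG"
# Standard amino-acid assignments, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: only these common initiator codons are recognised.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base up to (not including) the first stop codon."""
    seq = clean(dna)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # an initiator codon is read as methionine
        protein.append(residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    first_blocks = "GTGTCTAGAG AATACGCCCG CGAGGTTGTG"  # start of the gene sequence
    print(translate(first_blocks))
    print(round(gc_content(first_blocks), 2))
```

Passing the full gene sequence to `translate` and `gc_content` gives values to compare against the protein sequence and the GC content field of the record.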