Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1920 |
Symbol | |
ID | 5055221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1725066 |
End bp | 1726187 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640469466 |
Product | hypothetical protein |
Protein accession | YP_001154119 |
Protein GI | 145592117 |
COG category | [S] Function unknown |
COG ID | [COG1679] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCTAGAG AATACGCCCG CGAGGTTGTG TTGAAAATCG CGGACGCTGT GTCTGGCGGC GAGGTTGTGC CTGTGGAGAC AGCTCATGTG TCAGGAGTGT CGTTCCTCAC GGTGGGAGAA TATGGAGTGG AATTTCTCGA ACACCTAGCC GCCTCGGGCG CGCGTGTTTC TGTATTTACG ACCTCTAACC CAGCCGCCGT TGATTTAGCC GGCGTGTTGG GAGTGGACGA GGCGGTGGCG AAGGGGCAGG AGAGGATTAC CAAGGCGCTG AGGGCCATGG GGGTGAATAC TTTTTTCTCA TGTACGCCTT ATGAGTTTGT CATTACACGT CAACGTACTT TCCACGCCTG GGCTGAGTCC AACGCGATTA CTTACATCAA CACGTTTAGA GACGCTTGGT CTGACAAAAA CCCCGGCCCC CTGGCCCTGT TAGGAGCGAT AGCCGGCTTC GTGCCGAAAA CTCCTCTGTA CACCCTGGAG GGCAGACGGC CTACGGTGCT TGTGGAGGTG GAGGCCGGCC CACTAGGCCC TCTAGAGGCC GGAGCCGTGG GGGCTTTAAT GGGGGAGCAA ATAGGCTCAG GCGTGCCATA TGTGAGGGGG CTTTCTTTAA CCGGCGAGGG GGCTAGGCGA GAGTTCGCGG CGGCCCTCTC CACGTACTCC GCCATGGTCT TCGCAGTGGT GGAGGGCGTC ACTCCTAATT GGAAGGAGTA CCTAGAAATT GCCGATTTTA GGGAAAAGAT AAGGATATCC CAAGGCGACG TTGCTAAGTT TTTGAGGAAC GACGAGACCC CTGATGTGGT CTACTTCGGT TGCCCCTTTG CCGACGTCGA CTCTGTATTG TGGGTTTTGG CAGAGGTCAA GAAGAGGGGG GTCCCCAAAA GACCTATCTA CATTTCCACG TCTCCTGGCG TTTACGGGAT TTTGGGGAGG CTGGTGGAAG AGGCCGAGAG GTATAATGTG CATATATTTA CGGGCTCTTG TCTAGTGGTT TCTCCTCACA CCCGCAAGTT TAGGACAATC GCCACTGACT CCCTAAAGGC TGTCTACTAC ATCCCGAGAC TCCACGGCGT TGGGGTAGTG CCGTGTAGAA GGGAGAGATG TCTCGACTTG GCATATGCTT AA
|
Protein sequence | MSREYAREVV LKIADAVSGG EVVPVETAHV SGVSFLTVGE YGVEFLEHLA ASGARVSVFT TSNPAAVDLA GVLGVDEAVA KGQERITKAL RAMGVNTFFS CTPYEFVITR QRTFHAWAES NAITYINTFR DAWSDKNPGP LALLGAIAGF VPKTPLYTLE GRRPTVLVEV EAGPLGPLEA GAVGALMGEQ IGSGVPYVRG LSLTGEGARR EFAAALSTYS AMVFAVVEGV TPNWKEYLEI ADFREKIRIS QGDVAKFLRN DETPDVVYFG CPFADVDSVL WVLAEVKKRG VPKRPIYIST SPGVYGILGR LVEEAERYNV HIFTGSCLVV SPHTRKFRTI ATDSLKAVYY IPRLHGVGVV PCRRERCLDL AYA
|
| |