Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2348 |
Symbol | |
ID | 5056121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 2098994 |
End bp | 2100034 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640469899 |
Product | flap endonuclease-1 |
Protein accession | YP_001154543 |
Protein GI | 145592541 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) |
TIGRFAM ID | [TIGR03674] flap structure-specific endonuclease |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0196172 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAGTAA CTGAACTTGG GAAGCTTATT GGAAAAGAGG CGCGTAGGGA GGTCAAGTTG GAGGCCCTTG CAGGTAGGTG CGTAGCGCTC GATGCTTATA ACGCTTTGTA CCAATTTTTG GCGTCTATCA GACAGCCGGA CGGTACTCCT CTTATGGATA GGGCAGGGCG TATCACCAGC CACATCTCGG GGCTGTTTTA CCGCACGATT AACCTCATGG AGGCGGGTAT TAAGCCCGTC TACGTCTTTG ACGGAAAGCC TCCTGAGTTT AAGCTCGCCG AGATTGAGGA GAGAAGAAAA GCTAAGGAGA AGGCCACAGA AGAGCTTGTA AGAGCGATAA AAGAGGGCAG GAGGGATGAG GTGGCTAAAT ATGCAAAAAG GGCGATATTT CTCACAAACG AGATGGTGGA AGACGCCAAG AAGCTATTGA CGTATATGGG TGTTCCTTGG GTACAAGCCC CAAGCGAGGG GGAGGCGCAG GCGGCGTATA TGGCCAGGAG AGGGCACTGC TGGGCTGTGG GGAGCCAAGA TTACGACTCT CTACTTTTCG GATCGCCTAG GCTTGTCAGA AACCTCGCAA CGTCGCCCAA GCGGAAGGTG GGCGATGAGG TGGTAGAGCT CTCCCCAGAA ATTATAGAGC TAGATGCCGT CTTGAAATCC CTTCGCTTAA GGAGCAGGGA GCAACTCATC GACTTGGCCA TTCTGCTGGG CACCGACTAC AACCCCGATG GGGTGCCGGG TATTGGACCC CAAAGGGCGC TCAAACTCAT ATGGGAGTTC GGCTCACTTG AGAAGTTGCT AGACACCGTT CTTAGGGGAG TGACATTTCC TATTGATCCT GTTGAGATAA AGAGGTTTTT CCTTAATCCG CCTGTGACAG ATACTTATAC CACAGATGTG ACAAAGCCAG ACGACGCGAA GTTGAGGGAC TTTCTCGTGC ATGAGCACGA TTTTGGTGAA GAGAGAGTTG AGAGAGCACT AGAAAGGCTG AAAAAAGCCA TGGGCAAGTT AAGAACTTCG GCGCTGGACT CCTTCTTTTA A
|
Protein sequence | MGVTELGKLI GKEARREVKL EALAGRCVAL DAYNALYQFL ASIRQPDGTP LMDRAGRITS HISGLFYRTI NLMEAGIKPV YVFDGKPPEF KLAEIEERRK AKEKATEELV RAIKEGRRDE VAKYAKRAIF LTNEMVEDAK KLLTYMGVPW VQAPSEGEAQ AAYMARRGHC WAVGSQDYDS LLFGSPRLVR NLATSPKRKV GDEVVELSPE IIELDAVLKS LRLRSREQLI DLAILLGTDY NPDGVPGIGP QRALKLIWEF GSLEKLLDTV LRGVTFPIDP VEIKRFFLNP PVTDTYTTDV TKPDDAKLRD FLVHEHDFGE ERVERALERL KKAMGKLRTS ALDSFF
|
| |