Gene Information |
Locus tag | Tneu_1985 |
Symbol | |
ID | 6165667 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 1747698 |
End bp | 1748747 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641669149 |
Product | flap endonuclease-1 |
Protein accession | YP_001795346 |
Protein GI | 171186427 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) |
TIGRFAM ID | [TIGR03674] flap structure-specific endonuclease |
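The coordinate fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Number of codons minus one terminal stop codon, for an unspliced CDS."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record: Start bp 1747698, End bp 1748747.
length = gene_length_bp(1747698, 1748747)
print(length, expected_protein_length_aa(length))  # 1050 349
```

Both results agree with the Gene Length (1050 bp) and Protein Length (349 aa) rows of the record.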
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGGTGTTA CAGAGTTGGG CAAGCTTATA GGTAGAGAGG CGAGGCGCGA GATAAAGCTC GAAAACCTAG CCGGCAGATG TATCGCCCTA GACGCCTACA ACGCCCTATA CCAGTTCCTC GCCTCGATTA GACAACCCGA CGGCACACCT CTGATGGACC GCCAGGGCAG AGTCACTAGC CACCTCTCAG GGCTCTTCTA CCGTACCATT AACCTGATGG AGGCCGGCAT AAAGCCGGTT TACGTATTTG ACGGGAAGCC GCCTGAGTTC AAGCTGGCCG AGATAGAGGC GAGGAGGAGG GTTAAGGAGA AGGCCATGGA GGAGGTGGTG AAGGCGATAA GGGAGGGGAA GAGAGATGAC GTGGCGAAGT ACATGAAGAG GGTTATCTTT CTCACCAATG AGATGGTTGA GGACGCCAAG AGGCTACTGA CCTACATGGG CGTCCCCTGG GTGCAAGCGC CGAGCGAGGG GGAGGCTCAG GCCGCCCACA TGGCGAAGAG GGGGCACTGT TGGGCCGTCG GTAGCCAAGA CTACGACTCG CTTCTATTCG GCTCGCCTAG GTTGGTGAGG AATCTCGCCG TGTCGCCTAA GAGGAGGAGC GGGGAGGAGG TGGTGGAGGT GTCCCCGGAG GTGGTGGAGC TAGACTCCGT GTTGAAAGCG CTTAAGCTGA AGGGCAGGGA ACAGCTTATA GACGTTGCCA TACTGCTGGG AACCGACTAC AACCCCGACG GGGTCCCGGG GGTCGGGCCT CAGAAGGCGC TCAAGTTGGT TTTGGAGTTC GGCTCTCTTG AGAAGATGCT AGACACGGTT CTCAGGGGGG TCTCCTTCCC TGTGGATCCC CTCGAGATAA AGAGGTTTTT CCTCAATCCG CCTGTCACAG AGGAGTACGC CCTGGAGTTG AAGAACGTAG ACGAGCGAGG CCTTGTGAAT TTCCTCGTCG GCGAACACGA CTTCAGCGAG GAGAGAGTCG CCAAGGCTGT GGAGAGGCTT AAAAAGGCGC GGGCTAGGCA GAAAACCTCG TCTCTAGACA GCTTTTTCCA TGGCGCATAG
Protein sequence | MGVTELGKLI GREARREIKL ENLAGRCIAL DAYNALYQFL ASIRQPDGTP LMDRQGRVTS HLSGLFYRTI NLMEAGIKPV YVFDGKPPEF KLAEIEARRR VKEKAMEEVV KAIREGKRDD VAKYMKRVIF LTNEMVEDAK RLLTYMGVPW VQAPSEGEAQ AAHMAKRGHC WAVGSQDYDS LLFGSPRLVR NLAVSPKRRS GEEVVEVSPE VVELDSVLKA LKLKGREQLI DVAILLGTDY NPDGVPGVGP QKALKLVLEF GSLEKMLDTV LRGVSFPVDP LEIKRFFLNP PVTEEYALEL KNVDERGLVN FLVGEHDFSE ERVAKAVERL KKARARQKTS SLDSFFHGA
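The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a string might be cleaned, checked for GC content, and translated. The helper names are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares; alternative start codons, which table 11 also allows, are not handled here.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino acid assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGGGTGTTA CAGAGTTGGG CAAGCTTATA"
print(translate(first_blocks))  # MGVTELGKLI
```

Applied to the full gene sequence, `translate` should give a string that can be compared with `clean_sequence` of the Protein sequence row, and `gc_content` can be compared with the GC content field.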