Gene Tneu_1985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_1985 
Symbol 
ID6165667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp1747698 
End bp1748747 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content58% 
IMG OID641669149 
Productflap endonuclease-1 
Protein accessionYP_001795346 
Protein GI171186427 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) 
TIGRFAM ID[TIGR03674] flap structure-specific endonuclease 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGTTA CAGAGTTGGG CAAGCTTATA GGTAGAGAGG CGAGGCGCGA GATAAAGCTC 
GAAAACCTAG CCGGCAGATG TATCGCCCTA GACGCCTACA ACGCCCTATA CCAGTTCCTC
GCCTCGATTA GACAACCCGA CGGCACACCT CTGATGGACC GCCAGGGCAG AGTCACTAGC
CACCTCTCAG GGCTCTTCTA CCGTACCATT AACCTGATGG AGGCCGGCAT AAAGCCGGTT
TACGTATTTG ACGGGAAGCC GCCTGAGTTC AAGCTGGCCG AGATAGAGGC GAGGAGGAGG
GTTAAGGAGA AGGCCATGGA GGAGGTGGTG AAGGCGATAA GGGAGGGGAA GAGAGATGAC
GTGGCGAAGT ACATGAAGAG GGTTATCTTT CTCACCAATG AGATGGTTGA GGACGCCAAG
AGGCTACTGA CCTACATGGG CGTCCCCTGG GTGCAAGCGC CGAGCGAGGG GGAGGCTCAG
GCCGCCCACA TGGCGAAGAG GGGGCACTGT TGGGCCGTCG GTAGCCAAGA CTACGACTCG
CTTCTATTCG GCTCGCCTAG GTTGGTGAGG AATCTCGCCG TGTCGCCTAA GAGGAGGAGC
GGGGAGGAGG TGGTGGAGGT GTCCCCGGAG GTGGTGGAGC TAGACTCCGT GTTGAAAGCG
CTTAAGCTGA AGGGCAGGGA ACAGCTTATA GACGTTGCCA TACTGCTGGG AACCGACTAC
AACCCCGACG GGGTCCCGGG GGTCGGGCCT CAGAAGGCGC TCAAGTTGGT TTTGGAGTTC
GGCTCTCTTG AGAAGATGCT AGACACGGTT CTCAGGGGGG TCTCCTTCCC TGTGGATCCC
CTCGAGATAA AGAGGTTTTT CCTCAATCCG CCTGTCACAG AGGAGTACGC CCTGGAGTTG
AAGAACGTAG ACGAGCGAGG CCTTGTGAAT TTCCTCGTCG GCGAACACGA CTTCAGCGAG
GAGAGAGTCG CCAAGGCTGT GGAGAGGCTT AAAAAGGCGC GGGCTAGGCA GAAAACCTCG
TCTCTAGACA GCTTTTTCCA TGGCGCATAG
 
Protein sequence
MGVTELGKLI GREARREIKL ENLAGRCIAL DAYNALYQFL ASIRQPDGTP LMDRQGRVTS 
HLSGLFYRTI NLMEAGIKPV YVFDGKPPEF KLAEIEARRR VKEKAMEEVV KAIREGKRDD
VAKYMKRVIF LTNEMVEDAK RLLTYMGVPW VQAPSEGEAQ AAHMAKRGHC WAVGSQDYDS
LLFGSPRLVR NLAVSPKRRS GEEVVEVSPE VVELDSVLKA LKLKGREQLI DVAILLGTDY
NPDGVPGVGP QKALKLVLEF GSLEKMLDTV LRGVSFPVDP LEIKRFFLNP PVTEEYALEL
KNVDERGLVN FLVGEHDFSE ERVAKAVERL KKARARQKTS SLDSFFHGA