Gene Mthe_1634 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_1634 
Symbol 
ID4462506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1778221 
End bp1779231 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content55% 
IMG OID639700653 
Productflap endonuclease-1 
Protein accessionYP_844041 
Protein GI116754923 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) 
TIGRFAM ID[TIGR03674] flap structure-specific endonuclease 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGTCG ATCTCGGAGA TATTCTCAGC AAAAAGAAGA TCTCCCTTGA GAATCTGTCT 
GGATGCTGGA TAGCAGTCGA TGGATTCAAC ACGCTGTACC AGTTCCTGTC GATCATAAGA
CAGCCTGACG GCACACCTCT CATGGACGCC TCCGGAAGGG TCACATCGCA CCTCTCGGGA
TTGCTCTACC GCATGACGAA CCTCATAGAG GTCGGGATCA GGGTTGCGTT TGTCTTCGAT
GGCACGCCTC CTGAGCTCAA GGCCGGGACG CTCGCTGCCA GGGCTCAGAT GAAGGAGGCA
GCGGAGATCC AGCTGCAGGA GGCGATAGCC ACAGGCGTCG ATAGCTTCAG GTATGCACAG
GCCACCGCCA GGATAAACAG CGAGATACTT CATGACTCCA TAAGGCTCCT GGATGCCATG
GGCATCCCAT ATGTGCAGGC GCCCTCAGAG GGCGAGGCGC AGGCAGCATT CATGGCGATT
CGGGGGGATG TTGATTATGT AGCATCTCAG GACTACGACT CCCTGCTCTT CGGCGCGCCG
AGGGTTGTGA GGAATCTTGC AATCACAGGC AGGAGGAAGA TGCCCAGGAA GAACATTTAC
ATCGATGTTC CTCCTGAGGT CATCATCCTG GAGGAGGAGC TCACGAGGCT CGGGATAAGC
AGGGAGCAGC TCATAGATAT CGGAATAATG TGCGGTACCG ATTACAACAG AGGACTTCCA
AAGGTGGGTC CTAAGAGGGC GCTCAAGCTG ATACGAGAGC ACGGATGCCT GGAGGCTGTG
CTCGATGCGC TTGGAGAGAG CATTGAAAAT TTTCGGGAAA TAAGAGAACT ATTCCTGCAT
CCTGCGGTCA CGGAGAGCTA CGAGCTGAGG ATGAGAAAGC CCATGGTCGA TGAGATCGTC
GGGTTTTTGT GCAACGAGCG CAACTTCTCA GAGGATAGGG TCAGAAAGGC CGCTGAGAGG
TTGAATGCGT CGTACCGTTC CGGCCAGAGC ACACTGGAGA GGTGGCTCTG A
 
Protein sequence
MGVDLGDILS KKKISLENLS GCWIAVDGFN TLYQFLSIIR QPDGTPLMDA SGRVTSHLSG 
LLYRMTNLIE VGIRVAFVFD GTPPELKAGT LAARAQMKEA AEIQLQEAIA TGVDSFRYAQ
ATARINSEIL HDSIRLLDAM GIPYVQAPSE GEAQAAFMAI RGDVDYVASQ DYDSLLFGAP
RVVRNLAITG RRKMPRKNIY IDVPPEVIIL EEELTRLGIS REQLIDIGIM CGTDYNRGLP
KVGPKRALKL IREHGCLEAV LDALGESIEN FREIRELFLH PAVTESYELR MRKPMVDEIV
GFLCNERNFS EDRVRKAAER LNASYRSGQS TLERWL