Gene Mthe_1441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_1441 
Symbol 
ID4461895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1539824 
End bp1541233 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content56% 
IMG OID639700460 
Productphosphoesterase domain-containing protein 
Protein accessionYP_843855 
Protein GI116754737 
COG category[L] Replication, recombination and repair 
COG ID[COG0608] Single-stranded DNA-specific exonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAATAG AGAAAATCTG TACATCTCTG AGGAGGCTGG CTGAGGATGG TGCAGAGGTT 
ATCTCCAGAT CAGAGGACGT CCTGGTCGTG TCGCACATCG ATGCTGATGG ACTGACTGCA
GCTGGTGTGA TATGTACAGC CCTCTCAAGA AAGGGCATAG ATTACACACC TCTATTCTTC
AAGCAACTTG ATAGTAGAGC AATTGAGCGC ATTGCAGATG CGGGTTCGGA TCTGGTGATC
TTCACCGATC TTGGAAGCGG CATGATCCAG GAGATATCAT CTTTTGGCAT CAGAGCTGTT
ATCGCGGACC ATCACCGTCC CGGGAGCGCT GAGATCTATA GAGGAATTGT GCACATCAAT
CCTCATCTCC TTGGCGGGGA TGGTGCAACC CATATAAGCG GCAGCGGGAC TGCATTCATC
CTCGCGAATG CTATGGGGAG GAACGAGGAT CTCTCTGCTC TCGCAGTTGT CGGCGCTGTC
GGCGATCTCC AGGATCTATC GTCGGGCAGG CTGGTCGGGC TCAACCGCAG GATAATCGAG
GTGGGATCAA GAGCGGGCGT CCTGTCGTTC GGTCCGGATC TAAAGCTCTT CGGAAAACAG
ACGCGGCCTG TCTTCAAGAT GCTTGAATAC TCAACAGATC CATATATTCC AGGGCTATCT
GGAGACGAGG AGGCGTGCAT ATCCTTCCTC AAGGAGATCG GCATACGGCT CGGCGGGGAG
CGATGGAGGA GGTGGATAGA TCTGGATCAG GATGAGCGCG CCAGGATCGT CTCCGCACTG
ATCCGTCATG GTTTGCGTTC TGGCATCCCC GGGTTCAGGC TTGAGCGTCT CGTCGGGGAG
GTATACACAC TCTCACGTGA GAGGGAGGGC ACCGAGCTCA GGGACGCGAG CGAGTTCTCC
ACTCTGCTGA ACGCCACAGC CAGATATGGC CATTCCAGGA TCGGTTTGGA TGTATGCCTC
GGCGACAGGG ACAGGGCTCT GGAGGAGGCG AGGCTTCTTC TGAGCCAGCA CAGGCAGAAC
CTTCTCAACG GCATAAAGCT GGTGAAGGAG AGAGGTCTTG TACAGCTCTC ACATATACAG
TACTTCGATG CAGGTGAGGA GATCCTTGAT ACCATAGTGG GCATCGTCGC AGGGATGGTC
TTTCAGATCG CAGATAGATC CAAGCCAATT CTTGCATTTG CAGCCACTCC CGACGGCATG
CTCAAGGTCT CGGCGCGTGG TACGCAGGAT CTCGTGAGAG CAGGCCTGGA TCTCTCGCGT
GCCGTATCCG CATCAGCTAA CGAGGTCGGT GGCGTCGGAG GGGGGCACAG CATTGCAGCT
GGCGCCACGA TACCCCCACA GAGCAGGGAT GAGTTTCTTT CGATAATTGA CAGGGTCATC
GGGGAGCAGA TGAAGTCGAG GGGCAGCTGA
 
Protein sequence
MKIEKICTSL RRLAEDGAEV ISRSEDVLVV SHIDADGLTA AGVICTALSR KGIDYTPLFF 
KQLDSRAIER IADAGSDLVI FTDLGSGMIQ EISSFGIRAV IADHHRPGSA EIYRGIVHIN
PHLLGGDGAT HISGSGTAFI LANAMGRNED LSALAVVGAV GDLQDLSSGR LVGLNRRIIE
VGSRAGVLSF GPDLKLFGKQ TRPVFKMLEY STDPYIPGLS GDEEACISFL KEIGIRLGGE
RWRRWIDLDQ DERARIVSAL IRHGLRSGIP GFRLERLVGE VYTLSREREG TELRDASEFS
TLLNATARYG HSRIGLDVCL GDRDRALEEA RLLLSQHRQN LLNGIKLVKE RGLVQLSHIQ
YFDAGEEILD TIVGIVAGMV FQIADRSKPI LAFAATPDGM LKVSARGTQD LVRAGLDLSR
AVSASANEVG GVGGGHSIAA GATIPPQSRD EFLSIIDRVI GEQMKSRGS