Gene Mthe_0078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0078 
Symbol 
ID4463358 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp65888 
End bp67762 
Gene Length1875 bp 
Protein Length624 aa 
Translation table11 
GC content56% 
IMG OID639699087 
Productpeptidyl-arginine deiminase 
Protein accessionYP_842520 
Protein GI116753402 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0388] Predicted amidohydrolase
[COG2957] Peptidylarginine deiminase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGAGGG TAGGGCTTGT CCAGACTAGG GTCACCGAGG ACCTGAACTT CAATCTAGCT 
AGGGCTCTAG ATCTGGTGGA GGATGCTGCG AACAGGGGCG CGGAGATCGT CTGCCTGCCG
GAGCTCTACA GGACTTCATA CTTTCCCAGA GAAAAGAACG CAAAGGTCCA GCAATATGCT
GAAACGATCC CCGGTGAATC GACCGCGGCA TTCTCGAGGC TCGCAGCCCG GATGAATGTT
GTTGTAATAG TCCCGCTCTT CGAGCGGTAT GGTAGCGTTT ATTACAATTC CGCAGCAGTC
ATCGATGCTG ATGGCTCGAT CGCAGGTGTG TACAGGAAGT CCCACATTCC GTGCGACCCC
ATGTTCTATG AGAAGATGTA CTTTTTCCAG GGGGACGGCT TTAGGGTCTT CCGAACGCGC
CATGCCTGTC TTGCGGTCCT GATATGCTAC GACCAGTGGT TCCCGGAGGC AGCGCGCTCT
GTTGTTCTTG ATGGCGCAGA CATAATTTTC TATCCGACCG CCATCGGCAG GGTAAGAGGT
GTGGAGCAGG CAGAGGGCGA CTGGCAGACC GCCTGGGAGA CCGTACAGCG TGGGCATGCG
ATTGCGAACG GCGTGCATGT GGCCGCGGTG AACCGAGTGG GCGTGGAGGG CGAGATTGCC
TTCTGGGGAG GATCGTTTGT ATGCGACTCG TTTGGTAATC TTCTCGCACA TGCCGGCTCT
GATGAGGAGG TTCTTCTCGC GGATCTCGAT CTCTCTAAGA ACTTGATGGT TAGAGAGGGG
TGGGGTTTCA TCAAGAACAG GCGGCCCGAT GCGTACCGAG TCCTTGTGAG GGATATTGAG
AGACGCCTCC TGACACCGAG CAGCGAGGGC TACCACATGC CGGCGGAGTG GGAGCGGCAT
GATGGTGTAT GGCTCGCCTG GCCGCACGAC ACTGATACTT TTAATGATAT CGATTCTGTG
GAGCGCGCAT ACGTCTCGAT GATAAAAGCG CTCCATGTGG GAGAGACCGT GAACCTGCTG
GTGAGAGATG AGGAGATGCG CGAGCGGGTG GAGCATCTCC TTCAGAGGGA TGTCAGGATG
AGCAGCCTGA GAATCCACAC CATAGATTAC GCGGACGTCT GGTTCAGGGA CTACGGCCCC
ACATTCGTTG TGAACAAAAA AGAAAAACGT CTCGGAATGG TCGCCTGGAA CTTCAACGCA
TGGGGTGGAA AGTACTGCGA GCTCATGGGG GATGTGAAGA TACCCTGCTA CATCGCGCGC
GATCTCGGCG TCAGGTGCTT CCGTCCCGGG ATCGTGCTTG AGGGCGGATC GATAGATGTC
AATGGCTCCG GAACTCTCAT GACCACGGAG CAGTGCCTGC TGAATCCGAA CAGGAATCCT
CTCATGAGCA GGTGGGATAT GGAGTTCTAC CTCAGGGAGT ATCTGGGCGT CAGAAAGATA
ATCTGGCTCA GGAGGGGAAT AGCGGGCGAT GACACCGACG GCCATGTCGA TGATGTTGCC
AGATTCGTAT CCCCAAGAAG GGTGGTTCTT GCATTCGAGG AAGACAAAGA TGACGAGAAC
CATGAGCCTC TGCGGGAGAA CTACGAGATT CTCAAACATG AGACAGACCA GGATGGGAAT
CCGCTAGAGG TTATCCGCCT GCCGATGCCG GGTTATGTTG GAGATGAGGA GCGGCTGCCA
GCAAGTTACG CGAACTTCTA CATCGGGAAC AGAGCGGTGC TCGTGCCGGT CTTCGGCCAC
AGGAACGATG CGAGGGCTCT GAGCATCATC GGCGCACTGT TCCCGGATAG AGAGGTCGTG
GGGATCGATG CGCTGGCGAT GGTCTATGGT CTTGGCACGA TTCACTGCGT TACGCAGCAG
CAGCCTGCGG TGTAG
 
Protein sequence
MVRVGLVQTR VTEDLNFNLA RALDLVEDAA NRGAEIVCLP ELYRTSYFPR EKNAKVQQYA 
ETIPGESTAA FSRLAARMNV VVIVPLFERY GSVYYNSAAV IDADGSIAGV YRKSHIPCDP
MFYEKMYFFQ GDGFRVFRTR HACLAVLICY DQWFPEAARS VVLDGADIIF YPTAIGRVRG
VEQAEGDWQT AWETVQRGHA IANGVHVAAV NRVGVEGEIA FWGGSFVCDS FGNLLAHAGS
DEEVLLADLD LSKNLMVREG WGFIKNRRPD AYRVLVRDIE RRLLTPSSEG YHMPAEWERH
DGVWLAWPHD TDTFNDIDSV ERAYVSMIKA LHVGETVNLL VRDEEMRERV EHLLQRDVRM
SSLRIHTIDY ADVWFRDYGP TFVVNKKEKR LGMVAWNFNA WGGKYCELMG DVKIPCYIAR
DLGVRCFRPG IVLEGGSIDV NGSGTLMTTE QCLLNPNRNP LMSRWDMEFY LREYLGVRKI
IWLRRGIAGD DTDGHVDDVA RFVSPRRVVL AFEEDKDDEN HEPLRENYEI LKHETDQDGN
PLEVIRLPMP GYVGDEERLP ASYANFYIGN RAVLVPVFGH RNDARALSII GALFPDREVV
GIDALAMVYG LGTIHCVTQQ QPAV