Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_0078 |
Symbol | |
ID | 4463358 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 65888 |
End bp | 67762 |
Gene Length | 1875 bp |
Protein Length | 624 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639699087 |
Product | peptidyl-arginine deiminase |
Protein accession | YP_842520 |
Protein GI | 116753402 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0388] Predicted amidohydrolase [COG2957] Peptidylarginine deiminase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGAGGG TAGGGCTTGT CCAGACTAGG GTCACCGAGG ACCTGAACTT CAATCTAGCT AGGGCTCTAG ATCTGGTGGA GGATGCTGCG AACAGGGGCG CGGAGATCGT CTGCCTGCCG GAGCTCTACA GGACTTCATA CTTTCCCAGA GAAAAGAACG CAAAGGTCCA GCAATATGCT GAAACGATCC CCGGTGAATC GACCGCGGCA TTCTCGAGGC TCGCAGCCCG GATGAATGTT GTTGTAATAG TCCCGCTCTT CGAGCGGTAT GGTAGCGTTT ATTACAATTC CGCAGCAGTC ATCGATGCTG ATGGCTCGAT CGCAGGTGTG TACAGGAAGT CCCACATTCC GTGCGACCCC ATGTTCTATG AGAAGATGTA CTTTTTCCAG GGGGACGGCT TTAGGGTCTT CCGAACGCGC CATGCCTGTC TTGCGGTCCT GATATGCTAC GACCAGTGGT TCCCGGAGGC AGCGCGCTCT GTTGTTCTTG ATGGCGCAGA CATAATTTTC TATCCGACCG CCATCGGCAG GGTAAGAGGT GTGGAGCAGG CAGAGGGCGA CTGGCAGACC GCCTGGGAGA CCGTACAGCG TGGGCATGCG ATTGCGAACG GCGTGCATGT GGCCGCGGTG AACCGAGTGG GCGTGGAGGG CGAGATTGCC TTCTGGGGAG GATCGTTTGT ATGCGACTCG TTTGGTAATC TTCTCGCACA TGCCGGCTCT GATGAGGAGG TTCTTCTCGC GGATCTCGAT CTCTCTAAGA ACTTGATGGT TAGAGAGGGG TGGGGTTTCA TCAAGAACAG GCGGCCCGAT GCGTACCGAG TCCTTGTGAG GGATATTGAG AGACGCCTCC TGACACCGAG CAGCGAGGGC TACCACATGC CGGCGGAGTG GGAGCGGCAT GATGGTGTAT GGCTCGCCTG GCCGCACGAC ACTGATACTT TTAATGATAT CGATTCTGTG GAGCGCGCAT ACGTCTCGAT GATAAAAGCG CTCCATGTGG GAGAGACCGT GAACCTGCTG GTGAGAGATG AGGAGATGCG CGAGCGGGTG GAGCATCTCC TTCAGAGGGA TGTCAGGATG AGCAGCCTGA GAATCCACAC CATAGATTAC GCGGACGTCT GGTTCAGGGA CTACGGCCCC ACATTCGTTG TGAACAAAAA AGAAAAACGT CTCGGAATGG TCGCCTGGAA CTTCAACGCA TGGGGTGGAA AGTACTGCGA GCTCATGGGG GATGTGAAGA TACCCTGCTA CATCGCGCGC GATCTCGGCG TCAGGTGCTT CCGTCCCGGG ATCGTGCTTG AGGGCGGATC GATAGATGTC AATGGCTCCG GAACTCTCAT GACCACGGAG CAGTGCCTGC TGAATCCGAA CAGGAATCCT CTCATGAGCA GGTGGGATAT GGAGTTCTAC CTCAGGGAGT ATCTGGGCGT CAGAAAGATA ATCTGGCTCA GGAGGGGAAT AGCGGGCGAT GACACCGACG GCCATGTCGA TGATGTTGCC AGATTCGTAT CCCCAAGAAG GGTGGTTCTT GCATTCGAGG AAGACAAAGA TGACGAGAAC CATGAGCCTC TGCGGGAGAA CTACGAGATT CTCAAACATG AGACAGACCA GGATGGGAAT CCGCTAGAGG TTATCCGCCT GCCGATGCCG GGTTATGTTG GAGATGAGGA GCGGCTGCCA GCAAGTTACG CGAACTTCTA CATCGGGAAC AGAGCGGTGC TCGTGCCGGT CTTCGGCCAC AGGAACGATG CGAGGGCTCT GAGCATCATC GGCGCACTGT TCCCGGATAG AGAGGTCGTG GGGATCGATG CGCTGGCGAT GGTCTATGGT CTTGGCACGA TTCACTGCGT TACGCAGCAG CAGCCTGCGG TGTAG
|
Protein sequence | MVRVGLVQTR VTEDLNFNLA RALDLVEDAA NRGAEIVCLP ELYRTSYFPR EKNAKVQQYA ETIPGESTAA FSRLAARMNV VVIVPLFERY GSVYYNSAAV IDADGSIAGV YRKSHIPCDP MFYEKMYFFQ GDGFRVFRTR HACLAVLICY DQWFPEAARS VVLDGADIIF YPTAIGRVRG VEQAEGDWQT AWETVQRGHA IANGVHVAAV NRVGVEGEIA FWGGSFVCDS FGNLLAHAGS DEEVLLADLD LSKNLMVREG WGFIKNRRPD AYRVLVRDIE RRLLTPSSEG YHMPAEWERH DGVWLAWPHD TDTFNDIDSV ERAYVSMIKA LHVGETVNLL VRDEEMRERV EHLLQRDVRM SSLRIHTIDY ADVWFRDYGP TFVVNKKEKR LGMVAWNFNA WGGKYCELMG DVKIPCYIAR DLGVRCFRPG IVLEGGSIDV NGSGTLMTTE QCLLNPNRNP LMSRWDMEFY LREYLGVRKI IWLRRGIAGD DTDGHVDDVA RFVSPRRVVL AFEEDKDDEN HEPLRENYEI LKHETDQDGN PLEVIRLPMP GYVGDEERLP ASYANFYIGN RAVLVPVFGH RNDARALSII GALFPDREVV GIDALAMVYG LGTIHCVTQQ QPAV
|
| |