Gene Memar_2038 details

Gene Information

Locus tag: Memar_2038
Symbol:
ID: 4847842
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanoculleus marisnigri JR1
Kingdom: Archaea
Replicon accession: NC_009051
Strand:
Start bp: 2030489
End bp: 2032408
Gene Length: 1920 bp
Protein Length: 639 aa
Translation table: 11
GC content: 65%
IMG OID: 640116747
Product: peptidyl-arginine deiminase
Protein accession: YP_001047946
Protein GI: 126179981
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG0388] Predicted amidohydrolase
        [COG2957] Peptidylarginine deiminase and related enzymes
TIGRFAM ID:
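
The coordinate and length fields above are mutually consistent, which a few lines of Python can confirm. This is only a sketch: the function name `cds_lengths` is hypothetical, and it assumes the coordinates are 1-based and inclusive and that the reported protein length excludes the terminal stop codon.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon that is not counted
    in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record for Memar_2038.
gene_bp, protein_aa = cds_lengths(2030489, 2032408)
print(gene_bp, protein_aa)  # 1920 639
```

Both results match the Gene Length (1920 bp) and Protein Length (639 aa) fields of the record.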


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGATGA GCACGCAAAC GATCGGCCTC ATCCAGACGG CGGTGAGCGA GGATCCCGGC 
CGCAACCTGG AACGCACCCT CGGTATGGCG AAAGCGGCGA TCGCGAAAGG TGCGCGGATT
CTCTGCCTGC AGGAACTCTA CCGGGCCCCC TACTTCCCGC AGTACGAGGA TACGGATGCT
TCCCGCTACG CCGAGACGAT CCCCGGGCCG TCGACCGAAG CGTTCTCGGC GCTTGCCCGG
GAGCACGGTG TCGTGATCGT CGTTCCGGTC TACGAGCGGA CTATATCCGG CGAGCACTAC
AACACCGCCG TGGTGATCGA CGCCGATGGA CGGCTGCTCC CTGCCTACCG GAAGGTGCAC
GTCCCCTACG ATCCGCTCTT CTACGAGAAG ATCTATTTTC TGCCCGGGGA CCGCTACCGG
GTCTACGATA CCCGGTACGG CCGGATCGCC GTGCTCATCT GCTACGACCA GTGGTTCCCG
GAAGCCGCGA GGGCGGTTGC GCTCATGGGC GCGGAGTTCA TCTTCTACCC GACCGCCATT
GGCAGGATCG CGGGCGAAGA GCCGCCCGAG GGCGACTGGC GCGAGGCGTG GGAGACGGTG
CAGCGCGGCC ACGCGATCGC AAACAGCGTC CACGTCGCCG CCGTCAACCG TGTGGGTGAC
GAGGGGGATC TCCGGTTCTT CGGGAGTTCG TTTGTCGCCG ATGCGTTCGG GAACGTCCTC
GCCCGGGCGA GCGAGACCGG TGAGGAAATC CTCATCGTCG AGGTCGACCT CGCGGGAAAC
GAGGCCGTCC GGGAGGGGTG GGGATTCTTC CGGAACCGGC GGCCGGAGAC CTACGGGGCG
CTCGCCCGGA GGCTCCCGGC GGGCCGAACC CCTGCGGTAT CCGGCTACCG TATGCCGGCG
GAGTGGGAGC CGCACGATGC CGTCTGGCTC TCCTGGCCCC ACGACCGCGA GACGTTTCCC
GACCTTGCGG CGGTGGAGGG GATCTACGTC GAGATCATCG CGGCGCTCCG GGGGTCGGAG
ACCGTCAACC TGCTCGTCAC CGACGAGAAG ATGCATATCC GGGTGAAGGC GATGCTCGAA
GAGGAGGGCG TCGATACGGC TGGAATCAGG TTCCATCTCG CCGATTACGC CGACGTCTGG
TTCCGGGACT ACGGGCCGAC GTTTCTGGTC GACAGAAAGG CCGGGAACCT CGCGATGGTG
AACTGGACGT TCAACGCCTG GGGGGAGAAG TACACGGAAC TTATGGAGGA TACCCGGATC
CCGCTCGCCA TGAACCGCGA GATGGAGATC CCCATCTTCA CCCCCGGTTT CGTCCTCGAG
GGGGGTTCGG TCGAGGTGAA CGGGTGCGGC ACGGTGATCA CGACGGAGGC GTGCCTCTTG
AACCCCAACC GGAACCCGCA CCTCTCCCGG GAGGAGATCG AGGCCTACCT GGAGGCTTAC
CTCGGCGCCG GCCACGTCAT CTGGTTGAAG CAGGGGATCG CTGGCGACGA CACCGACGGC
CACGTCGACG ATATCGCCCG GTTCGTGGAC GAGCGGACTG TTCTCTGCGC GCTCGAGGAG
AATGAGGACG ATGAGAACTA CGCTGCCCTG CAGGAGAACT ACGAGTTCCT CCTCTCCTCG
ACCGACCAGG ACGGCAACCC CTTAACGGTC ATCCCCCTCC CGATGCCGGG GAGAGTCGGC
GGCGCGGAGA GGCTGCCGGC AAGTTACGCG AACTTCTACA TAGGAAACAC CGTCGTGCTG
GTGCCGGTTT TTAAGCACCC GAACGACGGG ATTGCCATGG CAAGGATCCA GCAGGCCTTC
CCCGACCGGG AGGTCGTCGG GATCGACTGC ACGGCGATGG TCGCGGGTTT CGGTGCGATT
CACTGCATCA GTCAGCAGCA GCCGTCGTCG GGAGAATCCG GAGCACGACC GGGACAATAA
 
Protein sequence
MEMSTQTIGL IQTAVSEDPG RNLERTLGMA KAAIAKGARI LCLQELYRAP YFPQYEDTDA 
SRYAETIPGP STEAFSALAR EHGVVIVVPV YERTISGEHY NTAVVIDADG RLLPAYRKVH
VPYDPLFYEK IYFLPGDRYR VYDTRYGRIA VLICYDQWFP EAARAVALMG AEFIFYPTAI
GRIAGEEPPE GDWREAWETV QRGHAIANSV HVAAVNRVGD EGDLRFFGSS FVADAFGNVL
ARASETGEEI LIVEVDLAGN EAVREGWGFF RNRRPETYGA LARRLPAGRT PAVSGYRMPA
EWEPHDAVWL SWPHDRETFP DLAAVEGIYV EIIAALRGSE TVNLLVTDEK MHIRVKAMLE
EEGVDTAGIR FHLADYADVW FRDYGPTFLV DRKAGNLAMV NWTFNAWGEK YTELMEDTRI
PLAMNREMEI PIFTPGFVLE GGSVEVNGCG TVITTEACLL NPNRNPHLSR EEIEAYLEAY
LGAGHVIWLK QGIAGDDTDG HVDDIARFVD ERTVLCALEE NEDDENYAAL QENYEFLLSS
TDQDGNPLTV IPLPMPGRVG GAERLPASYA NFYIGNTVVL VPVFKHPNDG IAMARIQQAF
PDREVVGIDC TAMVAGFGAI HCISQQQPSS GESGARPGQ
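
The sequences above are printed in space-separated blocks of ten characters. The following sketch shows one way to join such blocks, compute a GC fraction, and translate the DNA into protein. The helper names (`join_blocks`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for elongation codons. Only the first 60 bases of the gene sequence are used as input here.

```python
from itertools import product

# Standard codon assignments, codons ordered with bases T, C, A, G.
# Translation table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly blocked) DNA string."""
    seq = join_blocks(text)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(text: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = join_blocks(text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as printed in the record.
first_line = "ATGGAGATGA GCACGCAAAC GATCGGCCTC ATCCAGACGG CGGTGAGCGA GGATCCCGGC"
print(translate(first_line))  # MEMSTQTIGLIQTAVSEDPG
print(round(gc_fraction(first_line), 2))
```

The translated fragment agrees with the first twenty residues of the protein sequence. Applying `gc_fraction` to the full gene sequence gives a value to compare with the GC content field.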