Gene Mhun_0034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMhun_0034 
Symbol 
ID3923490 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanospirillum hungatei JF-1 
KingdomArchaea 
Replicon accessionNC_007796 
Strand
Start bp51100 
End bp52995 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content45% 
IMG OID637895685 
Productpeptidyl-arginine deiminase 
Protein accessionYP_501531 
Protein GI88601353 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0388] Predicted amidohydrolase
[COG2957] Peptidylarginine deiminase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.137362 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATGCA TAATTGGCCT GATACAGATC TCTGTAAGTC CTCACAAATC CTGGAATATT 
CAGCATGCCA TGGAAAACAT CAGAGAGGCT GCTGAGAGTG GTGCACAGAT AATCTGCCTT
CCAGAATTAT TCTCAACACC ATATTTTCCA CAACATATCG GACTTGATAG TTCCCCATTT
ACCGACACAT GTGATGGCGC TACGATTTAC CGGTTCAGTA AACTGGCATT GGAACTGGGA
TGCGTTCTGA TTGTTCCAAT TTGTGAAAAA AGCTCAGATA ATCGAATATA TAATAGTGCA
GTGGTGATTG ATGCTGATGG GTCGGTTTTT CGGCCATATC GTAAGATTCA CATCCCTCAG
GATCCGCTCT TTTATGAGAA GGGATATTTT AACCCTGGCG ATGAATACAG AGTTTATAAG
ACAAAATATG CAAATCTAGC AGTTTTAATC TGTTTTGATC AATGGTTTCC CGAAGCAGCT
CGGGAGGTAG CACTGAATGG TGCTGATATC ATTTTTTATC CAACTGCTAT CGGTCATATC
AGGGGCGAGA TCCCCGCAGA AGGTGACTGG AAGGAGTCAT GGAAGGTAAT TCAGCGGTCA
CATGCTATCG CGAACAGTAT TCCGGTCGCT GCGGTAAACC GGTGTGGCTG GGAGGATGAA
CTCTTCTTTT TTGGAGGCTC CTTTATCTGC GACGCATTTG GAAAGATCCT CGTACAAGGA
GATATAGACG AGGAAATTAT TCTTGCAGAA GTTGATCTCT CGCTTGGCCC CTCAATCCGT
GAAGCATGGG GCTTTTTTAG AAACAGAAGG CCGGACACCT ATCACTCACT CACTGCTTTG
GAAAAACCCG GATCAAGCAC AGAATCTCTG ACACCGGCAA ACCAGGGATA TCACATGCCG
GCTGAATGGG AGCATCATGA TGCTGTCTGG ATGGCATGGC CATATAATGA TCTGACATTT
CCCCACCTTG AGGCAGTAGA AGAGACCTAC CTGACCATTC TCTCTTCACT CCGATCAGAA
CGGGTTGAAC TTCTCATCGC TGATCCCACC TATCAGGAGA AAATCCTACA CATGCTCTCA
TTCAGGGGTG TGGACTGTTC ACATATCAGG TTCCATATTG TTCATTACTC TGATGTCTGG
ATACGGGATT TCGGTCCGAC ATGTGTAGTA AACCGGGCCC TTCATGATGT ATCAGCAGTA
TTCTGGGATT TTAATGCATG GGGGAACAAG TATGATGAGC TCATTCTGGA TGGGGTAAAA
ACCCATGAGT TATTTCAGAC CCTTGGCATG AAGATATTCA GGCCTGGTAT CGTGCTTGAA
GGGGGATCTA TTGATAGCAA TGGACGGGGA TGTGTTCTTA CCACACGGCA ATGTCTCCTG
AATCCAAATC GTAATCCCCA TCTCACCCAG GACGATATAG AGCATTATCT GTGTGAATAT
CTGTGTGCAA GAAAGATCAT CTGGTTACAT GAGGGTATTG CAGGAGATGA TACCGACGGA
CATATTGATG ACATTGCACG GTTTGTGAAT CCAACCACGG TAGTATGTGC CATTGAGGAG
AATCAGGAGG ATGAGAACTA TCTTCCTTTA CAGGAAAATT TCAGGATTTT GTCTAAAGAA
ACCGATCAGA ATAATAATCC TTTAACGGTC ATCCCAATAC CGATGCCTCA TCCCGTGAAG
GATGAATCAA ACCGGTATCC GGCAAGTTAT CTCAATTTTT ATATTGGAAA TGAAGTAGTT
CTGGTCCCGG TTTTTGATGA TGAGCATGAT AGTCGTGCTC TTGAGATACT TCAGCCTCTC
TTCCCGGATC GTGAAGTCAT AGGAATACCG GCACGAGCCA TGGTTGAAGG ATTTGGAACG
ATTCATTGTG CGACGCAGCA GCAGCCTTTT GCATGA
 
Protein sequence
MSCIIGLIQI SVSPHKSWNI QHAMENIREA AESGAQIICL PELFSTPYFP QHIGLDSSPF 
TDTCDGATIY RFSKLALELG CVLIVPICEK SSDNRIYNSA VVIDADGSVF RPYRKIHIPQ
DPLFYEKGYF NPGDEYRVYK TKYANLAVLI CFDQWFPEAA REVALNGADI IFYPTAIGHI
RGEIPAEGDW KESWKVIQRS HAIANSIPVA AVNRCGWEDE LFFFGGSFIC DAFGKILVQG
DIDEEIILAE VDLSLGPSIR EAWGFFRNRR PDTYHSLTAL EKPGSSTESL TPANQGYHMP
AEWEHHDAVW MAWPYNDLTF PHLEAVEETY LTILSSLRSE RVELLIADPT YQEKILHMLS
FRGVDCSHIR FHIVHYSDVW IRDFGPTCVV NRALHDVSAV FWDFNAWGNK YDELILDGVK
THELFQTLGM KIFRPGIVLE GGSIDSNGRG CVLTTRQCLL NPNRNPHLTQ DDIEHYLCEY
LCARKIIWLH EGIAGDDTDG HIDDIARFVN PTTVVCAIEE NQEDENYLPL QENFRILSKE
TDQNNNPLTV IPIPMPHPVK DESNRYPASY LNFYIGNEVV LVPVFDDEHD SRALEILQPL
FPDREVIGIP ARAMVEGFGT IHCATQQQPF A