Gene Mhun_2406 details


Gene Information

Locus tag: Mhun_2406
Symbol:
ID: 3922393
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanospirillum hungatei JF-1
Kingdom: Archaea
Replicon accession: NC_007796
Strand:
Start bp: 2653674
End bp: 2654984
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 52%
IMG OID: 637898016
Product: tetratricopeptide TPR_2
Protein accession: YP_503827
Protein GI: 88603649
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
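
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not; the record does not state either convention, but both reproduce its figures. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2653674, 2654984)
print(length)                  # 1311, matching "Gene Length"
print(protein_length(length))  # 436, matching "Protein Length"
```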


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGATTC GAGACTGGAT TGGGGGACAG GACGGGAGCC CGGCAACGCC ATACTGTCAG 
AAAGGAGAGA CACAACTTGT AAAGGAGAAG TATGAAGCGG CGGTTCAGAC ATTTAACCGG
GGCATTGAAC TGGACCGGAG CCATCCCGGA TGCTGGGTCG GGATGGGAAA AGCATTTCTC
GGTCTGGGCA GGTATGACCG TGCGGATGAC TGCTTTATCC GTGCCCTGGA TATCGATCCG
GAAAATCCTG AAGCCCTGAC CATGCGGGCA TCTGTCCTCC GTCTCATCGC CCTTCAGAAC
CAGGATCCCA TGCGGTGCCT GGAAGCAGTC GAGATATGCA ACAAGACCCT GAAGATCCAT
CCTGAGTACG GGCCGGCCCT CCATGAGAAA GGGATGGCGC TCTGGACCCT GGGAAAACGC
GACGAAGCAA TGAGCCTGTT TGAGCAGGCG AAAAAGATAC ATGCATCATA CCCGTACCCC
TGGGATCTGA AAGGGCGGTA CCTGTTCGAA AAGAGGCAGT ACCATGAGGC AATCGAGGCA
TATGAAGAGG CATTAGAGAA AAAACCCCAG GACCCGGATC TCCTTTTTTC CATGGGCCGG
GCTCTCATGA AGATCGGGGG GTATCATTCC GCCATCCAGT TCTTTAAGAA ATGCCTCAAG
ATTCGTCCAG ATTACACCGC TGCATGGCTG CTCCTTGGCA ACTCCTACAA GGTCTTAAAC
CAGTTTGATG AGGCGATAGA TGCCTACGAA GAAGCAATGG AACTGGATCC GGGCAGCACC
AAATATCGCA AGTATATAGC AGATGTCTAC CTCGTCATGG GAAAAGAGGC CCTGTATAAG
GAAGGAAAAC CTCAGGAGGC CATTGAATAC TTTGATAAGA CCATCCGGAT GATCGCAAAC
CACATAACCG CCTGGTTCTC AAAGGGAGTG GCCTATAAAA AACTCGGGGC ATACCGGAAT
GCAACGGCAT GTTTTCTCAA AGTGGTGGAG ATGGATCCCC AGAACGGGCA TGCCTACTAT
GAGATGGCCC AGATCCTTGA GAAGACCGGG AATAATGAAG AAGCGATCCG GTGCTACCTT
GAGACGATCC GGTGTGATCC ATCCCATACC GATGCCATGT ATAAGGTAGG AAACCTGCTC
ATGGAAGGCG GGGATTATAA AAATGCCATA GCCTATTTTG ACCGGGTTCT TGATAAGATT
CCCGAGTCTT CAGTAGCCTG GTTTGCAAAA GGAAAAGCAC TCCAAAGACG GGGACAGCAA
AAAGATGCTG ACCGGTGTTT TGAACGGGCT TCAAAACTCG CCACACGGTA G
 
Protein sequence
MGIRDWIGGQ DGSPATPYCQ KGETQLVKEK YEAAVQTFNR GIELDRSHPG CWVGMGKAFL 
GLGRYDRADD CFIRALDIDP ENPEALTMRA SVLRLIALQN QDPMRCLEAV EICNKTLKIH
PEYGPALHEK GMALWTLGKR DEAMSLFEQA KKIHASYPYP WDLKGRYLFE KRQYHEAIEA
YEEALEKKPQ DPDLLFSMGR ALMKIGGYHS AIQFFKKCLK IRPDYTAAWL LLGNSYKVLN
QFDEAIDAYE EAMELDPGST KYRKYIADVY LVMGKEALYK EGKPQEAIEY FDKTIRMIAN
HITAWFSKGV AYKKLGAYRN ATACFLKVVE MDPQNGHAYY EMAQILEKTG NNEEAIRCYL
ETIRCDPSHT DAMYKVGNLL MEGGDYKNAI AYFDRVLDKI PESSVAWFAK GKALQRRGQQ
KDADRCFERA SKLATR
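
Both sequences are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, together with a GC-content calculation of the kind summarised in the "GC content" field; the helper names are hypothetical, and only the first row of the gene sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First row of the gene sequence as printed in the record.
first_row = "ATGGGGATTC GAGACTGGAT TGGGGGACAG GACGGGAGCC CGGCAACGCC ATACTGTCAG"
dna = clean_sequence(first_row)
print(len(dna))         # 60 bases in one full row
print(gc_percent(dna))  # GC percentage of this row only, not of the whole gene
```

Pasting all rows of the gene sequence into `clean_sequence` gives the full-length string, whose length and GC percentage can then be compared with the values listed under Gene Information.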