Gene Mhun_1449 details


Gene Information

Locus tag: Mhun_1449
Symbol:
ID: 3924507
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanospirillum hungatei JF-1
Kingdom: Archaea
Replicon accession: NC_007796
Strand:
Start bp: 1669412
End bp: 1672615
Gene Length: 3204 bp
Protein Length: 1067 aa
Translation table: 11
GC content: 49%
IMG OID: 637897080
Product: tetratricopeptide TPR_2
Protein accession: YP_502902
Protein GI: 88602724
COG category:
    [N] Cell motility
    [R] General function prediction only
    [U] Intracellular trafficking, secretion, and vesicular transport
COG ID:
    [COG3063] Tfp pilus assembly protein PilF
    [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
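
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon while the protein length does not. The function names are illustrative and not part of the database.

```python
# Values copied from the record above.
START_BP = 1669412
END_BP = 1672615


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3204
print(protein_length(length_bp))  # 1067
```

Both results agree with the Gene Length (3204 bp) and Protein Length (1067 aa) fields.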


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGGGATGG CGTTCCTCAA AAAGGATCAG CAGGACCGTG CGTACTATGC ATTCAAAGAA 
GGTCTTCGCA TCGATCCTGA AAACTCAGAA CTCTGGTATC AGACCGGAAT AGTTCTGGCA
AAGCAGGAGC GTCACCGGGA TGCCATGAAG ATGTACCGGA ACGCTCTGAA GTACGATCCG
GATAATCTGC AGGCCCAGCT TCGCCTTGGT ATGTCGATGC ATGAAGTCGG CATGTATAAA
GAGGCCATCC CTGTTTTAAC CCGGCTTGTC GAGCGTGAAT CTGAGAATGA TCAGGGATGG
ATGTACCGGG GTGCCTGCTA TCTTGCACTT GGCAAGTTTC GTGAAGCTCT CACCGACCTT
GACATGGCGA TTGACTGGGG GGCGGAACAG GCGGAAGTCT GGGTTTTAAA AGGCTATTGT
CATGTCCAGC TTGGAGAATA TACCGACGCA CTTGATGCCT ATTACTTTGC CATTGAGATA
AATCCAGATG ATCCGGTTGC ATTTTTCGGC CTTGCAAACA CTCTTGTCCA TCTTGAAGAG
CGGGAACAGG CGATCGATGC CCTGAAAAAA GCCCTAAAGA TCGATCCGGA CTATGCTGAT
GCCTGGCGAA TGATAGCAGA CCTGCTTGCA GATTCAGGTG ATATCCCTCA GGCTACCAGT
GCATATGAGC ACGTGCTGAA GCTTGAACCC TGGGATCTTG ATACCCGGTA CTCGTACTCA
ATCCTGAAAG CCGAGTTATC TGATGATAAG GCTGCCGTCA CCGATATATT AAACCAGATC
ATCAATGAAG GCCAGGAGTC GGTCACCTTT TACAATAATC ATGGCCTTAC TTTGATGCAT
CTGAAGAAGT ATGATAGTGC TCTCCAGGCA TTTAACCGGG CATTGCAACT GGGAAAAGAC
AATCCCTCCG TCTGGCATAA TCATGGTGCT GCCCTCTATA AACTTAAATG GTACAAGGAT
GCCATGAAAT CCTTCCAGCA GTCCTTAAAA CTCAATCCGA AAAATGTCAA CTCCTGGGTT
GGTATCGGAA TGGTCGCCGT TGCCCAGTAT GAATTCAGCC GGGCAGCAGC TGCATACACC
CATGCAGCCC AGCTCAGTCC CCGGAAAGTA AACATCTGGA TCATGCTTGG GGATACCCAG
ATTGAACGCG GACAGTACCA GGAAGCAATT GCTGCATATG AAAAAGCCCT GGAACTTGAT
CCTGAAAACC CAACCGCCTG GAATCAGCGT GGACTTGCTC TCCGGCTGCT TGACAGTCAC
CCGGCGGCTC TTGAATCATT CGAACATGCT GCAGAGACAA AGAATGCAAA GCCGGAGTCA
TGGATTAACC ATGCGATTAC CAGCTTTGAG CTGGGAGAAT ATCATCAGTC GGTTCATTCG
TTTGAGCGTG CATGTAAATT CGGTCCGATC CCCTCTGACA GCTGGCTGAT ATACCTGAAT
GCTCTGGCCT ATGAACATGA AAACCAGAAA CTTATCAAGG CATCAGAGCG GTTTATTGAA
TTGTTCGGAC CTGATGCAGA GGTTCTCTTC CTTCTTGGTG TTGCCCAGTA TGAGTTGAAA
CAATATGAGG CAGCCCTGCA TCATTTCAGA GAGACATTAA AGCTGGACAA GGACCATACT
GATGCCCTTT TCTGGGCCGG TCTGACCCTC CTTGAGCTCT ATCAGTTTAC CGAGGCAATA
ACCGCATTTG AGGGAGTGGA AGATAATCAT CCGGATGATG ATCAGGCCTG GTATTATCAT
GGGAAGGCAC TTGTCGCATT ACATGAACTT CAGAAAGCAG AATTTATTCT GAAACATTCG
CTCCAGTTAA ATGATCAGTC TGCTGATGCC TGGTATCTGC TTGCGGATGT CCAGCATACC
AGGAAGGCGT ATGCAGATGC CCTGCAGTCT GTTGGAAAGG CCCTTGATCT CTCTCCGTCA
AACAATGAGA TATTAAAACT GAAGGCAAAG ATCCAGGTTG CTCTGGGTTC ATTCCGGGGT
GCCTGTCAGA CATATGCTGC CATCAGTGAG CCTGATGCGA GTGATACTGA GGTTCTTACC
GGATACATGC GTGCTTTGTA TCACACCGGT CAGTTCAGGG AAGCATATAG TCGTGTCACC
CGCCTGCTGG TCAAGGATGA GAAGAATCCT GATCTCTGGA GGATGCGGGC AGAGATTGAA
CGTGCCCAAG GGTTGTTTGA TGAGGCTGCG AATGCTCTTA CTGAGGCGTG TAAATATGCC
CCGAATAATA AAAAACTCTT ATCTTTGCAG GCGATTGTCC TCTATGAAGC AGAGAAATAT
CCGGAGGCCA TATCGGTTAT CGACAAGGTC CTCGGGTTTG ATCCGCTGAA TGGTGAGTTA
TGGAAACGGA AAGGGGCGGC CCATGATAGT CTGCAGCAAT ATGATCAGGC TTGTGAATCC
TACCTGAAAG CAGCAGAGTT CCTACAGGAT GATCCAGATC TCATACGAAA ACTGGGGGTA
GCCCTGTATA AGACCGGGAA ATGTGACAAG TCTCTTCCAC GGTTTGACCA GTATCTTGAA
GTGGTTCCGG ATGATCCTGA GATCTGGGAG ATGAAAGGGA AGGCTCTGTT TCATCAGGGG
AAGTATGAAT CTGCAAGTGC TGCCCTCTCC CAGGCGATCC TTTATCGTCC GGATGATATG
GATCTTCTCT TCAGGTATGC ACAAAGTCTG ATTAAATCCG GTGAGTTGCT GACGGCAATA
CCTCCGCTGG ATCAGGTTAT CGAACAGAAC CCGGAGAATG CCGAAGCCTG GAAATTGAAG
GCAGAGATTG AACAGACGCT TGGGAGAGAG GATGAAGCAG CTCAGGCGGT GGAAGAGGCA
CTCCGGCAGA TCCCTGATGA TCCCGGTCTG ATGCTGGCAA GGGTAAAATC CCTCTATGAG
GCTGATTCAT ATGCAGAAGG ACTGTCACTT GTTCGCCGGC TCATTGATAA AACTCCGGAG
AGTACTGAGG CATGGTCGTT GTATGCAGAG CTGCTCTGGA TGACTGCAGA TCATAATGCC
GCTGCGGCAG CATTTGACCG GATTCTGGCT CTTGATGATA CCAATGCCAA AGCCTGGTTT
TTAAAAGGGG ATTCGTTACA AAATGCAGGC CGGTTTGAAG AGGCTGCCGT TGCCCATGAG
CGGGCGTTCT CTTTAGGTGG TGACCCTTCT GTCGGACTGA TGCTCTCAAG GAAGATGCGG
TACCTACAGC AGAAGAAGGC CTGA
 
Protein sequence
MGMAFLKKDQ QDRAYYAFKE GLRIDPENSE LWYQTGIVLA KQERHRDAMK MYRNALKYDP 
DNLQAQLRLG MSMHEVGMYK EAIPVLTRLV ERESENDQGW MYRGACYLAL GKFREALTDL
DMAIDWGAEQ AEVWVLKGYC HVQLGEYTDA LDAYYFAIEI NPDDPVAFFG LANTLVHLEE
REQAIDALKK ALKIDPDYAD AWRMIADLLA DSGDIPQATS AYEHVLKLEP WDLDTRYSYS
ILKAELSDDK AAVTDILNQI INEGQESVTF YNNHGLTLMH LKKYDSALQA FNRALQLGKD
NPSVWHNHGA ALYKLKWYKD AMKSFQQSLK LNPKNVNSWV GIGMVAVAQY EFSRAAAAYT
HAAQLSPRKV NIWIMLGDTQ IERGQYQEAI AAYEKALELD PENPTAWNQR GLALRLLDSH
PAALESFEHA AETKNAKPES WINHAITSFE LGEYHQSVHS FERACKFGPI PSDSWLIYLN
ALAYEHENQK LIKASERFIE LFGPDAEVLF LLGVAQYELK QYEAALHHFR ETLKLDKDHT
DALFWAGLTL LELYQFTEAI TAFEGVEDNH PDDDQAWYYH GKALVALHEL QKAEFILKHS
LQLNDQSADA WYLLADVQHT RKAYADALQS VGKALDLSPS NNEILKLKAK IQVALGSFRG
ACQTYAAISE PDASDTEVLT GYMRALYHTG QFREAYSRVT RLLVKDEKNP DLWRMRAEIE
RAQGLFDEAA NALTEACKYA PNNKKLLSLQ AIVLYEAEKY PEAISVIDKV LGFDPLNGEL
WKRKGAAHDS LQQYDQACES YLKAAEFLQD DPDLIRKLGV ALYKTGKCDK SLPRFDQYLE
VVPDDPEIWE MKGKALFHQG KYESASAALS QAILYRPDDM DLLFRYAQSL IKSGELLTAI
PPLDQVIEQN PENAEAWKLK AEIEQTLGRE DEAAQAVEEA LRQIPDDPGL MLARVKSLYE
ADSYAEGLSL VRRLIDKTPE STEAWSLYAE LLWMTADHNA AAAAFDRILA LDDTNAKAWF
LKGDSLQNAG RFEEAAVAHE RAFSLGGDPS VGLMLSRKMR YLQQKKA
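
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is hedged: it assumes the sequence is given 5' to 3' on the coding strand, and it uses the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons). The helper name `translate` is illustrative. Alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino-acid assignments ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGGGGATGG CGTTCCTCAA AAAGGATCAG"))  # MGMAFLKKDQ
# Last 24 bases of the gene sequence above, ending in the TGA stop codon.
print(translate("TACCTACAGC AGAAGAAGGC CTGA"))        # YLQQKKA
```

The two fragments reproduce the first ten and last seven residues of the protein sequence listed above.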