Gene Mhun_0004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMhun_0004 
Symbol 
ID3922949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanospirillum hungatei JF-1 
KingdomArchaea 
Replicon accessionNC_007796 
Strand
Start bp2056 
End bp5175 
Gene Length3120 bp 
Protein Length1039 aa 
Translation table11 
GC content47% 
IMG OID637895653 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_501504 
Protein GI88601326 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.763661 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATACCT CCTCACGCAC CTCATCAGAT ATTTTAAAAA TACTCCTTGT TGATGACGAG 
GATGTGGTTC TGCATGCTAC CCGTGAGTAT CTGAAGACAT GTTTCTCTTT TGATGTTGAT
ATTGCAGTAT CAGGATCCTC CGCCCTTGCC AGACTTTCAG AAAACGAGTA CGATGCCATC
ATTGCTGATT ATGAGATGGA ATCTATGAGC GGACTGGATC TCCTCAAGGC TATCAGGGAA
CGCGGGGATG ATACCCCATT TATTATCTTC ACCGGAAAGG GGAGGGAGGA GGTTGTTATT
GAGGCATTTG AACGCGGTGC AGACGGTTAT GTTCAGAAAG GCGGTGAAAT TCGATCCCAG
TTTGCAGAAC TGGCCCAAAA GGTGAGAACC GCAGTATGCA GTACACGATC GGCACGAGCA
GTTCATGAAC AGGATACCAG ACACCGGATA CTCTTTGAAA CGATGGGGCA GGGAGTGATA
TACCAGGACA GGGACGGTAC AATCATCGAA GCAAACCCTG CAGCAGAAAG AATCCTTGGG
ATCCCTCTGG ATCAACTTAC CGGAAAGCGA TGGGATGAGT TATCCCTACA CATGATTCAG
GAAGACAACT CTCCGTTTCC GGTAGAGAAG CAGCCATCAA CACAGGCCCT GCACACGGGA
AAGGTGATTG CCGATACCAT TATGGGTATC TGCCATTCGG ATGATGATGA ATGCCGATGG
ATTCACGTGG ATGCCACACC CCTGTTTCGG AATGGAGATG AGAAACCCTT TCAAGTTTAT
TCGATATTAT CAGACATTAC CGGACTTCGG GCAATAAAAG AACAGGTTAC CCTCCATGCA
GAGCGGATAT CAGCCTTCCT TGAACTGAAC CGGTTGAATG CTCCATCACG TGCAGAGCTT
CTGGATTATG CAATAAATGC ATGTCTGAAG ATTACTCAGA GTCAGTTCTC ATTTATTGGT
CTTGTTAATC CTGATGAGAC CGAAGTCACC ATCCATGCCT GGTCACCTTC GGTTATAGAG
ACTTGCAGGG TACAGGAACA GAAGAAGAAT TTCCAGGTTG CGGATTCAGG AATCTGGGGG
GATGTGCTGA GAACGCGAAA GCCTCTTATC CTGAATTCAT TTCAGGACCT AGACTCCGGT
AAAAAAAGAT TTGTAAAAGG CCATGTTCCT ATCACCAGGT TTTTGGAGGT TCCGGTTTTT
GACGGGGATA GGATCGTTGC TCTTCTGGCG GTAGCAAATA AAACCCGTCC ATATAATGAG
GACGACAGTC AGGCACTCAT TGCCTTTGGA AATGAATTAT GGGGGATTAT ACATCAAAGA
GAGACCAGGG AGGCTTTATT TCTTAAGAAT TATGCAATTG AATCATCGAT GAACGGGATT
GCTATCGCAG ACCTCTCAGG AATACTGACT TATGTAAACC CTGCTTTTTT GTCAATCTGG
AATTATCATC ATCGAGATGA GGTGATCGGA AAGAGTGCGA TTACATTCTG GAAGAACCAT
GACCAGGCTG CAGAAGTTAT CGAGGCAATC AAAACTGCAA AACAATGGCA GGGAGAGATG
GAAGCTGAAC GGACAGATGG TTCTCATGCG ATCCTCTTAG TTTCAGCCCA TTCCATACTG
GATGAATCAG GCACCCCTCA GGCCATTATG GCATCTTTCA TTGATATTAC TGATAAAAAA
ATGGCAGAAC AGGAACTCAT TCAGGCAAAT AATATCATTG AGGGTATGCT TAATGGCATC
CATGATATCG TGGGCCTGCA GCTTCCTAAC CATTCGGTTG TCAGGTATAA TAAAGCAGGG
TATGAGGCTC TTGGTCTATC CGCTTCTGAA GTTGTGGGAA AGAAATGCTA TGAACTCATC
GGCAGGGATA CCCCCTGTGT TCCTTGTGCG ACCAGTGAAG CTATTGCACG AGGACAGTTA
TATACAATTC AGAGATATAG CCCTGAAATT GGAAAGTTCC TTGAATGCAC CAGTAATCCT
ATCCTCAACG ATCAGGGAAA GCCAGAGCTT ATTGTTGAAC TCCTTCATGA TATCACCAGT
CAGAAAAAGG CTGAAGAGGC ACTCAGGGAG AGTGAGGAAC GGTTTCGCCA GCTCTTCAAC
AATGCATCAG ATGCCATTTT CCTCCATGAA CTGCATGATG ATAACTCCCA TGGGCGATTC
ATTGAGGTCA ATGATGCAGC TTGCAGGGCT CTTGGGTATT CACATGAGGA ACTCCTTGAG
AAAGCGGTGT TTGATATTAA TACCCTATCT GACAAAGAGG CTGCTCCGGA TATCACCAGG
CAGCTTATCA GTGAAGGACA TGCCTTATTT GAAGGCACCC ATGTCAGGAA AGACGGCTCT
ATCTTTCCGG TCGAGGTTTC AGCACATCTC TTCGAGCTTC ATGGATCACG GGTGGTCCTT
TCCATCTGCC GGGATATCAC TGAACGGAAA CGGAGCGAAA AGTCAATAAT CGAAGCGAAC
CGGAAACTTC AGCTTCTATC CAGTATCACC AGACACGATA TCTTAAATTC ACTTGGGGGT
CTTCTCCTCT TCCTTGATAG TATTCCACGA TCAGATCTCT CGCCTGTTGT CAGAGAAAAC
CTCCACCGGA TTGAGGCATA CGCACTGACT ATTCAAAAAC AGATCAAATT TACCCATGAT
TATCAGGTTC TGGGACTGCA AAAGGCAATC TGGCAGGATC TTTCCCGCTG TTTTCAGAAA
GCATCCGATC AATTTGATAC AGGGAGGATA ACGGTTGAAG AACATCTGTC AGGGATAGAG
ATCTTTGCTG ATCCCATGCT GGAGAAGGTC ATATACAACC TGATTGATAA CGCCCTTCGG
TATGGTGGCG CCAAGCTCTC GAAGATATTC GGATATTACC GAACGGACGG TGAAGATCTG
ATATGGATTA TTGAGGATGA CGGAGCGGGT GTTGCCCCGG AGATGAAAGA TCAGATCATG
GAGAGAGGCG TCGGGTGTAA TACCGGATTC GGACTCTTTC TTTCAGCAGA GATCCTCTCC
ATCACCGGCA TGACCATCAC CGAAACCGGC ACAGATGGGG AGGGGGCCAG ATTTGAGATC
CGAGTTCCAA AAGGGATGTT CCGTCTTGGA ACAGATCAGA GACCGGAGGA GAGAGGATGA
 
Protein sequence
MHTSSRTSSD ILKILLVDDE DVVLHATREY LKTCFSFDVD IAVSGSSALA RLSENEYDAI 
IADYEMESMS GLDLLKAIRE RGDDTPFIIF TGKGREEVVI EAFERGADGY VQKGGEIRSQ
FAELAQKVRT AVCSTRSARA VHEQDTRHRI LFETMGQGVI YQDRDGTIIE ANPAAERILG
IPLDQLTGKR WDELSLHMIQ EDNSPFPVEK QPSTQALHTG KVIADTIMGI CHSDDDECRW
IHVDATPLFR NGDEKPFQVY SILSDITGLR AIKEQVTLHA ERISAFLELN RLNAPSRAEL
LDYAINACLK ITQSQFSFIG LVNPDETEVT IHAWSPSVIE TCRVQEQKKN FQVADSGIWG
DVLRTRKPLI LNSFQDLDSG KKRFVKGHVP ITRFLEVPVF DGDRIVALLA VANKTRPYNE
DDSQALIAFG NELWGIIHQR ETREALFLKN YAIESSMNGI AIADLSGILT YVNPAFLSIW
NYHHRDEVIG KSAITFWKNH DQAAEVIEAI KTAKQWQGEM EAERTDGSHA ILLVSAHSIL
DESGTPQAIM ASFIDITDKK MAEQELIQAN NIIEGMLNGI HDIVGLQLPN HSVVRYNKAG
YEALGLSASE VVGKKCYELI GRDTPCVPCA TSEAIARGQL YTIQRYSPEI GKFLECTSNP
ILNDQGKPEL IVELLHDITS QKKAEEALRE SEERFRQLFN NASDAIFLHE LHDDNSHGRF
IEVNDAACRA LGYSHEELLE KAVFDINTLS DKEAAPDITR QLISEGHALF EGTHVRKDGS
IFPVEVSAHL FELHGSRVVL SICRDITERK RSEKSIIEAN RKLQLLSSIT RHDILNSLGG
LLLFLDSIPR SDLSPVVREN LHRIEAYALT IQKQIKFTHD YQVLGLQKAI WQDLSRCFQK
ASDQFDTGRI TVEEHLSGIE IFADPMLEKV IYNLIDNALR YGGAKLSKIF GYYRTDGEDL
IWIIEDDGAG VAPEMKDQIM ERGVGCNTGF GLFLSAEILS ITGMTITETG TDGEGARFEI
RVPKGMFRLG TDQRPEERG