Gene Mhun_2441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMhun_2441 
Symbol 
ID3922429 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanospirillum hungatei JF-1 
KingdomArchaea 
Replicon accessionNC_007796 
Strand
Start bp2708858 
End bp2711848 
Gene Length2991 bp 
Protein Length996 aa 
Translation table11 
GC content49% 
IMG OID637898051 
ProductPKD 
Protein accessionYP_503861 
Protein GI88603683 
COG category[R] General function prediction only 
COG ID[COG3291] FOG: PKD repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000010419 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.547756 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCG CATCACTGTT AATGGTTCTG ATGTTCCTCC TGCTTGTGCC CTGCCTTGCA 
GAAGAGGAGA TCGTTGAATT TGAGCCTGTT GTAAGTTTCT CAGCAGAAGA ATTGATTGCG
CAGGCACCAG GTATTGAAGA ACAGATGACC TTTGACTCAG TCTTTGACTG GGACCCACGG
GTTGACGAAC AGTATGTTAC CTGGACACAA TACGAAGGAC AGGGAAAGGT CTGGTACTAT
GATATGAAGT CCGGAAACCG ACAGATGGTC GCTAATATTA TGAGCGATCA GAATAATCCG
GACATCTCCG GAGGTATTAT TGTTTGGGAT GATAACCGGC ACAAGAACTC TGACATCTAC
TCATATAACA TACCGATGCA CAAGGAAAGC CCGGTGTATG TGGATACTAC CAATAAGTTC
AGACCGGCTG TCTCCCAGGA TATGGTTGTG TTTGAGGACT TTGGCAATGA TGATATCAGG
GATATCGGGA TGGTAAAGGT AGGTACCAGT GCCAGGCCGG TATACATCAA TCCAAATGAC
AAAGATAAGG CCAACCCTGA CATCGATGGC GACTGGATTG TCTATCAGCA GCTTGATGAC
AACAAAAACG ACTGGAACAT TTACCTCTAT AATTATAAGA CCGATAAAAC AGTTCAGGTG
ACAAGAGATC TCCGTATCCA GCAGAATCCA CGAATCTCCG GTGATTACGT CGTCTGGGAG
GACAACCGGA ACGGAAAATG GGACATCTTC ATGTATAACA TCAAAAAGGA CATGACAACC
GCTGTCACCT TTGATGATTA TGATGATGTA GAGCCTGCCG TATCAGGATC CAAGATCGTG
TGGACCAGGT ATGATCAGGA GAATAACAGC GATATCTACA TGGTAAACCT CCAGATCCCG
AAGACCTATG CGGTTTGTGT CGGTCCTGGA AACCAGATAC GTCCTGATAT CTATGGTGAT
AAGGTTGTTT GGCAGGATGA CCGGTTTGGC GGATGGGATA TCTTCATTTA CACCCTTGAA
CCGGATACCC CCTTTACTCC ATACCAGTTC TATGGCCCGG TCACCTTCAA TTACCTCCCT
GCTCCGGTAG GAACAGAGAT CATCGCAAAG ATTGACGAGG TAACCAAGGA CTCCATCGTC
ACGACCCAGG AGGGATATTA TGGTGGAGCA GGAGGTTTTG CAGATCAGCT GACGGTAAAG
ATTAACCAGG CTGATATCGG TCGGCAGATC TCATTCTGGA GTGGAGGAAT CCAGGGAGCA
CCATCGGTTA CCGTCTCCGG TGACGGGAAG ATGATGGAGC AGCCGCTGAA TTTTATCTAT
GCACAGCCGC TTCCGGACCT TCTGTTCTAC GGTTCGGTTA CCATTGATGG TCAGCCTGCT
GCGAAGGGAA CAGTCCTGAA AGCAATGATT GACGGTGTTG TCAGAGGCAG TTATGAAATA
ACCCAGACCG GTCAGTATGG TGGAGAATAT GAAACCGATC CGGCTCTGCG GGTCCCGATT
ACTGTTGATG ACATCGGAAA GCACATCACC TTCATGGAGG GGGAGTATGC TGCAGGTCAG
ACTTTCCAGA TAACCAGTGG GGGCAGATTC AGACAGGATC TTACCTTTAC CACGATTCCG
CCAATGAGTC CGTATGAATT CTACGGATAT GTCCAGATTG ACGGTAAGTC AGCCCCGGTG
GGAACCACCA TACAGGCAAA GATCGATAAT ACCGTCGTGA CCACCTATGT CACCAGGTAT
GCAGGTTCGT ACGGGGCTCC GGGGCAGGCT CCTGACGATC CACGACTGAT CGTTCCGGTC
ACTGAAGCTG ATGCAGGAAA GACGATCTCA TTCTGGATCG GAACCATCAG GGCAGGGGCA
ACCCAAGTCA TCACCCGTGG CGGGGAGCGT ATCATCCGAA AAGACCTGGA CTTTGGCAGC
TCACCACAAC CTGGTATTAA TGCAGACTTT ACCGCTTCTC CACGAAGCGG TCCTGCTCCC
CTGACTGTTC AGTTTACTGA TCTCTCAACA GGCGGGCCGA CCATGTGGTT CTGGGACTTT
GGTGATGGAG TCATTCCGGC AAATGCAACC TGCTCAGGAA CTGACTGTCA CAATATAGCA
AACCCGATTC ACACCTATGC ACGGGAAGGC ACCTATACCG TCACTTTGAT AGCATCCAAC
CAGTACGGAG CATCTGATAC TGAAGTAAAG ACCGGATATA TCACCGTTGG ACAGATTACA
CCGGGCATAC AGGCGGACTT TACCGCATCC CCACGGAGCG GTTCAAAACC CCTGACGGTC
CAGTTTACCG ATCTCTCAAC CGGTGGGCCG ACGATGTGGG CATGGAACTT TGGAGACGGA
ACGACTGAAG GTCTCCTGGC AAATCCGACC CATACCTACG TAAATGACGG GACATACACC
GTCACGCTGA CTGCATCAAA TCAGTTCGGA GCATCAGACA CGGAAGTAAA GTCAGGATAC
ATCTGTGTTG GAGGCAGTCC ACAACCGGTT GATTCAATCA CGATTTATCC AGGCTGGAAC
TTTATATCTG TTCCAAAGAA ACTGGCACCA GGAAAGGATA CTGCAGCAAT CTTCAGCCAT
ATTCAGGTGG ACGGGCACAG TATCTTCCAG TATGATGCAG TGACCGGGCA GTGGATAACC
ATGACCTCAT CAAGCCCAAT AAAGCCACTC GATGCAGTCT GGATCTACTC ACGGGTCGCA
GACAAGGTCT CACTTACCTA TGACTCTGAT CCGCTGCAGA CACCTCCGAC CAAGGAGTTA
CGAAAGGGTT GGAATGCAAT CGGATTTACC GGACTTGAAC CATTGGAAGC AAAGTTCACC
TTCCTGTCAG TTCAGGATAA GTGGGTAAAC TGCCTTGGAT TCAACGAAGA GAAACAACAG
TATGATCAGA TGATCATCAA GGGCAGGAAT GATGATGCAC GCCTGTATCC ATACAGCGGG
TACTGGTTGT TCATGTCTGA TAACGGCACT CTTGCCGCTA TTTCAGCCTG A
 
Protein sequence
MKIASLLMVL MFLLLVPCLA EEEIVEFEPV VSFSAEELIA QAPGIEEQMT FDSVFDWDPR 
VDEQYVTWTQ YEGQGKVWYY DMKSGNRQMV ANIMSDQNNP DISGGIIVWD DNRHKNSDIY
SYNIPMHKES PVYVDTTNKF RPAVSQDMVV FEDFGNDDIR DIGMVKVGTS ARPVYINPND
KDKANPDIDG DWIVYQQLDD NKNDWNIYLY NYKTDKTVQV TRDLRIQQNP RISGDYVVWE
DNRNGKWDIF MYNIKKDMTT AVTFDDYDDV EPAVSGSKIV WTRYDQENNS DIYMVNLQIP
KTYAVCVGPG NQIRPDIYGD KVVWQDDRFG GWDIFIYTLE PDTPFTPYQF YGPVTFNYLP
APVGTEIIAK IDEVTKDSIV TTQEGYYGGA GGFADQLTVK INQADIGRQI SFWSGGIQGA
PSVTVSGDGK MMEQPLNFIY AQPLPDLLFY GSVTIDGQPA AKGTVLKAMI DGVVRGSYEI
TQTGQYGGEY ETDPALRVPI TVDDIGKHIT FMEGEYAAGQ TFQITSGGRF RQDLTFTTIP
PMSPYEFYGY VQIDGKSAPV GTTIQAKIDN TVVTTYVTRY AGSYGAPGQA PDDPRLIVPV
TEADAGKTIS FWIGTIRAGA TQVITRGGER IIRKDLDFGS SPQPGINADF TASPRSGPAP
LTVQFTDLST GGPTMWFWDF GDGVIPANAT CSGTDCHNIA NPIHTYAREG TYTVTLIASN
QYGASDTEVK TGYITVGQIT PGIQADFTAS PRSGSKPLTV QFTDLSTGGP TMWAWNFGDG
TTEGLLANPT HTYVNDGTYT VTLTASNQFG ASDTEVKSGY ICVGGSPQPV DSITIYPGWN
FISVPKKLAP GKDTAAIFSH IQVDGHSIFQ YDAVTGQWIT MTSSSPIKPL DAVWIYSRVA
DKVSLTYDSD PLQTPPTKEL RKGWNAIGFT GLEPLEAKFT FLSVQDKWVN CLGFNEEKQQ
YDQMIIKGRN DDARLYPYSG YWLFMSDNGT LAAISA