Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mhun_2441 |
Symbol | |
ID | 3922429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanospirillum hungatei JF-1 |
Kingdom | Archaea |
Replicon accession | NC_007796 |
Strand | + |
Start bp | 2708858 |
End bp | 2711848 |
Gene Length | 2991 bp |
Protein Length | 996 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637898051 |
Product | PKD |
Protein accession | YP_503861 |
Protein GI | 88603683 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000010419 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.547756 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATCG CATCACTGTT AATGGTTCTG ATGTTCCTCC TGCTTGTGCC CTGCCTTGCA GAAGAGGAGA TCGTTGAATT TGAGCCTGTT GTAAGTTTCT CAGCAGAAGA ATTGATTGCG CAGGCACCAG GTATTGAAGA ACAGATGACC TTTGACTCAG TCTTTGACTG GGACCCACGG GTTGACGAAC AGTATGTTAC CTGGACACAA TACGAAGGAC AGGGAAAGGT CTGGTACTAT GATATGAAGT CCGGAAACCG ACAGATGGTC GCTAATATTA TGAGCGATCA GAATAATCCG GACATCTCCG GAGGTATTAT TGTTTGGGAT GATAACCGGC ACAAGAACTC TGACATCTAC TCATATAACA TACCGATGCA CAAGGAAAGC CCGGTGTATG TGGATACTAC CAATAAGTTC AGACCGGCTG TCTCCCAGGA TATGGTTGTG TTTGAGGACT TTGGCAATGA TGATATCAGG GATATCGGGA TGGTAAAGGT AGGTACCAGT GCCAGGCCGG TATACATCAA TCCAAATGAC AAAGATAAGG CCAACCCTGA CATCGATGGC GACTGGATTG TCTATCAGCA GCTTGATGAC AACAAAAACG ACTGGAACAT TTACCTCTAT AATTATAAGA CCGATAAAAC AGTTCAGGTG ACAAGAGATC TCCGTATCCA GCAGAATCCA CGAATCTCCG GTGATTACGT CGTCTGGGAG GACAACCGGA ACGGAAAATG GGACATCTTC ATGTATAACA TCAAAAAGGA CATGACAACC GCTGTCACCT TTGATGATTA TGATGATGTA GAGCCTGCCG TATCAGGATC CAAGATCGTG TGGACCAGGT ATGATCAGGA GAATAACAGC GATATCTACA TGGTAAACCT CCAGATCCCG AAGACCTATG CGGTTTGTGT CGGTCCTGGA AACCAGATAC GTCCTGATAT CTATGGTGAT AAGGTTGTTT GGCAGGATGA CCGGTTTGGC GGATGGGATA TCTTCATTTA CACCCTTGAA CCGGATACCC CCTTTACTCC ATACCAGTTC TATGGCCCGG TCACCTTCAA TTACCTCCCT GCTCCGGTAG GAACAGAGAT CATCGCAAAG ATTGACGAGG TAACCAAGGA CTCCATCGTC ACGACCCAGG AGGGATATTA TGGTGGAGCA GGAGGTTTTG CAGATCAGCT GACGGTAAAG ATTAACCAGG CTGATATCGG TCGGCAGATC TCATTCTGGA GTGGAGGAAT CCAGGGAGCA CCATCGGTTA CCGTCTCCGG TGACGGGAAG ATGATGGAGC AGCCGCTGAA TTTTATCTAT GCACAGCCGC TTCCGGACCT TCTGTTCTAC GGTTCGGTTA CCATTGATGG TCAGCCTGCT GCGAAGGGAA CAGTCCTGAA AGCAATGATT GACGGTGTTG TCAGAGGCAG TTATGAAATA ACCCAGACCG GTCAGTATGG TGGAGAATAT GAAACCGATC CGGCTCTGCG GGTCCCGATT ACTGTTGATG ACATCGGAAA GCACATCACC TTCATGGAGG GGGAGTATGC TGCAGGTCAG ACTTTCCAGA TAACCAGTGG GGGCAGATTC AGACAGGATC TTACCTTTAC CACGATTCCG CCAATGAGTC CGTATGAATT CTACGGATAT GTCCAGATTG ACGGTAAGTC AGCCCCGGTG GGAACCACCA TACAGGCAAA GATCGATAAT ACCGTCGTGA CCACCTATGT CACCAGGTAT GCAGGTTCGT ACGGGGCTCC GGGGCAGGCT CCTGACGATC CACGACTGAT CGTTCCGGTC ACTGAAGCTG ATGCAGGAAA GACGATCTCA TTCTGGATCG GAACCATCAG GGCAGGGGCA ACCCAAGTCA TCACCCGTGG CGGGGAGCGT ATCATCCGAA AAGACCTGGA CTTTGGCAGC TCACCACAAC CTGGTATTAA TGCAGACTTT ACCGCTTCTC CACGAAGCGG TCCTGCTCCC CTGACTGTTC AGTTTACTGA TCTCTCAACA GGCGGGCCGA CCATGTGGTT CTGGGACTTT GGTGATGGAG TCATTCCGGC AAATGCAACC TGCTCAGGAA CTGACTGTCA CAATATAGCA AACCCGATTC ACACCTATGC ACGGGAAGGC ACCTATACCG TCACTTTGAT AGCATCCAAC CAGTACGGAG CATCTGATAC TGAAGTAAAG ACCGGATATA TCACCGTTGG ACAGATTACA CCGGGCATAC AGGCGGACTT TACCGCATCC CCACGGAGCG GTTCAAAACC CCTGACGGTC CAGTTTACCG ATCTCTCAAC CGGTGGGCCG ACGATGTGGG CATGGAACTT TGGAGACGGA ACGACTGAAG GTCTCCTGGC AAATCCGACC CATACCTACG TAAATGACGG GACATACACC GTCACGCTGA CTGCATCAAA TCAGTTCGGA GCATCAGACA CGGAAGTAAA GTCAGGATAC ATCTGTGTTG GAGGCAGTCC ACAACCGGTT GATTCAATCA CGATTTATCC AGGCTGGAAC TTTATATCTG TTCCAAAGAA ACTGGCACCA GGAAAGGATA CTGCAGCAAT CTTCAGCCAT ATTCAGGTGG ACGGGCACAG TATCTTCCAG TATGATGCAG TGACCGGGCA GTGGATAACC ATGACCTCAT CAAGCCCAAT AAAGCCACTC GATGCAGTCT GGATCTACTC ACGGGTCGCA GACAAGGTCT CACTTACCTA TGACTCTGAT CCGCTGCAGA CACCTCCGAC CAAGGAGTTA CGAAAGGGTT GGAATGCAAT CGGATTTACC GGACTTGAAC CATTGGAAGC AAAGTTCACC TTCCTGTCAG TTCAGGATAA GTGGGTAAAC TGCCTTGGAT TCAACGAAGA GAAACAACAG TATGATCAGA TGATCATCAA GGGCAGGAAT GATGATGCAC GCCTGTATCC ATACAGCGGG TACTGGTTGT TCATGTCTGA TAACGGCACT CTTGCCGCTA TTTCAGCCTG A
|
Protein sequence | MKIASLLMVL MFLLLVPCLA EEEIVEFEPV VSFSAEELIA QAPGIEEQMT FDSVFDWDPR VDEQYVTWTQ YEGQGKVWYY DMKSGNRQMV ANIMSDQNNP DISGGIIVWD DNRHKNSDIY SYNIPMHKES PVYVDTTNKF RPAVSQDMVV FEDFGNDDIR DIGMVKVGTS ARPVYINPND KDKANPDIDG DWIVYQQLDD NKNDWNIYLY NYKTDKTVQV TRDLRIQQNP RISGDYVVWE DNRNGKWDIF MYNIKKDMTT AVTFDDYDDV EPAVSGSKIV WTRYDQENNS DIYMVNLQIP KTYAVCVGPG NQIRPDIYGD KVVWQDDRFG GWDIFIYTLE PDTPFTPYQF YGPVTFNYLP APVGTEIIAK IDEVTKDSIV TTQEGYYGGA GGFADQLTVK INQADIGRQI SFWSGGIQGA PSVTVSGDGK MMEQPLNFIY AQPLPDLLFY GSVTIDGQPA AKGTVLKAMI DGVVRGSYEI TQTGQYGGEY ETDPALRVPI TVDDIGKHIT FMEGEYAAGQ TFQITSGGRF RQDLTFTTIP PMSPYEFYGY VQIDGKSAPV GTTIQAKIDN TVVTTYVTRY AGSYGAPGQA PDDPRLIVPV TEADAGKTIS FWIGTIRAGA TQVITRGGER IIRKDLDFGS SPQPGINADF TASPRSGPAP LTVQFTDLST GGPTMWFWDF GDGVIPANAT CSGTDCHNIA NPIHTYAREG TYTVTLIASN QYGASDTEVK TGYITVGQIT PGIQADFTAS PRSGSKPLTV QFTDLSTGGP TMWAWNFGDG TTEGLLANPT HTYVNDGTYT VTLTASNQFG ASDTEVKSGY ICVGGSPQPV DSITIYPGWN FISVPKKLAP GKDTAAIFSH IQVDGHSIFQ YDAVTGQWIT MTSSSPIKPL DAVWIYSRVA DKVSLTYDSD PLQTPPTKEL RKGWNAIGFT GLEPLEAKFT FLSVQDKWVN CLGFNEEKQQ YDQMIIKGRN DDARLYPYSG YWLFMSDNGT LAAISA
|
| |