Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_1421 |
Symbol | |
ID | 7270026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 1463510 |
End bp | 1466302 |
Gene Length | 2793 bp |
Protein Length | 930 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643570051 |
Product | NHL repeat containing protein |
Protein accession | YP_002466473 |
Protein GI | 219852041 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | [TIGR01634] phage tail protein, P2 protein I family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.368909 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATCTT CATATTCTTT GTATATTATC TGTGTCCTGT TCCTCCTCTG TAGCAGCGTC CAGGTCGTTT CGGCTGAAGG AGGGTATGCG TACGCCACAC AATGGGGCAG TTCAGGTTCT GGAGATGAAC AGTTCTCCTC TCCATCTGGT GTTGCAGTGG ATAGCGTCGG GAACGTCTAC GTCGCTGACG TGGGCAACAA CCGGATCCAG AAGTTCACGT CGACCGGGAC CTTCATCAAA AAATGGGGCA GTTCAGGTTC TGGAGATGGA CAGTTCTCCT CTCCATCTGG TGTCGCAGTA GATAGTGCTG GTAATGTCTA CGTGGCTGAC ACGGGAAATA ACCGGATCCA AAAGTTCACG TCGATGGGGA TATTTATCAA ACAATGGGGC AGTTCGGGTT CTGGAAACGG ACAGTTCTTC TCTTCTCCAT TTGGTGTCGC AGTAGATAAT GCTGGTAATG TCTACGTGGC TGACACGGGA AATAACCGGA TCCAGAAGTT CACCTCTGAT GGTGCGTTTG TCACCAATTG GTGGGTGAAC GAACCAAATG GACCAGATGG TGTTACAGTG GATAGCGCCG GCAATGTCTA TGTGGTTGAC GTATCCTATA TTGACCGGGT TCAGAAGTTT ACATCATCTG GCACGTTCAT CGCGAAATTT GGCAGTGATT ACATTCATGA CAGCGCAATG AGTTATCACA CTAGTGTCGC AGTGGACAAC GCTGGAAATG TATACTTCAG GGGGCCTGTT AGTGGAATCC AAAAGTTCTC GTCGACTGGG GCACCTATAA CAAAATGGGG TAATTATGGT TCAGGAATGT ACTATGGTCC GGGTGATGTG GCAGTGGACA GCACCGGAAA TGTATATGTC AGCGACACCC AGAATGCTCA GATTGTAAAG TTTACCCCTG ATATCCCTCT CATTCCCGGT TTCACCGCGA CACCGACCAC AGGCGCCGCC CCGCTGACTG TGCAGTTCAC TGACACCACA ACCGGGAGAC CGACCTCCTG GTACTGGAAT TTCGGGGATG GGTACGCCTC CGGTGCTCAG AACCCGAGCC ATCTGTATAG TACTGCCGGC ACTTACTCGG TCACGCTGAC TGCCACCAAT GCTGTTTCGG GGAGCAAATC GATCACAAAG ACAGGGTACA TCACCGTCAC AGAAGCCCCA GCCATTACAC CGGTCGCAGA CTTCACGGCA ACCCCATCCA ACGGTGCTGC ACCATTGGCA ATCCAGTTCA CTGACCGGTC GACAAATGCA AAACAGTGGT CCTGGACCTT CGGCGACGGC ACAACCTCGA CCGAGCAGCA CCCCTCACAT ACCTACACCA CTGCCGGTAC ATACACCGTC GTGCTCACCG TCAGAAATGC AGCCGGCCAA TCGAACACCA AGACCCAGAC GAACCTGATC TCAGTTACAT CCCCAGTCAG TGAAACACCG GCTGCAGACT TCACGGCCAC CCCGACATCG GGAACCGGCC CGCTCACCGT CAGGTTTACA GATACCTCAA CCGGTGTCCC GACAGGCTGG TACTGGTTCT TTGGGGATGG TTACTGCGCA TTCGAGAAGA ACCCCTCACA CATCTTCGCA CAGGCCGGCA CCTATACGGT ACAGCTCTAT ACGTTCAATG CAAACGGTAA CTCGCTGAAG ACAAAGACGG ACTATATTAC GGTCAGCGCG GTCGGTGGGC TTAATGCAAG TTTTGCTGCC ACGCCGACCT CGGGGACAGC CCCTCTTAAC GTTCAGTTCA CAGACACGTC GACTGGAGGA GCGACCTTCT GGTCCTGGAA TTTTGGTGAC GGGGCAGCCT CCACTGATCA GAGTCCGAGC CACACCTACT CTCTGGCTGG CACGTACACG ACCTCATTGA CAGTCCGGAA CAGTTCTGGT CTGACGAGCA TCAAGGAAGG GACGATCACC GTCACCGCCC CACAGCAGAC CCTCCAGGCA GCGTTCACGA TTAATACGCA GACTGTGATA GCCGGGCAGA CAACAGTAAC CGGCATCGAT ACATCCACCG GGTCCCCAGC CACCTGGTAC TGGGACTTTG GTGATGGGTA TGCGTCATCG GCCCGGAACA TCAACCACGT CTACACCACC GCCGGCTCGT ACACCCTGAG TCTGACCGTC ACGAGCGGCT CACAGACCAG CACGACCAGC AAAACGATCA CCGTAACCGG TGAATCTGTA ATAACACCTC TGGCCAATTT CACGGTCACA CCCCAGGGGG GAGTCGGCTC GATGGGTATC CTGGTCCTCG ACACCTCGGT GAACGTGACC TCGGTGTTGT ATGACCTCGG CGACGGCACG ACCACCACCT ATTCGAACTT CCGGTACACC TACTGGCAAC CTGGCACGTA TACGATCAAA CAGACCGCGA CCAGTGCAAC CGGGTCCTCG ATAAAGACGA TTACCGTGAC CGTACCAGCC ACGAATTCAC CGGTCTCACC GACGGTAACC ATGACTATGA CCCCGACCGT CTCACCAACA GGGACCGTGA GTGTGACCCT GACTCCCACT CCAACCGATC AACCCCAAAC CAGGGCAGCA AGCTTCAACG TTAGTCCGAC CTCCGGCAAG AGATCATTCA CTACTGCACT CATAGACACC ACTACCGGCG GTAACCCAGT CTCCTGGAAG TGGACCTGTG GAAACGGGCA GTCCTTCTCT GGGAAGAGTG TTGGTTTAAA CAGAATCTGG TACAACAATG CAGGTACCTA CACCATCATC CTGACCGTGA CAGATCAGGA CGGTTCAACA AGGACCGCCA CGCATACTGT CACTGTCCTG TGA
|
Protein sequence | MKSSYSLYII CVLFLLCSSV QVVSAEGGYA YATQWGSSGS GDEQFSSPSG VAVDSVGNVY VADVGNNRIQ KFTSTGTFIK KWGSSGSGDG QFSSPSGVAV DSAGNVYVAD TGNNRIQKFT SMGIFIKQWG SSGSGNGQFF SSPFGVAVDN AGNVYVADTG NNRIQKFTSD GAFVTNWWVN EPNGPDGVTV DSAGNVYVVD VSYIDRVQKF TSSGTFIAKF GSDYIHDSAM SYHTSVAVDN AGNVYFRGPV SGIQKFSSTG APITKWGNYG SGMYYGPGDV AVDSTGNVYV SDTQNAQIVK FTPDIPLIPG FTATPTTGAA PLTVQFTDTT TGRPTSWYWN FGDGYASGAQ NPSHLYSTAG TYSVTLTATN AVSGSKSITK TGYITVTEAP AITPVADFTA TPSNGAAPLA IQFTDRSTNA KQWSWTFGDG TTSTEQHPSH TYTTAGTYTV VLTVRNAAGQ SNTKTQTNLI SVTSPVSETP AADFTATPTS GTGPLTVRFT DTSTGVPTGW YWFFGDGYCA FEKNPSHIFA QAGTYTVQLY TFNANGNSLK TKTDYITVSA VGGLNASFAA TPTSGTAPLN VQFTDTSTGG ATFWSWNFGD GAASTDQSPS HTYSLAGTYT TSLTVRNSSG LTSIKEGTIT VTAPQQTLQA AFTINTQTVI AGQTTVTGID TSTGSPATWY WDFGDGYASS ARNINHVYTT AGSYTLSLTV TSGSQTSTTS KTITVTGESV ITPLANFTVT PQGGVGSMGI LVLDTSVNVT SVLYDLGDGT TTTYSNFRYT YWQPGTYTIK QTATSATGSS IKTITVTVPA TNSPVSPTVT MTMTPTVSPT GTVSVTLTPT PTDQPQTRAA SFNVSPTSGK RSFTTALIDT TTGGNPVSWK WTCGNGQSFS GKSVGLNRIW YNNAGTYTII LTVTDQDGST RTATHTVTVL
|
| |