Gene Mpal_1421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1421 
Symbol 
ID7270026 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp1463510 
End bp1466302 
Gene Length2793 bp 
Protein Length930 aa 
Translation table11 
GC content54% 
IMG OID643570051 
ProductNHL repeat containing protein 
Protein accessionYP_002466473 
Protein GI219852041 
COG category[R] General function prediction only 
COG ID[COG3291] FOG: PKD repeat 
TIGRFAM ID[TIGR01634] phage tail protein, P2 protein I family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.368909 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCTT CATATTCTTT GTATATTATC TGTGTCCTGT TCCTCCTCTG TAGCAGCGTC 
CAGGTCGTTT CGGCTGAAGG AGGGTATGCG TACGCCACAC AATGGGGCAG TTCAGGTTCT
GGAGATGAAC AGTTCTCCTC TCCATCTGGT GTTGCAGTGG ATAGCGTCGG GAACGTCTAC
GTCGCTGACG TGGGCAACAA CCGGATCCAG AAGTTCACGT CGACCGGGAC CTTCATCAAA
AAATGGGGCA GTTCAGGTTC TGGAGATGGA CAGTTCTCCT CTCCATCTGG TGTCGCAGTA
GATAGTGCTG GTAATGTCTA CGTGGCTGAC ACGGGAAATA ACCGGATCCA AAAGTTCACG
TCGATGGGGA TATTTATCAA ACAATGGGGC AGTTCGGGTT CTGGAAACGG ACAGTTCTTC
TCTTCTCCAT TTGGTGTCGC AGTAGATAAT GCTGGTAATG TCTACGTGGC TGACACGGGA
AATAACCGGA TCCAGAAGTT CACCTCTGAT GGTGCGTTTG TCACCAATTG GTGGGTGAAC
GAACCAAATG GACCAGATGG TGTTACAGTG GATAGCGCCG GCAATGTCTA TGTGGTTGAC
GTATCCTATA TTGACCGGGT TCAGAAGTTT ACATCATCTG GCACGTTCAT CGCGAAATTT
GGCAGTGATT ACATTCATGA CAGCGCAATG AGTTATCACA CTAGTGTCGC AGTGGACAAC
GCTGGAAATG TATACTTCAG GGGGCCTGTT AGTGGAATCC AAAAGTTCTC GTCGACTGGG
GCACCTATAA CAAAATGGGG TAATTATGGT TCAGGAATGT ACTATGGTCC GGGTGATGTG
GCAGTGGACA GCACCGGAAA TGTATATGTC AGCGACACCC AGAATGCTCA GATTGTAAAG
TTTACCCCTG ATATCCCTCT CATTCCCGGT TTCACCGCGA CACCGACCAC AGGCGCCGCC
CCGCTGACTG TGCAGTTCAC TGACACCACA ACCGGGAGAC CGACCTCCTG GTACTGGAAT
TTCGGGGATG GGTACGCCTC CGGTGCTCAG AACCCGAGCC ATCTGTATAG TACTGCCGGC
ACTTACTCGG TCACGCTGAC TGCCACCAAT GCTGTTTCGG GGAGCAAATC GATCACAAAG
ACAGGGTACA TCACCGTCAC AGAAGCCCCA GCCATTACAC CGGTCGCAGA CTTCACGGCA
ACCCCATCCA ACGGTGCTGC ACCATTGGCA ATCCAGTTCA CTGACCGGTC GACAAATGCA
AAACAGTGGT CCTGGACCTT CGGCGACGGC ACAACCTCGA CCGAGCAGCA CCCCTCACAT
ACCTACACCA CTGCCGGTAC ATACACCGTC GTGCTCACCG TCAGAAATGC AGCCGGCCAA
TCGAACACCA AGACCCAGAC GAACCTGATC TCAGTTACAT CCCCAGTCAG TGAAACACCG
GCTGCAGACT TCACGGCCAC CCCGACATCG GGAACCGGCC CGCTCACCGT CAGGTTTACA
GATACCTCAA CCGGTGTCCC GACAGGCTGG TACTGGTTCT TTGGGGATGG TTACTGCGCA
TTCGAGAAGA ACCCCTCACA CATCTTCGCA CAGGCCGGCA CCTATACGGT ACAGCTCTAT
ACGTTCAATG CAAACGGTAA CTCGCTGAAG ACAAAGACGG ACTATATTAC GGTCAGCGCG
GTCGGTGGGC TTAATGCAAG TTTTGCTGCC ACGCCGACCT CGGGGACAGC CCCTCTTAAC
GTTCAGTTCA CAGACACGTC GACTGGAGGA GCGACCTTCT GGTCCTGGAA TTTTGGTGAC
GGGGCAGCCT CCACTGATCA GAGTCCGAGC CACACCTACT CTCTGGCTGG CACGTACACG
ACCTCATTGA CAGTCCGGAA CAGTTCTGGT CTGACGAGCA TCAAGGAAGG GACGATCACC
GTCACCGCCC CACAGCAGAC CCTCCAGGCA GCGTTCACGA TTAATACGCA GACTGTGATA
GCCGGGCAGA CAACAGTAAC CGGCATCGAT ACATCCACCG GGTCCCCAGC CACCTGGTAC
TGGGACTTTG GTGATGGGTA TGCGTCATCG GCCCGGAACA TCAACCACGT CTACACCACC
GCCGGCTCGT ACACCCTGAG TCTGACCGTC ACGAGCGGCT CACAGACCAG CACGACCAGC
AAAACGATCA CCGTAACCGG TGAATCTGTA ATAACACCTC TGGCCAATTT CACGGTCACA
CCCCAGGGGG GAGTCGGCTC GATGGGTATC CTGGTCCTCG ACACCTCGGT GAACGTGACC
TCGGTGTTGT ATGACCTCGG CGACGGCACG ACCACCACCT ATTCGAACTT CCGGTACACC
TACTGGCAAC CTGGCACGTA TACGATCAAA CAGACCGCGA CCAGTGCAAC CGGGTCCTCG
ATAAAGACGA TTACCGTGAC CGTACCAGCC ACGAATTCAC CGGTCTCACC GACGGTAACC
ATGACTATGA CCCCGACCGT CTCACCAACA GGGACCGTGA GTGTGACCCT GACTCCCACT
CCAACCGATC AACCCCAAAC CAGGGCAGCA AGCTTCAACG TTAGTCCGAC CTCCGGCAAG
AGATCATTCA CTACTGCACT CATAGACACC ACTACCGGCG GTAACCCAGT CTCCTGGAAG
TGGACCTGTG GAAACGGGCA GTCCTTCTCT GGGAAGAGTG TTGGTTTAAA CAGAATCTGG
TACAACAATG CAGGTACCTA CACCATCATC CTGACCGTGA CAGATCAGGA CGGTTCAACA
AGGACCGCCA CGCATACTGT CACTGTCCTG TGA
 
Protein sequence
MKSSYSLYII CVLFLLCSSV QVVSAEGGYA YATQWGSSGS GDEQFSSPSG VAVDSVGNVY 
VADVGNNRIQ KFTSTGTFIK KWGSSGSGDG QFSSPSGVAV DSAGNVYVAD TGNNRIQKFT
SMGIFIKQWG SSGSGNGQFF SSPFGVAVDN AGNVYVADTG NNRIQKFTSD GAFVTNWWVN
EPNGPDGVTV DSAGNVYVVD VSYIDRVQKF TSSGTFIAKF GSDYIHDSAM SYHTSVAVDN
AGNVYFRGPV SGIQKFSSTG APITKWGNYG SGMYYGPGDV AVDSTGNVYV SDTQNAQIVK
FTPDIPLIPG FTATPTTGAA PLTVQFTDTT TGRPTSWYWN FGDGYASGAQ NPSHLYSTAG
TYSVTLTATN AVSGSKSITK TGYITVTEAP AITPVADFTA TPSNGAAPLA IQFTDRSTNA
KQWSWTFGDG TTSTEQHPSH TYTTAGTYTV VLTVRNAAGQ SNTKTQTNLI SVTSPVSETP
AADFTATPTS GTGPLTVRFT DTSTGVPTGW YWFFGDGYCA FEKNPSHIFA QAGTYTVQLY
TFNANGNSLK TKTDYITVSA VGGLNASFAA TPTSGTAPLN VQFTDTSTGG ATFWSWNFGD
GAASTDQSPS HTYSLAGTYT TSLTVRNSSG LTSIKEGTIT VTAPQQTLQA AFTINTQTVI
AGQTTVTGID TSTGSPATWY WDFGDGYASS ARNINHVYTT AGSYTLSLTV TSGSQTSTTS
KTITVTGESV ITPLANFTVT PQGGVGSMGI LVLDTSVNVT SVLYDLGDGT TTTYSNFRYT
YWQPGTYTIK QTATSATGSS IKTITVTVPA TNSPVSPTVT MTMTPTVSPT GTVSVTLTPT
PTDQPQTRAA SFNVSPTSGK RSFTTALIDT TTGGNPVSWK WTCGNGQSFS GKSVGLNRIW
YNNAGTYTII LTVTDQDGST RTATHTVTVL