Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_1474 |
Symbol | |
ID | 7270079 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 1523625 |
End bp | 1525631 |
Gene Length | 2007 bp |
Protein Length | 668 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643570097 |
Product | NHL repeat containing protein |
Protein accession | YP_002466519 |
Protein GI | 219852087 |
COG category | [S] Function unknown |
COG ID | [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0914572 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTTA AGTATTTTTT TTATTTTTTC TGCGCCCTGC TCCTCCTCTG TTGCAGCGCC CAGGCTGTTT CGGTTGAAGG TGGGTACGCA TATGTTACGC AATGGGGCAG TTCTGGTCAA GAAGCCGGGC AGTTCAACCA GCCCTATGGT GTCACAATTG ACAGCATTGG CGATGTCTAC GTCGTCGACA CATACAACAA CTGGATCCAG AAGTTCGATT CGAACGGCAC ATTCCTCAAA AAATGGGGCA GTTTTGGCAC CGGAGACGGG CAGTTCAACA TACCCTATGA TATCGCCGTG GACAGCGTCG GCTACGTCTA CGTCGCCGAC ATGAATAACA ACCGGATCCA GAAATTCAAT TCGACTGGTG GTTACCTGAC CCAATGGGGC ACGAAAGGCT CGGAGGAAGG ACAACTCGAC CAGCCAGGTA GTGTCGCGGT GGACAGCAGA GGACAGATCT ATGTCGCTGA CTGGGGCAAC AACCGGGTTC AGGTATTCAA TTCGACCGGT GGCTACCTCA TGCAGTGGGG GAGTTCCGGC TCGGGAGACG GACAGTTCGA CGGTCCGAAT GGAATTGCCA TAGACAGCAC CGGCAATGTC TATGTCACTG ACGCATACAA CAACCGGATT CAGGAGTTCA ATTCGACCGG TGGCTACCTC ATGCAATGGG GAAGTTCTGG CTCGGAGGCC GGGCAGTTCG AGATTCCCCA GGGTATCGCG ATGGACAGTA ACGACAACGT CTACGTGGCC GACTCTGGCA ACCGGGTCCA GAAGTTCACG TCGGCCGGCA CCTTCATCAC GCAATGGGGT ACGAAAGGCT CGGAAGCCGG GCAGTTCAGC AATCCCTTTG GTATCGCCGT GGACAGCGCC GACAATGTCT ATATCACTGA CGTGTACAAC AACCGGGTCC AGAAGTTCAC GTCGGCCGGC ACCTTCATCA CGCAATGGGG CAGTCAGGGT TTGGAAGTCG GACAGTTCAA CATGCCCTAT GGTGATGCCG TGGACAGTGC AGGCAATGTC TACGTCACCG ACCTGGGGAA CAGCAGGGTC CAGAAGTTTA CCGCGAACGG CACCTTCATC ACAGAATGGG GCAGTTCGGG ATCGGGAGAC GGACAGTTCA ACATGCCCTA TGGTATCGCC GTGGACAGCG CCGACAACGT CTACGTCGCT GATTTGAATA ACAACCGGGT CCAGAAGTTC AATTCGACTG GTAGCTACCT GACACAATGG GGCATGACAG GCTCAGGGAA CGGACAGTTC GACCAGCCAT GCGGTGTCGC GGTGGATCGC TTCGGCATCG TCTATGTCAC TGACTTTGGC AACAACCGGG TCCAGATGTT CACGTCGGCC GGTGGCTACC TCTCCCAATG GGGCAGCCAT GGTCCGGGAG CCGGGCAGTT CAGCGGTCCG AATGGAATTG CACTGGACAG CACCGGCAAT GTTTATATCA CAGACTGGGG CAACAACCGG GTCCAGAAGT TCACGTCGAC TGGTAGTTAC CTCAGGCAAT GGGGCAGTTC CGGCTCGGAA GACGGGATGT TCGGCGACTC AACGAGTGTC GCCGTGGACC GTGACAGCAA CGTCTACGTG TCCGACAGTA GCAACCACCG GATCCAGAAG TTCGATCAAA ACGGCACATT CATCACGAAA TGGGGGAGTT ATGGCTTGGA AGCCGGGCAG TTCAACAGTC CTTTTGGTAT CACGGTGGAT GGTGCCGGCA ACGTCTATGT CACCGACGTG AACAGCAATA GGGTCCTGAA GTTCGCCCCC ACTGGTACGA CCCCGGTGGT GATCGTCCCG GGCGGGTCAG CCGTCCCGCA GGATCTCAAC CATGACGGAC TGTACGAGGA CGTCGATGGC AACGGAGTCC TCGACTTTGG TGACGTGGTC CTCTTCTTCA ACCAGATGGA CTGGATCGCC GAGAATGAAC CGATCAGTGC GTTTGATTTC AACAAAAACG GTCAGATCGA TTTCAATGAT ATTATCACCC TGTTCAACGA GTTGTAG
|
Protein sequence | MKFKYFFYFF CALLLLCCSA QAVSVEGGYA YVTQWGSSGQ EAGQFNQPYG VTIDSIGDVY VVDTYNNWIQ KFDSNGTFLK KWGSFGTGDG QFNIPYDIAV DSVGYVYVAD MNNNRIQKFN STGGYLTQWG TKGSEEGQLD QPGSVAVDSR GQIYVADWGN NRVQVFNSTG GYLMQWGSSG SGDGQFDGPN GIAIDSTGNV YVTDAYNNRI QEFNSTGGYL MQWGSSGSEA GQFEIPQGIA MDSNDNVYVA DSGNRVQKFT SAGTFITQWG TKGSEAGQFS NPFGIAVDSA DNVYITDVYN NRVQKFTSAG TFITQWGSQG LEVGQFNMPY GDAVDSAGNV YVTDLGNSRV QKFTANGTFI TEWGSSGSGD GQFNMPYGIA VDSADNVYVA DLNNNRVQKF NSTGSYLTQW GMTGSGNGQF DQPCGVAVDR FGIVYVTDFG NNRVQMFTSA GGYLSQWGSH GPGAGQFSGP NGIALDSTGN VYITDWGNNR VQKFTSTGSY LRQWGSSGSE DGMFGDSTSV AVDRDSNVYV SDSSNHRIQK FDQNGTFITK WGSYGLEAGQ FNSPFGITVD GAGNVYVTDV NSNRVLKFAP TGTTPVVIVP GGSAVPQDLN HDGLYEDVDG NGVLDFGDVV LFFNQMDWIA ENEPISAFDF NKNGQIDFND IITLFNEL
|
| |