Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_2369 |
Symbol | |
ID | 7272090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 2514661 |
End bp | 2516400 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643570970 |
Product | NHL repeat containing protein |
Protein accession | YP_002467373 |
Protein GI | 219852941 |
COG category | [S] Function unknown |
COG ID | [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.547997 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACAATC GGTACATGCT GGAAGAGCTT GAAGTGATGC TCGTCCTGTT GCTGTTCGGG TGCAGCGTTC AGGTCGCAAC GGCGGCGGAT ACGTATCAGT TCGTGGCGCA GTGGGGGACC AGTGGCTCAG GGGACGGGCA ATTCAACTCC CCGTATGGAG TCGCGCTCGA CGGTGTGGGG ACCGTTTATG TCACCGACAA GAACCTCGAC CGAGTCCAGC AGTTCAACGC CACCGGCGGT TTTATTAGAA CATGGGGAAC TCCCGGATCT GGAGACGGAC TGCTCTGGAA CCCCAAGGGA ATCGCAATCA ACAGCGCGGG AAACGTCTAT ATTGTCAACA ACTGGAACGA TAGAGTTCAG CGGTTCACCT CGACCGGCAT CTTCCTCGCA CGGTGGGGGA CCGGCGGCAC CGGGGACGGG CAGTTCAAAT CCCCATCTGG GGTCGCGGTC GACAGCGCAG GAAACGTCTA TGTCGCCGAC ATGTACAACT ACCGGGTTCA GAAATTCTCC TCAGCCGGCA CTCTCCTCGC GAAGTGGGGA ACCGAAGGAG GGGGAGACGG GCAGTTCGAT TACCCGACCG GGATCGCGGT TGACAGCGAG AACAACGTCT ACGTCGTTGA CTCCTATAAT AACCGGGTCC AGAAGTTCAC CTCGAACGGT ACCTTCCTTG CGAAGTGGGG AGCCAGGGGA TCTGGAGACG GAGAGTTTGC GGATTTCCCA GAAGAGATCG CAGTTGACAG CACAGGTAAC GTCTTTGTCA CCGACACCGG AAACAACAGA ATTGAGAAAT TCACCTCGAA CGGCACCTTC CTCGCCAAGT GGGGAGGACG CGGCTCAGGG GACGGGCTGT TCGAATCACC CACCGGGATC GCCGTCGACA GCGCGGGAAG GATCTATATC GCCGATACCG GCAACCACCG GATCCAGATG TTTGCTTATC CAACACCGAC TGAAATTCCA ACCGTCGTTG TGCCGACCAC TATACCAACC GGGATTCCGA CTGTCGTTGA GCCAACCGTC ACGATCACCG CCACGCCTTC GGCAACGACC ACCGTTCCAA CCGAAATTCC CTCGGTGATC GTGCCGACCG TCACAGTCAC CCCAAGCATC GAGATACCGG TCTTCCGATT CACCCCGCCT TCGCTCGCCA TCCCACGTGG GTCAACAAAC ACGACCACCC TCTTCCTCGA TGGTTTAAAC ATTACCTCGG GCTACAACGT GACGCTCGGT CTCACCCTCC TCAATCCGGC AATCAGCGAG ATAGTCGGTG TATCCCTCCC TGGGTGGACG ATGGCGAAGA ATCTGACCCT CCCCTCTGAT ACGGTATGGT TGACCACACT CAATACAGCG GGCCTGGGCG AACCCACTGC TTCAGGCGCC AGGATCGGAA CGATCACTAT CAGGGGTGAC CAGGGTGGAG AGACCGCGCT CTGCGTCATT CAGGCGTATG TCAGCGATGA AAAAGGAAAA CCTGTCGCAC CCCAACTGGC AGTCTGCCCG ATCGAGGTAA CGATCCCCTA TCGTCCGCTC CCTGGCACTT CGTCCAATCC AAATGATCTC AACGGGGACG GCCTCTATGA AGATATCAAC GGGGACGGGG TGCTCGACTT CAATGACGTG ATTCTCTTCT TCAACCAGAT GGACTGGATC GCTGATAATG AGCCTGTCAT CGGCTTCGAC TTCAATGAAA ACGGGCAGAT CGACTTTAAT GACGTGGTGC TTCTCTTCAA CCAACTCTGA
|
Protein sequence | MHNRYMLEEL EVMLVLLLFG CSVQVATAAD TYQFVAQWGT SGSGDGQFNS PYGVALDGVG TVYVTDKNLD RVQQFNATGG FIRTWGTPGS GDGLLWNPKG IAINSAGNVY IVNNWNDRVQ RFTSTGIFLA RWGTGGTGDG QFKSPSGVAV DSAGNVYVAD MYNYRVQKFS SAGTLLAKWG TEGGGDGQFD YPTGIAVDSE NNVYVVDSYN NRVQKFTSNG TFLAKWGARG SGDGEFADFP EEIAVDSTGN VFVTDTGNNR IEKFTSNGTF LAKWGGRGSG DGLFESPTGI AVDSAGRIYI ADTGNHRIQM FAYPTPTEIP TVVVPTTIPT GIPTVVEPTV TITATPSATT TVPTEIPSVI VPTVTVTPSI EIPVFRFTPP SLAIPRGSTN TTTLFLDGLN ITSGYNVTLG LTLLNPAISE IVGVSLPGWT MAKNLTLPSD TVWLTTLNTA GLGEPTASGA RIGTITIRGD QGGETALCVI QAYVSDEKGK PVAPQLAVCP IEVTIPYRPL PGTSSNPNDL NGDGLYEDIN GDGVLDFNDV ILFFNQMDWI ADNEPVIGFD FNENGQIDFN DVVLLFNQL
|
| |