Gene Information |
Locus tag | Mpal_0021 |
Symbol | |
ID | 7270133 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 20571 |
End bp | 21878 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643568680 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002465140 |
Protein GI | 219850708 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
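The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the CDS length includes one terminal stop codon that is not counted in the protein length. Both assumptions agree with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 20571, End bp 21878.
length = gene_length_bp(20571, 21878)
print(length)                     # 1308, matching "Gene Length"
print(protein_length_aa(length))  # 435, matching "Protein Length"
```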
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.099852 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGCAC TCATTGAGAT CGGTATCATC CTTCTGCTGA TCCTCTTCAA CGGCCTCTTC TCGATGGCAG AGTTTGCGAT CGTATCAGCC CGCAAGATCA GGCTCTCCCA GCTGGCTGCG GATGGGGATA AGCGGGCTGC AGTGGCCCTG GAACTTGCCG AGGAGCCGAA CCGTCTCCTC TCTGCCGTCC AGATCGGAAT CACTGTCATC AGTATCGTCT CCGGTGCCTA TGGTGGGGCA GCCCTCTCCG GGTACGTGGC GGCACCCCTC AAGTCGATCC CGGAGGTGGC TCAGTACAGC GACCTTCTGG CCCTGGTGTT GGTTGTCGCT GCCATCACCT ACTTGACCCT GGTCTTTGGG GAACTGGTCC CAAAAAGGCT CGCCCTCACG AATCCGGAGC AATTTGCGGC GTCGGTCGCG GTCCCGATGA AGTGGTTCGC CTGGGTGGGA TCTCCGCTCG TTTCACTCCT CTCGTACTCC ACCGATCTCG TGCTGGCAAT GCTCGGGGCA AAGAATTCGT CAGGGTCCCC GGTGACTGAG GAGGAGGTGA AGCTCCTGAT ACGGGAGGGG ACACAGGCGG GGGTCTTCCT GGAGGAGGAG CAGGCGATGG TCAGCCGGAT TCTCCGTCTT TCAGACCGGC GGGTATCCGG GCTGATGACA CCACGACCTG AGATCACGGC GATAGATCTC CGGAGTCCAG ATCTGGAGCA GATCGCCCTG ATGCGGGCCA GCGGCCACTC GTACTTTCCG GTGATCGACG GTGATCTGGA TCGGATCCGG GGGATGGTCT CGGTCCGGGA CCTCTGGGCC CGGATGCTCG ATGGTCAGGA GGCCACAGTC AGGGGCGCCC TGAGCGAACC ACTTTATATC CCTGAGTCAG TGCCAGCGCT GAAGGTGCCG GCCCTCTTCC GGGACGCCGG TCTTCATCTG GGTCTGGTCA CCGACGAATA TGGATCGGTG CAGGGCCTGG TCACCCCGCA CGACATCCTG GAATCGATCG TCGGGGTCCT CCCCTCCCCA GATCAGGAGG CCGAGCCTGA GATCGTTCAG CGGGATGATG GCTCCTGGCT GGTCGACGGG ATGCTCCCGC TCGACCAGTT CCGCGATGTC GTGCCGCTTG AGGACCTGCC GCTCGAGGAG AAGGGGTATT ACCATACGAT CGGCGGACTT GTGATGATGC ATCTTGAACG GAGGCCGCAA ACCGGGGACC GGTTTACCCA TGGGGACTTG CAGTTCGAGG TGGTGGACAT GGATGGCAAC CGGGTCGACA AGGTGCTGGT CACCCAGGTC GATGAGGCAG ATCAGTAG
|
Protein sequence | MAALIEIGII LLLILFNGLF SMAEFAIVSA RKIRLSQLAA DGDKRAAVAL ELAEEPNRLL SAVQIGITVI SIVSGAYGGA ALSGYVAAPL KSIPEVAQYS DLLALVLVVA AITYLTLVFG ELVPKRLALT NPEQFAASVA VPMKWFAWVG SPLVSLLSYS TDLVLAMLGA KNSSGSPVTE EEVKLLIREG TQAGVFLEEE QAMVSRILRL SDRRVSGLMT PRPEITAIDL RSPDLEQIAL MRASGHSYFP VIDGDLDRIR GMVSVRDLWA RMLDGQEATV RGALSEPLYI PESVPALKVP ALFRDAGLHL GLVTDEYGSV QGLVTPHDIL ESIVGVLPSP DQEAEPEIVQ RDDGSWLVDG MLPLDQFRDV VPLEDLPLEE KGYYHTIGGL VMMHLERRPQ TGDRFTHGDL QFEVVDMDGN RVDKVLVTQV DEADQ
|
| |
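The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation checked against the protein sequence. It uses only the standard library. The helper names are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares with table 1 (the two differ in their permitted start codons, which this simplified translator does not model).

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks stop codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean_sequence(raw: str) -> str:
    """Remove whitespace used for block formatting and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon (not included)."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks and last two blocks of the gene sequence in this record.
head = clean_sequence("ATGGCGGCAC TCATTGAGAT CGGTATCATC")
tail = clean_sequence("GATGAGGCAG ATCAGTAG")
print(translate(head))  # MAALIEIGII, the first ten residues of the protein
print(translate(tail))  # DEADQ, the last five residues, followed by the TAG stop
```

Applying `gc_fraction` to the full cleaned gene sequence would give the value to compare with the reported GC content of 62%.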