Gene Information |
Locus tag | Mpal_0531 |
Symbol | |
ID | 7271947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 532541 |
End bp | 533806 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643569178 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002465627 |
Protein GI | 219851195 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
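The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (the record does not state this explicitly) and that the CDS ends in a single untranslated stop codon.

```python
# Values copied from the Gene Information fields above.
start_bp = 532541
end_bp = 533806

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1266
print(protein_length)  # 421
```

Both results match the "Gene Length" and "Protein Length" fields listed in the record.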
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0811523 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTCAGT TAGACCTTTT TAACGCTTCG ATTTTTGGAC TCTGTATCAT TCTTTCGGCC TTTTTCTCCA GTTCTGAAGT GGCGCTGATC TCGATCACAC GAGCCAAGGT CCGGGCATTG GTGAATGATG GGCGGTCAGG CGCAGCCCAA CTCTTTAAAC TCAAGCAGAA TCAGGATCAT ATCCTGATCA TCCTCCGGAT CGGAAACACC ATCGCGATCG TGGCCGCGGC CGCGGTGGCC ACCTCCATCG CGATCGAGGC ATTTGGGGAT CCCGGTCTGG GGATCGCGAT CGGAGGTACA GTGCTGATTC TGCTGATCTT CGGTGAGATC GGACCGAAAC TCTTTGCGAC CCGGTACACC GAACCCCTGG CCCTCAGGGT GGCTCCCCCG ATTCTCTTCC TCTCCCGGGT CGTCGGTCCG TTCCTCTGGT TATCAGATAA GGTCAGCCGT TCACTGGTCC CTGGAGATGT CTCTACTGAA CCAACGGTGA CTGAGGATGA GATCCGGGAA TGGATCGATG TCGGTATGGA GGAGGGGACG ATCGAGCAGG AGGAGCAGGA GATGCTCTAC AATGTTCTGG AACTCGGGGA CACAACCGCT CGCGAGGTGA TGACCCCCCG CGTCGACGTC GCGATGATCG AGGACACCAG CACCCTTGAG AGTTCCCTCA CTATCTTCCA TGAGACTGGT TTCTCCAGGC TCCCGGTTTA TCACGAGAAG ATCGATAACC TGACAGGGGT CCTCAACATC AAGGATGTCT ACACGGTGAT CGTCGGACAT AAGAAGGATG TAAAGATCTC GGATCTGATG TACGATCCGT ATTTCGTCCC TGAAACCAAG AAGATCGACG ACCTCTTAAA GGAGTTGCAG CTCAGAAAAG TTCCGATGGC GATGGTGATG GATGAATATG GTGGTTTTGT TGGGGTTGTG ACGGTCGAGG ACATCCTTGA GGAACTGGTT GGTGACATCC TCGATGAGTT CGATGATGAG GAACCCGAAC TGTCCCGGAT CGGCGAGGGG ATCTATATGC TGGATGCGCG TATGTGGGTC GATGATCTGA ACGAACAACT GGATATTGCG CTTCCGACCT CCGATACCTA CGAGACGATC GGGGGGCTGT TGATCGAGCA GCTCGGCCAT ATTCCGCATC CTGGTGAGAC AGTCAGGGTC GAGGAGAGCA ATGCGACGCT GGTCGTTATG CAGATGCGGG GCAAACGGAT CGTCAAGGTG AAGATGATCC TCTCCGATGA TCGGGAGAAG CGTTAA
|
Protein sequence | MLQLDLFNAS IFGLCIILSA FFSSSEVALI SITRAKVRAL VNDGRSGAAQ LFKLKQNQDH ILIILRIGNT IAIVAAAAVA TSIAIEAFGD PGLGIAIGGT VLILLIFGEI GPKLFATRYT EPLALRVAPP ILFLSRVVGP FLWLSDKVSR SLVPGDVSTE PTVTEDEIRE WIDVGMEEGT IEQEEQEMLY NVLELGDTTA REVMTPRVDV AMIEDTSTLE SSLTIFHETG FSRLPVYHEK IDNLTGVLNI KDVYTVIVGH KKDVKISDLM YDPYFVPETK KIDDLLKELQ LRKVPMAMVM DEYGGFVGVV TVEDILEELV GDILDEFDDE EPELSRIGEG IYMLDARMWV DDLNEQLDIA LPTSDTYETI GGLLIEQLGH IPHPGETVRV EESNATLVVM QMRGKRIVKV KMILSDDREK R
|
| |
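The gene sequence above is printed in space-separated blocks of ten bases, so it needs light cleaning before the "GC content" field can be recomputed from it. The following is a minimal sketch of one way to do that; the helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any database tooling.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100 * gc / len(seq)


# Only the first two blocks of the gene sequence are used here for brevity;
# paste the full "Gene sequence" field to check the whole gene.
fragment = clean_sequence("ATGCTTCAGT TAGACCTTTT")
print(len(fragment), gc_percent(fragment))
```

Running the same functions on the full sequence field should give a length equal to the "Gene Length" value and a percentage close to the "GC content" value, assuming the field was copied without loss.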