Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_1290 |
Symbol | |
ID | 7271150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 1319472 |
End bp | 1320980 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643569924 |
Product | protein of unknown function DUF39 |
Protein accession | YP_002466347 |
Protein GI | 219851915 |
COG category | [K] Transcription [S] Function unknown |
COG ID | [COG1900] Uncharacterized conserved protein [COG2524] Predicted transcriptional regulator, contains C-terminal CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0616948 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.376218 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAAGT CCATTGGCCT GATCAACCAG CGGATCGCTG ACGGCAATGC CACTGTTGTT ACAGCAGAGG AGATGCCTGC CCTGGTTGAT GAACTCGGTG AAGAAGGGGC CCTTGAAACA GTGGATGTGG TCACGACCGG AACATTTGGA GCGATGTGCT CGACGGGAGC ATTCCTGAAC TTCGGTCACG CCGACCCGCC GATCCGAATG GAACGGGTCT GGCTGAACGA TGTCGAGGCG TATGCCGGCC TCGCTGCGGT GGATGCTTAT ATCGGCGCCA CCCAGCAGTC CACAACGCGT GGGGACCAGT ACGGCGGCGC CCATGTGCTC GAGGACCTGG TATCCGGGAA GCAGATCGAC CTCCGGGCCA TCTCCCGCGG TACCGATTGC TATCCCCGGC GGACGATCAA CACCGAGCTG ATCCTTGAGG ATCTGAACCA GGCCACCATG TGTAACCCCC GGAATGCATA CCAGCGCTAC AATGCCGCAA CCAACACCAC CGACCGGACC CTCCACACCT ACATGGGGAT GCTCCTCCCG AACAGTGGAA ACATCACCTA TTCTGGGGCA GGGCTGTTGA ACCCGATCAC CAACGACCCG AAGTTCCGGG TGATCGGAAG CGGGACGCCG ATCCTGCTCG GTGGCGGTCA GGGGATGATC GTTGGGGAAG GGACGCAGCA CTCGTCCGGT GCCGGATTCG GGACCCTGAT GGTCACCGGG GACCTCAAGC AGATGTCACC GGAGTACCTG CGTGCAGCCA CCATGCAGGG GTATGGGGTC ACGATGTACA TCGGGGTAGG GGTGCCGATC CCGGTGGTTG ACCTCGAGGT TGTCCGGTCC ACGGCAGTCA GAGACGAGGA TATCCTGGTC GATCTGGTCG ACTACGGGGT GCCGAGCAGG TCACGTCCGG TTCTACGGCA GGTCAGTTAT GCCGACCTGA AGAGCGGAAC GATCGATCTG AATGGGGAGG AGGTGACCGC CTCGTCGCTC TCGAGCTTCC GGATGGCCAG GCGGGTCGCG GCGGACCTGA AAGAGCGGAT CCTTGCCGGA ACGATGTCGA TGACCCTGCC GACCCGGCAG ATCGATCCGA ACAAGCTCGC CCACCCGATG CATGAGACGG TGCATGCCCC GCGGGTCTGT GATATCATGG ACCGGCACCA GGTGAGCATC ACCGAGGACG AGGAGATCCG GACGGCGGCA AAGAAACTGT TGAAGGGTGA GACCAACCAC CTGACGGTGC TGAATCTGGA GGGACGACTG GTCGGGATGG TCACCACCTA TGACCTCTCC AAGGCCGTCG CCAACCCGGG GAAGGTCTCG CTGGTCAGGG AGATCATGAC GAGAAAAGTG ATCACGACCA CACCCGATGA GGTGGTCGAC GTTGCAGCCC AGAAACTGGA ACAGTATAAT ATCAGTGCCC TGCCGGTGAT CGATAAGGCC GGCCGGGTGC TCGGCATGCT GACGGCACTT GACTTAGGAA AACTGTTCGG CAGGAGGTGG ACACGATGA
|
Protein sequence | MEKSIGLINQ RIADGNATVV TAEEMPALVD ELGEEGALET VDVVTTGTFG AMCSTGAFLN FGHADPPIRM ERVWLNDVEA YAGLAAVDAY IGATQQSTTR GDQYGGAHVL EDLVSGKQID LRAISRGTDC YPRRTINTEL ILEDLNQATM CNPRNAYQRY NAATNTTDRT LHTYMGMLLP NSGNITYSGA GLLNPITNDP KFRVIGSGTP ILLGGGQGMI VGEGTQHSSG AGFGTLMVTG DLKQMSPEYL RAATMQGYGV TMYIGVGVPI PVVDLEVVRS TAVRDEDILV DLVDYGVPSR SRPVLRQVSY ADLKSGTIDL NGEEVTASSL SSFRMARRVA ADLKERILAG TMSMTLPTRQ IDPNKLAHPM HETVHAPRVC DIMDRHQVSI TEDEEIRTAA KKLLKGETNH LTVLNLEGRL VGMVTTYDLS KAVANPGKVS LVREIMTRKV ITTTPDEVVD VAAQKLEQYN ISALPVIDKA GRVLGMLTAL DLGKLFGRRW TR
|
| |