Gene Information |
Locus tag | Mpal_1994 |
Symbol | |
ID | 7270800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 2119016 |
End bp | 2120122 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643570609 |
Product | hypothetical protein |
Protein accession | YP_002467020 |
Protein GI | 219852588 |
COG category | [R] General function prediction only |
COG ID | [COG1571] Predicted DNA-binding protein containing a Zn-ribbon domain |
TIGRFAM ID | |
|
|
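The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal consistency check, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; neither convention is stated in the record itself.

```python
# Values copied from the Gene Information table.
start_bp = 2119016
end_bp = 2120122

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1107 368
```

Under these assumptions the computed values agree with the reported Gene Length (1107 bp) and Protein Length (368 aa).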
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.183583 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAAGGG TACTTGAACG GCGACGACAG TTCCTGAGTT TGATGCGTCA GTTCACGCTT GATCGAGGTT TTTTTCAGAT CACCGATGTC GCTGAGGCCC TGCAGGTGCC CCGCTCGACA GCTCAGGACT GGGTCAAGCG GCTGATCGAG GAGGGGTGTC TGATCGTCAG AGAGAGTCCT CACGGGCGTC ATGGGGGCCA CTATGCAGCG GTCAGCGCGG TCCCTGCCAG CACCTGCAGG CGAATTTTCA CCACTACCGA TGGCGACGAG GTGGAGATCT TTCATGAATG TATCTCCGGC GCCTGCGCCG CCTTCTGCGG ATTTCACCAT ACAAAGTCTG GTGGTGTGAT CACTGCCATC GAACGGGACG GGCCCCTGCT TCGGGAGCAT GGCCGGATCG GGGAGGCACA CCTCAAGATC GGGCTTTACC CGGCCCCTGC GGTCGGAGTG GGTGCGATCC GGCGGGAAGG AGTGAACATC CTTCAGGAGA TCACCTCGAT CGGAGGGCCG GCATACTCGC TGACCGAGAT GATCGGTCAT GCTGAAGGGG TCTGTGAGGT CAGGATCCGG CGGGAAGGAT CGCTCGTCAC GGGAGAGGTG ATCACACAGG ACCTCACCCA TCTGATCATC GGGATCGACG ATACCGACGG CCCGGATGGG GGGGCTACCT TCGCGCTGGC AGTGGCCCTC CTCCAGCACC TGGGACGGAT CAAAGGCGTG ATCCCGATCG GCCACCGGGT CGCTATGCTG AACCCCAGTA TTGAACAGAA GACCGTGGGG AATGCATGCA GTTATCTGGA ACTGGCCGTC GATCCGGAAC TGGTGACAGA GGTGACAGCA CGATGCAAAA AATTCGTCGG TGACGAGAGT CGATCGCCGG AATGGGGGAT CGCCGTGAAG GAGGGATTCC AAATAGGTCC GGCCCTTCGA GCATACGGGG AATGTGCCCG GATCGGCTAC CTTTCGCGAG ATATTGCGGA AGAGACGGCC GCAGCCCATG CGATCAGTCT TTATGGCGGA GATGGGGTGA TCGGGGCGCT GGCAGCCGTT GCCCTGGCCG GGGTTCCGAC CGAAACGCTG CTGGATCTGC GAATCCCGAT CTGCTGA
|
Protein sequence | MSRVLERRRQ FLSLMRQFTL DRGFFQITDV AEALQVPRST AQDWVKRLIE EGCLIVRESP HGRHGGHYAA VSAVPASTCR RIFTTTDGDE VEIFHECISG ACAAFCGFHH TKSGGVITAI ERDGPLLREH GRIGEAHLKI GLYPAPAVGV GAIRREGVNI LQEITSIGGP AYSLTEMIGH AEGVCEVRIR REGSLVTGEV ITQDLTHLII GIDDTDGPDG GATFALAVAL LQHLGRIKGV IPIGHRVAML NPSIEQKTVG NACSYLELAV DPELVTEVTA RCKKFVGDES RSPEWGIAVK EGFQIGPALR AYGECARIGY LSRDIAEETA AAHAISLYGG DGVIGALAAV ALAGVPTETL LDLRIPIC
|
| |
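The sequence fields are shown in space-separated blocks of 10 characters. The sketch below is one possible way to strip that grouping, compute a GC fraction to compare with the reported GC content, and translate the gene sequence. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical and chosen for illustration only. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons, so a standard codon table is used here.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def clean_sequence(text):
    """Remove the block spacing and normalize to upper case."""
    return "".join(text.split()).upper()

def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three blocks of the gene sequence from the record.
prefix = clean_sequence("ATGTCAAGGG TACTTGAACG GCGACGACAG")
print(translate(prefix))  # MSRVLERRRQ
```

The translated prefix matches the first block of the listed protein sequence. Applying `gc_fraction` and `translate` to the full cleaned gene sequence would allow the same comparison against the reported GC content and the complete protein sequence.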