Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_1566 |
Symbol | |
ID | 7271111 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 1613824 |
End bp | 1615713 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643570180 |
Product | protein of unknown function DUF814 |
Protein accession | YP_002466602 |
Protein GI | 219852170 |
COG category | [K] Transcription |
COG ID | [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.853541 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACAA CACAGGGGAT GAGCGGCGTC GACCTGCTCG CAGTAACAGC GGAACTGCGC GAGCATCTGC CGCTCTGGAT CAACAAGATC TACCAGTATG ACAACAAGAT GCTGAGCATC AGGCTCAATG GCGAGGAGCA TGCAAAGTAT CACCTGCTCA TCGAGTCAGG ACGGCGAATC CATCTCGCAA CGGTCCTGCC GAATCCACCC AAGAACCCAC CGTCCTTTGC AATGCTGCTC CGGAAGTACC TCGAAGGGGG GAGGGTTCTT GAGATCCGAC AGCAGGGGCT CCAGCGGGTC GTGACCTTTG TGATCGGCAA GCGGGACACG ACGCTGCACC TGGTGATCGA ACTCTTCGAT GAAGGGAACG TCATCCTCTG TGACGATCAG ATGACGATCA TCAAGCCGCT CTGGCATCAC CGGTTCAAGG ACCGGGAGGT GATTCCGGGG GTCGTCTACA CCTACTCGGG CAGCAGTGAG ACGGCTCCGG ACCAGGAGGC ACTGAAGACG TTACTCGCCA CATCCGATCG GGATGTGGTC AGGACCGTAG CCGTCGGGTG TATGCTCGGC GGGCAGTACG CCGAGGAGGT CTGCACCGGT GCCGGGATCA GTAAGGAGAC CCCGGCCACC GAAGCCAATC CGATCGCCAT TGGGGCGGCG CTGGAGAGGC TCTTCACCCG GGTCAGCGAA GATCGTGACC CGGTGGTCAC CAGCGGCGGG GCCTGGCCGA TCGTGCTGAC TGGAATGACT CCAATCAGCC ACCACCCCAC CTTCTCCGAG GCGCTCGAAG CGATCTATCC CCTGGTGACC AGGCACGAGG GGCCGCAGAA GAAGGCACCG ATCCCGCGGG AGGAACGGAT CCGGCTTCAG CAGGAGGCGG CGCTCAAATC GTTCGATAAG AAGATCGTTC TGAACAAGGC GATCGTCGAC CTGATCTACG AGAACTATAC GCTGGTCACC GATGTGATCA AAACTCTGGA TGCGGCCAGT AAAACCCTCT CCTGGCAGGA GATCGGATCG ATGCTCAAGG AGAGCGACAA CGATGTGGCC CGACAGATCG CCGGCGTCCA TCCAGCTGAG GCAGCGGTGG ACCTCCTCCT CGATGGGAAG AAGGTACTGA TCCATGTGCA TGAGAGCATC GAGGTGAACC TCGAACGCTA CTATGCGCAG GTCAAGAAGT TCAAGAAGAA GCGGGACGGG GCTGTGTCCG CGATGGAGCG GCCGGTGGCA AAGAAAGCCA CGAGCAAGGT CCACCTGACC CCGCTGAAGA AGCGGTGGTA TCACCGGTTC CGCTGGTTCT TCACCAGTGA TAACTGTCTG GTGCTCGGAG GCAGGGACGC CGGCCAGAAC GAGGAACTGG TGAAGCGGTA CATGGAAGGG GGCGACACCT TCGTCCATGC CGACGTCCAT GGGGCCAGTG TGGTGATCGT CAAGGGGAAG ACCGAACAGA TGGACGAGGT GGCCCAGTTC GCCGCCTCGT ACTCAGGTGC ATGGCGGAGC GGCCACTTCT CTGCCGACGT CTACGCGGTC CGCCCCGACC AGGTCAGCAA GACCCCGGAG GCCGGCGAGT TCGTCTCCCG CGGGTCGTTC ATCGTCAGAG GCGAACGGAC GTACTTCAAG AGCGTTCCGC TCGGGGTGGC CATCGGTTAC CAGACCGAGC CGAACGCGGC GGTGATTGGG GGGCCGGTGA ATGCGGTCGA AGCCTGGACA ACGCAGCGGG TGCTGCTGAA GCCGGGCCCG TACGAACCGA ACGATATCGC AAAGAAGGTG CTGCGGCAAC TTCGTGACAC GATCCCAGAA GAGGACTGGA AAGGGTTGAA GACGGTGTTG AACACCGAGC AGGTCGCCGG CTATGTTCCG CCCGGCGGTT CAGAGATCGT CGGGGTATGA
|
Protein sequence | MATTQGMSGV DLLAVTAELR EHLPLWINKI YQYDNKMLSI RLNGEEHAKY HLLIESGRRI HLATVLPNPP KNPPSFAMLL RKYLEGGRVL EIRQQGLQRV VTFVIGKRDT TLHLVIELFD EGNVILCDDQ MTIIKPLWHH RFKDREVIPG VVYTYSGSSE TAPDQEALKT LLATSDRDVV RTVAVGCMLG GQYAEEVCTG AGISKETPAT EANPIAIGAA LERLFTRVSE DRDPVVTSGG AWPIVLTGMT PISHHPTFSE ALEAIYPLVT RHEGPQKKAP IPREERIRLQ QEAALKSFDK KIVLNKAIVD LIYENYTLVT DVIKTLDAAS KTLSWQEIGS MLKESDNDVA RQIAGVHPAE AAVDLLLDGK KVLIHVHESI EVNLERYYAQ VKKFKKKRDG AVSAMERPVA KKATSKVHLT PLKKRWYHRF RWFFTSDNCL VLGGRDAGQN EELVKRYMEG GDTFVHADVH GASVVIVKGK TEQMDEVAQF AASYSGAWRS GHFSADVYAV RPDQVSKTPE AGEFVSRGSF IVRGERTYFK SVPLGVAIGY QTEPNAAVIG GPVNAVEAWT TQRVLLKPGP YEPNDIAKKV LRQLRDTIPE EDWKGLKTVL NTEQVAGYVP PGGSEIVGV
|
| |