Gene Mpal_1566 details

Gene Information

Locus tag: Mpal_1566
Symbol:
ID: 7271111
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 1613824
End bp: 1615713
Gene Length: 1890 bp
Protein Length: 629 aa
Translation table: 11
GC content: 61%
IMG OID: 643570180
Product: protein of unknown function DUF814
Protein accession: YP_002466602
Protein GI: 219852170
COG category: [K] Transcription
COG ID: [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP
TIGRFAM ID:
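
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; both are assumptions about this record's conventions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1613824
end_bp = 1615713

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1890, matching "Gene Length"
print(protein_length)  # 629, matching "Protein Length"
```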


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.853541
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCGACAA CACAGGGGAT GAGCGGCGTC GACCTGCTCG CAGTAACAGC GGAACTGCGC 
GAGCATCTGC CGCTCTGGAT CAACAAGATC TACCAGTATG ACAACAAGAT GCTGAGCATC
AGGCTCAATG GCGAGGAGCA TGCAAAGTAT CACCTGCTCA TCGAGTCAGG ACGGCGAATC
CATCTCGCAA CGGTCCTGCC GAATCCACCC AAGAACCCAC CGTCCTTTGC AATGCTGCTC
CGGAAGTACC TCGAAGGGGG GAGGGTTCTT GAGATCCGAC AGCAGGGGCT CCAGCGGGTC
GTGACCTTTG TGATCGGCAA GCGGGACACG ACGCTGCACC TGGTGATCGA ACTCTTCGAT
GAAGGGAACG TCATCCTCTG TGACGATCAG ATGACGATCA TCAAGCCGCT CTGGCATCAC
CGGTTCAAGG ACCGGGAGGT GATTCCGGGG GTCGTCTACA CCTACTCGGG CAGCAGTGAG
ACGGCTCCGG ACCAGGAGGC ACTGAAGACG TTACTCGCCA CATCCGATCG GGATGTGGTC
AGGACCGTAG CCGTCGGGTG TATGCTCGGC GGGCAGTACG CCGAGGAGGT CTGCACCGGT
GCCGGGATCA GTAAGGAGAC CCCGGCCACC GAAGCCAATC CGATCGCCAT TGGGGCGGCG
CTGGAGAGGC TCTTCACCCG GGTCAGCGAA GATCGTGACC CGGTGGTCAC CAGCGGCGGG
GCCTGGCCGA TCGTGCTGAC TGGAATGACT CCAATCAGCC ACCACCCCAC CTTCTCCGAG
GCGCTCGAAG CGATCTATCC CCTGGTGACC AGGCACGAGG GGCCGCAGAA GAAGGCACCG
ATCCCGCGGG AGGAACGGAT CCGGCTTCAG CAGGAGGCGG CGCTCAAATC GTTCGATAAG
AAGATCGTTC TGAACAAGGC GATCGTCGAC CTGATCTACG AGAACTATAC GCTGGTCACC
GATGTGATCA AAACTCTGGA TGCGGCCAGT AAAACCCTCT CCTGGCAGGA GATCGGATCG
ATGCTCAAGG AGAGCGACAA CGATGTGGCC CGACAGATCG CCGGCGTCCA TCCAGCTGAG
GCAGCGGTGG ACCTCCTCCT CGATGGGAAG AAGGTACTGA TCCATGTGCA TGAGAGCATC
GAGGTGAACC TCGAACGCTA CTATGCGCAG GTCAAGAAGT TCAAGAAGAA GCGGGACGGG
GCTGTGTCCG CGATGGAGCG GCCGGTGGCA AAGAAAGCCA CGAGCAAGGT CCACCTGACC
CCGCTGAAGA AGCGGTGGTA TCACCGGTTC CGCTGGTTCT TCACCAGTGA TAACTGTCTG
GTGCTCGGAG GCAGGGACGC CGGCCAGAAC GAGGAACTGG TGAAGCGGTA CATGGAAGGG
GGCGACACCT TCGTCCATGC CGACGTCCAT GGGGCCAGTG TGGTGATCGT CAAGGGGAAG
ACCGAACAGA TGGACGAGGT GGCCCAGTTC GCCGCCTCGT ACTCAGGTGC ATGGCGGAGC
GGCCACTTCT CTGCCGACGT CTACGCGGTC CGCCCCGACC AGGTCAGCAA GACCCCGGAG
GCCGGCGAGT TCGTCTCCCG CGGGTCGTTC ATCGTCAGAG GCGAACGGAC GTACTTCAAG
AGCGTTCCGC TCGGGGTGGC CATCGGTTAC CAGACCGAGC CGAACGCGGC GGTGATTGGG
GGGCCGGTGA ATGCGGTCGA AGCCTGGACA ACGCAGCGGG TGCTGCTGAA GCCGGGCCCG
TACGAACCGA ACGATATCGC AAAGAAGGTG CTGCGGCAAC TTCGTGACAC GATCCCAGAA
GAGGACTGGA AAGGGTTGAA GACGGTGTTG AACACCGAGC AGGTCGCCGG CTATGTTCCG
CCCGGCGGTT CAGAGATCGT CGGGGTATGA
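
The GC content reported for this gene can in principle be recomputed from the gene sequence. This is a minimal sketch, assuming the sequence is pasted as plain text in the space-separated 10-base groups used here; `gc_content` is a hypothetical helper name and not part of any database tool.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence.

    Whitespace (spaces and line breaks between 10-base groups) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence, used only as sample input.
first_row = "ATGGCGACAA CACAGGGGAT GAGCGGCGTC GACCTGCTCG CAGTAACAGC GGAACTGCGC"
print(f"GC fraction of first row: {gc_content(first_row):.2f}")
```

Passing the full gene sequence instead of `first_row` gives a value that can be compared with the GC content field.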
 
Protein sequence
MATTQGMSGV DLLAVTAELR EHLPLWINKI YQYDNKMLSI RLNGEEHAKY HLLIESGRRI 
HLATVLPNPP KNPPSFAMLL RKYLEGGRVL EIRQQGLQRV VTFVIGKRDT TLHLVIELFD
EGNVILCDDQ MTIIKPLWHH RFKDREVIPG VVYTYSGSSE TAPDQEALKT LLATSDRDVV
RTVAVGCMLG GQYAEEVCTG AGISKETPAT EANPIAIGAA LERLFTRVSE DRDPVVTSGG
AWPIVLTGMT PISHHPTFSE ALEAIYPLVT RHEGPQKKAP IPREERIRLQ QEAALKSFDK
KIVLNKAIVD LIYENYTLVT DVIKTLDAAS KTLSWQEIGS MLKESDNDVA RQIAGVHPAE
AAVDLLLDGK KVLIHVHESI EVNLERYYAQ VKKFKKKRDG AVSAMERPVA KKATSKVHLT
PLKKRWYHRF RWFFTSDNCL VLGGRDAGQN EELVKRYMEG GDTFVHADVH GASVVIVKGK
TEQMDEVAQF AASYSGAWRS GHFSADVYAV RPDQVSKTPE AGEFVSRGSF IVRGERTYFK
SVPLGVAIGY QTEPNAAVIG GPVNAVEAWT TQRVLLKPGP YEPNDIAKKV LRQLRDTIPE
EDWKGLKTVL NTEQVAGYVP PGGSEIVGV
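
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand; translation table 11 uses the same amino-acid assignments as the standard genetic code and differs only in the permitted start codons. The names `CODON_TABLE` and `translate` are illustrative, and the example input is only the first 30 bases of the gene.

```python
# Codon table in the conventional T, C, A, G ordering.
# The amino-acid assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    residues = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base groups of the gene sequence.
print(translate("ATGGCGACAA CACAGGGGAT GAGCGGCGTC"))  # MATTQGMSGV
```

The output matches the first group of the protein sequence; translating the full gene sequence and removing the spaces from the protein listing allows a complete comparison.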