Gene Mpal_1764 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1764 
Symbol 
ID7270310 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp1835444 
End bp1838320 
Gene Length2877 bp 
Protein Length958 aa 
Translation table11 
GC content55% 
IMG OID643570380 
ProductPKD domain containing protein 
Protein accessionYP_002466794 
Protein GI219852362 
COG category[R] General function prediction only 
COG ID[COG3291] FOG: PKD repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGATG GAAAGGAGAC CCGTTGTGGG GGCAGAGCCG GCTTTTTCGT TGCTCTTGGA 
TGTATCGTGC TGCTGCTTCT GCTGATCGGT CCGGTGACGG CGCAGACGGT GACGGTGGCC
GCCAGTGACA GCAGCGCTGC ATCCAAAGCA GCTGCTAATT ATATCTGTGA TGGATATAAT
GACCAGGTTG AGATCAACGC GGCTTTCAAT GCCCTACCTG GTGAAGTCGG GACGGTGCAA
CTGACAGAGG GCACCTTTCA CTGTTCTGAT GTGATCTTTC CGACTGCTGG ATCTGAGTTG
TTGGGGAGTG GGCAGGATAG TACGGTGATC GAGATGCTCA ACCCGCTCAA CTCCTATATC
TCGATCAGTG TCAAATATTC CGGGATCACC CTCCGGGGCT TCACCCTTCG GGGGCAGGGG
GCGGTGACGA TCCGTGCCAG TCAGGTGATC GTGCAGGATG TGACGGCGAC CAGTATCGGC
CTTGATGGCA AGCGGTATTC GACCAAGAAT GACGGAATGT TCTATCTCTG GGGGGACGGG
AGTACCATCG AGGATGTGAT CTTCATCAAC TGCAAGGCTG TCGACTCGGT GACTCATGGA
TTCCATCTGA ATGCGATCAA CCTGCCAAAG GAGACCCGAA ATATACGCTT CATCAGTTGT
CAGGCCATCC GCTGTGGATT CGGGGTGTCT GGCGGTTCCC CGTCCGAGTG GATCACCGGG
TATGATCTGC AGGAGGACAA TGATCTCTAT GACGTCCAGC TGATCGACTG CCTTGCAGAG
GATAACTGGG AGTCCGGGTT CCACTTTGAA CCGGGTGAGG ACAAGACCCC GCCCACGGTC
AAGAAAGGAA TCATCATGAC CGATTGTGTC AGCAGAGACA ATGGCCAGCG GAACCCGCAG
CAGTTCCCCT ACCATGACAC CTTCCTCTCA GGCTACTTTG TCCATTTCAA TGCAGTGCTG
ACCAACTGTA TCTCCGAGAA CAATAAGAAT GCCGGCTACT TTGTCCAGGG TGGGAATAAT
GTGGTCTTCA ACGGCTGTAC AGATACCGGT TCGACCTATG GCTGGAAGAT TGTGAAAGCC
GCTTCCGATA TCACTCTGAA CAACTGCACG ACCAGTGACA ATCTGTTCTG GGGACTCTGG
TCGGCCTTTG CAGATCATAT CGTGTTGAAT CATTTCTCAC AGAATAATAT CGGCGGTAAC
CTGAATTACC AGTCGATGCT CGGATGGTAT TATGATGAGG AAGCCTACCA GTACCCGGTC
ACCGATTCGT CCTTCAATAT CACGGCCTAT GGGAACAACA CCTCCCTGCC GATCATCAAT
CAGGAGGGGC AGGGTAACAC CTATTCGCTC AGTTGGGGTT CGAATGATTC ACGGAATGTC
ACCCCGACGG TGAACGTCAC GCCGACCCAG CCAGCGACTC ATTCCCCGGT GGCTCAGTTC
CAGGCGACCC CTCTCACCGG GTCGGCACCG CTCTCTGTCT CATTCACGGA TCTCTCCACC
GGGTCGCCTG CGAACTGGTC CTGGAACTTT GGGGACGGCA GTACCTCAAC CCTCCAGAAC
CCCCAGCACC AGTATCTGGA AGACGGGATA GAGACGGTCA CGCTGACCGT TTCAAATTCA
TTCGGCTCAA GTAATACCAC AAAGATTATC ACGGTTGGTG AAGGGGTCTA TGCCGGGTTC
ACGGCCACCC CACGAACGAT CACCGCCGGT CAGACCATCC AGTTCACCGA TCACTCGTCT
GGTGCCCCCT CGGGGTGGAT CTGGAACTTT GGTGACGGTA GTGTCTCCGC CTCCCAGAAT
CCACAGCATC TGTACGGGGT CACTGGTGTC TATTCGGTCC AGCTGACCGC TTCCTTCCCT
GGGAGCAGCG ATCATGAGAC CAAGACCGAT TATATCGTTG CCGACCCTGA TTTCAGTGTA
GACGCCACCA CCGGTATTGC GCCGACGACC ATCAGGTTCA CCGATACCAC GCCGAATGGT
CAAACGGCCT GGCAGTGGGA CTTTGGTGAT GGAACGACCT CAAAGGAGGA GAATCCAGTG
CATCTCTACC AGAATGGGGG CAACTATACG GTCACCCTGA CCGCCACAAG CCCGTATGGC
ACCACGTCCG TCACGAAGGA CGCGTACATC CATCTCGCAG GAAAACCGGT TGCAGCCTTC
ACGTCCACAC CGCAGGTCGG TACCGTGCCG TTTGACGTCT CCTTCACCGA CATGTCGACC
GGTGGAACAG CGGTCGGGTA TTACTGGCAG TTTGGTGATG GAAACTTCTC GACGCTGCAG
AACCCGGTCC ATACATATAC CCATGAAGGG GTCTACACCG TCCAGATGAT CGTCTTCAAC
CAGTATGGGG TATCGAACAC TTCGAAGATT GCCTGTGTGG TTGCAAACCT TCCGTCCCCG
ATCGCGGACT TCTCTGCGGG ATCAACGGAG GGAACCGCTC CATTCCAGGT CCAGTTCTCG
GACCGCTCGT TGAACCAGCC GTCCAACTGG TTCTGGCAGT TCGGTGACGG TGCGACCTCG
ACAGCACAGA ATCCTGTGCA TAATTACACA ACCGCCGGCA CCTATACGGT GAGCCTGATG
GTCAGAAACA GCGGAGGTGT GATGACCGTC ACCAAAACGA ATTATATTAC GGTTACATCA
CCGGTGGTCA GTGGTGTGGT CCGTGTCTAT GGACAGACTG CGATGCCGAC AGACCCCAAT
CACGACGGGC TGTATGAGGA TCTGAACGGG AACGGGGTGA TCGACTTCAA CGACGTGGTC
CTCTTCTTCA ACCAGATGGA CTGGATCGCG GAAAATGAAC CGCTTGCAGC GTTTGATTTC
AACCATAACG GACAGATTGA CTTCAACGAT ATCGTCCAGC TCTTCAACAT GCTTTGA
 
Protein sequence
MMDGKETRCG GRAGFFVALG CIVLLLLLIG PVTAQTVTVA ASDSSAASKA AANYICDGYN 
DQVEINAAFN ALPGEVGTVQ LTEGTFHCSD VIFPTAGSEL LGSGQDSTVI EMLNPLNSYI
SISVKYSGIT LRGFTLRGQG AVTIRASQVI VQDVTATSIG LDGKRYSTKN DGMFYLWGDG
STIEDVIFIN CKAVDSVTHG FHLNAINLPK ETRNIRFISC QAIRCGFGVS GGSPSEWITG
YDLQEDNDLY DVQLIDCLAE DNWESGFHFE PGEDKTPPTV KKGIIMTDCV SRDNGQRNPQ
QFPYHDTFLS GYFVHFNAVL TNCISENNKN AGYFVQGGNN VVFNGCTDTG STYGWKIVKA
ASDITLNNCT TSDNLFWGLW SAFADHIVLN HFSQNNIGGN LNYQSMLGWY YDEEAYQYPV
TDSSFNITAY GNNTSLPIIN QEGQGNTYSL SWGSNDSRNV TPTVNVTPTQ PATHSPVAQF
QATPLTGSAP LSVSFTDLST GSPANWSWNF GDGSTSTLQN PQHQYLEDGI ETVTLTVSNS
FGSSNTTKII TVGEGVYAGF TATPRTITAG QTIQFTDHSS GAPSGWIWNF GDGSVSASQN
PQHLYGVTGV YSVQLTASFP GSSDHETKTD YIVADPDFSV DATTGIAPTT IRFTDTTPNG
QTAWQWDFGD GTTSKEENPV HLYQNGGNYT VTLTATSPYG TTSVTKDAYI HLAGKPVAAF
TSTPQVGTVP FDVSFTDMST GGTAVGYYWQ FGDGNFSTLQ NPVHTYTHEG VYTVQMIVFN
QYGVSNTSKI ACVVANLPSP IADFSAGSTE GTAPFQVQFS DRSLNQPSNW FWQFGDGATS
TAQNPVHNYT TAGTYTVSLM VRNSGGVMTV TKTNYITVTS PVVSGVVRVY GQTAMPTDPN
HDGLYEDLNG NGVIDFNDVV LFFNQMDWIA ENEPLAAFDF NHNGQIDFND IVQLFNML