Gene Information |
Locus tag | Mpal_1764 |
Symbol | |
ID | 7270310 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 1835444 |
End bp | 1838320 |
Gene Length | 2877 bp |
Protein Length | 958 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643570380 |
Product | PKD domain containing protein |
Protein accession | YP_002466794 |
Protein GI | 219852362 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
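The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon (consistent with the gene sequence ending in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for Mpal_1764.
length = gene_length_bp(1835444, 1838320)
print(length)                     # compare with the Gene Length field
print(protein_length_aa(length))  # compare with the Protein Length field
```

Because the gene lies on the minus strand, the listed gene sequence would be the reverse complement of the replicon sequence over these coordinates; the length check itself does not depend on strand.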
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGATG GAAAGGAGAC CCGTTGTGGG GGCAGAGCCG GCTTTTTCGT TGCTCTTGGA TGTATCGTGC TGCTGCTTCT GCTGATCGGT CCGGTGACGG CGCAGACGGT GACGGTGGCC GCCAGTGACA GCAGCGCTGC ATCCAAAGCA GCTGCTAATT ATATCTGTGA TGGATATAAT GACCAGGTTG AGATCAACGC GGCTTTCAAT GCCCTACCTG GTGAAGTCGG GACGGTGCAA CTGACAGAGG GCACCTTTCA CTGTTCTGAT GTGATCTTTC CGACTGCTGG ATCTGAGTTG TTGGGGAGTG GGCAGGATAG TACGGTGATC GAGATGCTCA ACCCGCTCAA CTCCTATATC TCGATCAGTG TCAAATATTC CGGGATCACC CTCCGGGGCT TCACCCTTCG GGGGCAGGGG GCGGTGACGA TCCGTGCCAG TCAGGTGATC GTGCAGGATG TGACGGCGAC CAGTATCGGC CTTGATGGCA AGCGGTATTC GACCAAGAAT GACGGAATGT TCTATCTCTG GGGGGACGGG AGTACCATCG AGGATGTGAT CTTCATCAAC TGCAAGGCTG TCGACTCGGT GACTCATGGA TTCCATCTGA ATGCGATCAA CCTGCCAAAG GAGACCCGAA ATATACGCTT CATCAGTTGT CAGGCCATCC GCTGTGGATT CGGGGTGTCT GGCGGTTCCC CGTCCGAGTG GATCACCGGG TATGATCTGC AGGAGGACAA TGATCTCTAT GACGTCCAGC TGATCGACTG CCTTGCAGAG GATAACTGGG AGTCCGGGTT CCACTTTGAA CCGGGTGAGG ACAAGACCCC GCCCACGGTC AAGAAAGGAA TCATCATGAC CGATTGTGTC AGCAGAGACA ATGGCCAGCG GAACCCGCAG CAGTTCCCCT ACCATGACAC CTTCCTCTCA GGCTACTTTG TCCATTTCAA TGCAGTGCTG ACCAACTGTA TCTCCGAGAA CAATAAGAAT GCCGGCTACT TTGTCCAGGG TGGGAATAAT GTGGTCTTCA ACGGCTGTAC AGATACCGGT TCGACCTATG GCTGGAAGAT TGTGAAAGCC GCTTCCGATA TCACTCTGAA CAACTGCACG ACCAGTGACA ATCTGTTCTG GGGACTCTGG TCGGCCTTTG CAGATCATAT CGTGTTGAAT CATTTCTCAC AGAATAATAT CGGCGGTAAC CTGAATTACC AGTCGATGCT CGGATGGTAT TATGATGAGG AAGCCTACCA GTACCCGGTC ACCGATTCGT CCTTCAATAT CACGGCCTAT GGGAACAACA CCTCCCTGCC GATCATCAAT CAGGAGGGGC AGGGTAACAC CTATTCGCTC AGTTGGGGTT CGAATGATTC ACGGAATGTC ACCCCGACGG TGAACGTCAC GCCGACCCAG CCAGCGACTC ATTCCCCGGT GGCTCAGTTC CAGGCGACCC CTCTCACCGG GTCGGCACCG CTCTCTGTCT CATTCACGGA TCTCTCCACC GGGTCGCCTG CGAACTGGTC CTGGAACTTT GGGGACGGCA GTACCTCAAC CCTCCAGAAC CCCCAGCACC AGTATCTGGA AGACGGGATA GAGACGGTCA CGCTGACCGT TTCAAATTCA TTCGGCTCAA GTAATACCAC AAAGATTATC ACGGTTGGTG AAGGGGTCTA TGCCGGGTTC ACGGCCACCC CACGAACGAT CACCGCCGGT CAGACCATCC AGTTCACCGA TCACTCGTCT GGTGCCCCCT CGGGGTGGAT CTGGAACTTT GGTGACGGTA GTGTCTCCGC CTCCCAGAAT 
CCACAGCATC TGTACGGGGT CACTGGTGTC TATTCGGTCC AGCTGACCGC TTCCTTCCCT GGGAGCAGCG ATCATGAGAC CAAGACCGAT TATATCGTTG CCGACCCTGA TTTCAGTGTA GACGCCACCA CCGGTATTGC GCCGACGACC ATCAGGTTCA CCGATACCAC GCCGAATGGT CAAACGGCCT GGCAGTGGGA CTTTGGTGAT GGAACGACCT CAAAGGAGGA GAATCCAGTG CATCTCTACC AGAATGGGGG CAACTATACG GTCACCCTGA CCGCCACAAG CCCGTATGGC ACCACGTCCG TCACGAAGGA CGCGTACATC CATCTCGCAG GAAAACCGGT TGCAGCCTTC ACGTCCACAC CGCAGGTCGG TACCGTGCCG TTTGACGTCT CCTTCACCGA CATGTCGACC GGTGGAACAG CGGTCGGGTA TTACTGGCAG TTTGGTGATG GAAACTTCTC GACGCTGCAG AACCCGGTCC ATACATATAC CCATGAAGGG GTCTACACCG TCCAGATGAT CGTCTTCAAC CAGTATGGGG TATCGAACAC TTCGAAGATT GCCTGTGTGG TTGCAAACCT TCCGTCCCCG ATCGCGGACT TCTCTGCGGG ATCAACGGAG GGAACCGCTC CATTCCAGGT CCAGTTCTCG GACCGCTCGT TGAACCAGCC GTCCAACTGG TTCTGGCAGT TCGGTGACGG TGCGACCTCG ACAGCACAGA ATCCTGTGCA TAATTACACA ACCGCCGGCA CCTATACGGT GAGCCTGATG GTCAGAAACA GCGGAGGTGT GATGACCGTC ACCAAAACGA ATTATATTAC GGTTACATCA CCGGTGGTCA GTGGTGTGGT CCGTGTCTAT GGACAGACTG CGATGCCGAC AGACCCCAAT CACGACGGGC TGTATGAGGA TCTGAACGGG AACGGGGTGA TCGACTTCAA CGACGTGGTC CTCTTCTTCA ACCAGATGGA CTGGATCGCG GAAAATGAAC CGCTTGCAGC GTTTGATTTC AACCATAACG GACAGATTGA CTTCAACGAT ATCGTCCAGC TCTTCAACAT GCTTTGA
|
Protein sequence | MMDGKETRCG GRAGFFVALG CIVLLLLLIG PVTAQTVTVA ASDSSAASKA AANYICDGYN DQVEINAAFN ALPGEVGTVQ LTEGTFHCSD VIFPTAGSEL LGSGQDSTVI EMLNPLNSYI SISVKYSGIT LRGFTLRGQG AVTIRASQVI VQDVTATSIG LDGKRYSTKN DGMFYLWGDG STIEDVIFIN CKAVDSVTHG FHLNAINLPK ETRNIRFISC QAIRCGFGVS GGSPSEWITG YDLQEDNDLY DVQLIDCLAE DNWESGFHFE PGEDKTPPTV KKGIIMTDCV SRDNGQRNPQ QFPYHDTFLS GYFVHFNAVL TNCISENNKN AGYFVQGGNN VVFNGCTDTG STYGWKIVKA ASDITLNNCT TSDNLFWGLW SAFADHIVLN HFSQNNIGGN LNYQSMLGWY YDEEAYQYPV TDSSFNITAY GNNTSLPIIN QEGQGNTYSL SWGSNDSRNV TPTVNVTPTQ PATHSPVAQF QATPLTGSAP LSVSFTDLST GSPANWSWNF GDGSTSTLQN PQHQYLEDGI ETVTLTVSNS FGSSNTTKII TVGEGVYAGF TATPRTITAG QTIQFTDHSS GAPSGWIWNF GDGSVSASQN PQHLYGVTGV YSVQLTASFP GSSDHETKTD YIVADPDFSV DATTGIAPTT IRFTDTTPNG QTAWQWDFGD GTTSKEENPV HLYQNGGNYT VTLTATSPYG TTSVTKDAYI HLAGKPVAAF TSTPQVGTVP FDVSFTDMST GGTAVGYYWQ FGDGNFSTLQ NPVHTYTHEG VYTVQMIVFN QYGVSNTSKI ACVVANLPSP IADFSAGSTE GTAPFQVQFS DRSLNQPSNW FWQFGDGATS TAQNPVHNYT TAGTYTVSLM VRNSGGVMTV TKTNYITVTS PVVSGVVRVY GQTAMPTDPN HDGLYEDLNG NGVIDFNDVV LFFNQMDWIA ENEPLAAFDF NHNGQIDFND IVQLFNML