Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_2070 |
Symbol | |
ID | 7271547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 2192413 |
End bp | 2194821 |
Gene Length | 2409 bp |
Protein Length | 802 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643570682 |
Product | Carbohydrate binding family 6 |
Protein accession | YP_002467092 |
Protein GI | 219852660 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.002169 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAACAC AACTTCAAAG ACTGATCAGG CCGCTGATCG TGCTGATCCT CGTCTCCAGT CTTTTGCTTA TCCCGCCGGT GACGGCAATG CACACAGAGA CGCTAACAGG TACCGTAGTT GATTCCTACA CCCATCAACC AATAAACCAT GTAGAGATCA CTACCTCACA AGTGCCCGGT CCTCATTTTT ATACTGATTC TAATGGGGTT TTCTCAATCG AAGTGGATAA TTCCCAGAGC GGGATTTCTT ATACCCTGGG CCTCACGCCT CCTGAGGCTG CTGATTACCA GTACCGTGAT AGGGAGATCA CCTATTCATT TAGTGCTGCA GAAATTTCAG CCAGATCAAA GGATCTTGGG CAGATCGAAC TCACCCGAAA GGTTTCAGGG TACGTCACTG ATTCTGTTAC GGGGCAGCCA TTATCGGGTG TGCATGTCTC TCTGGGTCCC TCGTCGGGTA CCTCGGATGG TAATGGATAT TATTATATTC GGTTGCAAGA TGCAGGCCTT AATCAAAATG GCCTCTCCTT CAGGGGTGTA TATTACTTGA TTGCGACCAA GGATGGATAT CTGTCTTCCT CCATTCCGTT TGGTATCTAT GATGGGGATG GAACTCATCT CTATAACATC CAGCTCACCC GGGCCTTCAC GGCGAACTTC TACGCCTTTG GCCACGTCGG CCAGGCTCCG TACTCAGTCC GGTTCATGGA TCAGTCCGTT GGCTCGCCGA CCGCCTGGAA GTGGGATTTC GGGGATAACA CGACCTCGAC CGAGCAGAAC CCGACGCATG TCTACAACCG GACCGGCGCC TATAACGTTG CCCTGACCGC CTCGAACGAC CAGGCGAGCG ACACCTGCAC CCAGTACCGA TGTGTCATCG TGAACGACGT GCCTGTGGCG AACTTCACGA CCAATGCGAC CTCCGGCCAG ACGCCGTTCT CGGTGCAGTT CACCGACCAG TCCACCAGCG GTGCGGACGG GTACCAGTGG CAGTTCGGCG ATGGTGCGAC CTCGACCGAA CAGAACCCGG TCCACACCTA TACGACCTCC GGGTCATACA CGGTGATGCT GACGGTCTCC GAGCCCAACT ATGGGAGTGT CTTCGTCCAG AAGCCGGGGT ACATCACGGT CGGCGACCCC TCGACGATCG GGTTCTCGGC GAACGTGACG ACCGGTCTCT CACCGCTTCC GGTGCAGTTC AACGAGTCGG TGAACGGATC TGTCCAGTAC TCGTTCTGGC ATTTCGGTGA CGGTTCGACC TCGATGGATT CCAACCCGGT CCACGTCTAT GATACGCCCG GCCAGTACAC GGTCTCGCTG ATGACCGTCG GCTCCAATGG AACTGAGACA AAGACGATTG AGGACTTCAT CAACGTCACC GCCCCAGTAA CCCCGACAGT AACTCCAACA TCGACGGTAA CACCGACACC GACCGGAGCC CTGCCGGTGG CGAACTTCAC GGTCATGGCC GGTGCCCCCG GCTCGCTGGC GATCCAGGTG ACCGACACCT CGACGAACGC AACCTCGGTC CGCTACGATC TCGGTGACGG TACGACTACC GCCTATCGGA ACTTCCAGTT CACCTACTGG CAGGCCGGGA CGTACACGAT CGAACAGACC GCGACCAATG CTGCGGGATC ATCGATGAAG ACGGTCGAGG TTATCGTGCC AGCGACCACC CCAACGCCGA CGATCACGTT CCCGCCTATC GGCGGGGATC AGGGATGGTA TACCGTCCAC TGTAATGTTG ACGGGGCGAC GGTCATCTTC GATCAGACCA CAATGGGTAC AATCGCACAG GGGATCCTGA AGATACCTGT CTATGTTACT GGATCACCCT ATGGAACCTA CACAGTCCAG AAAGACGGAT ACTCCACTGT CTCAGGGGTC ATCACAGAAC ATCCCGGCAA GGACCAGAAC GTGGATATAA CCGTGCACCT GACCCCGACC TCCACCCCGT ACAACGGTCC GCACACGATC CCCGGAACGC TGCAGGCCGA GGATTATAAC CTCGGCGGTG AGGGTGTCGC CTATCACGAC ACCACTGCCG GCAATGAGGG TGGTGCGTAC CGGCATGACG ACGTGGATAT CGAGCAGCTC GACACTGACG GCTCGCCGAA CGTCGGCTGG ATCCGTGCCG GCGAGTGGCT CGCGTACACG GTGAATGTCA CCACCGCCGG CACTTACAAC GCCGGATTCC GGGTTGCGTC TTCTCACAGT GGTTCAACCG TCCAGGTCTA CGTTGACGAC GGTACGACCC CGGTCGCGAC GGTGAGCGTC CCGAACACTG GCGACTGGCC GGCCTTCCAG ACCGTCTCGA TGCCGGTGAC CCTGCCGGCC GGTCAGCACC GACTCATGTT GAAGTTCCCG ACCGACTACG TCAACATCAA CTGGATCAGT TTCACCTGA
|
Protein sequence | MTTQLQRLIR PLIVLILVSS LLLIPPVTAM HTETLTGTVV DSYTHQPINH VEITTSQVPG PHFYTDSNGV FSIEVDNSQS GISYTLGLTP PEAADYQYRD REITYSFSAA EISARSKDLG QIELTRKVSG YVTDSVTGQP LSGVHVSLGP SSGTSDGNGY YYIRLQDAGL NQNGLSFRGV YYLIATKDGY LSSSIPFGIY DGDGTHLYNI QLTRAFTANF YAFGHVGQAP YSVRFMDQSV GSPTAWKWDF GDNTTSTEQN PTHVYNRTGA YNVALTASND QASDTCTQYR CVIVNDVPVA NFTTNATSGQ TPFSVQFTDQ STSGADGYQW QFGDGATSTE QNPVHTYTTS GSYTVMLTVS EPNYGSVFVQ KPGYITVGDP STIGFSANVT TGLSPLPVQF NESVNGSVQY SFWHFGDGST SMDSNPVHVY DTPGQYTVSL MTVGSNGTET KTIEDFINVT APVTPTVTPT STVTPTPTGA LPVANFTVMA GAPGSLAIQV TDTSTNATSV RYDLGDGTTT AYRNFQFTYW QAGTYTIEQT ATNAAGSSMK TVEVIVPATT PTPTITFPPI GGDQGWYTVH CNVDGATVIF DQTTMGTIAQ GILKIPVYVT GSPYGTYTVQ KDGYSTVSGV ITEHPGKDQN VDITVHLTPT STPYNGPHTI PGTLQAEDYN LGGEGVAYHD TTAGNEGGAY RHDDVDIEQL DTDGSPNVGW IRAGEWLAYT VNVTTAGTYN AGFRVASSHS GSTVQVYVDD GTTPVATVSV PNTGDWPAFQ TVSMPVTLPA GQHRLMLKFP TDYVNINWIS FT
|
| |