Gene Mpal_2070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_2070 
Symbol 
ID7271547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2192413 
End bp2194821 
Gene Length2409 bp 
Protein Length802 aa 
Translation table11 
GC content57% 
IMG OID643570682 
ProductCarbohydrate binding family 6 
Protein accessionYP_002467092 
Protein GI219852660 
COG category[R] General function prediction only 
COG ID[COG3291] FOG: PKD repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.002169 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACAC AACTTCAAAG ACTGATCAGG CCGCTGATCG TGCTGATCCT CGTCTCCAGT 
CTTTTGCTTA TCCCGCCGGT GACGGCAATG CACACAGAGA CGCTAACAGG TACCGTAGTT
GATTCCTACA CCCATCAACC AATAAACCAT GTAGAGATCA CTACCTCACA AGTGCCCGGT
CCTCATTTTT ATACTGATTC TAATGGGGTT TTCTCAATCG AAGTGGATAA TTCCCAGAGC
GGGATTTCTT ATACCCTGGG CCTCACGCCT CCTGAGGCTG CTGATTACCA GTACCGTGAT
AGGGAGATCA CCTATTCATT TAGTGCTGCA GAAATTTCAG CCAGATCAAA GGATCTTGGG
CAGATCGAAC TCACCCGAAA GGTTTCAGGG TACGTCACTG ATTCTGTTAC GGGGCAGCCA
TTATCGGGTG TGCATGTCTC TCTGGGTCCC TCGTCGGGTA CCTCGGATGG TAATGGATAT
TATTATATTC GGTTGCAAGA TGCAGGCCTT AATCAAAATG GCCTCTCCTT CAGGGGTGTA
TATTACTTGA TTGCGACCAA GGATGGATAT CTGTCTTCCT CCATTCCGTT TGGTATCTAT
GATGGGGATG GAACTCATCT CTATAACATC CAGCTCACCC GGGCCTTCAC GGCGAACTTC
TACGCCTTTG GCCACGTCGG CCAGGCTCCG TACTCAGTCC GGTTCATGGA TCAGTCCGTT
GGCTCGCCGA CCGCCTGGAA GTGGGATTTC GGGGATAACA CGACCTCGAC CGAGCAGAAC
CCGACGCATG TCTACAACCG GACCGGCGCC TATAACGTTG CCCTGACCGC CTCGAACGAC
CAGGCGAGCG ACACCTGCAC CCAGTACCGA TGTGTCATCG TGAACGACGT GCCTGTGGCG
AACTTCACGA CCAATGCGAC CTCCGGCCAG ACGCCGTTCT CGGTGCAGTT CACCGACCAG
TCCACCAGCG GTGCGGACGG GTACCAGTGG CAGTTCGGCG ATGGTGCGAC CTCGACCGAA
CAGAACCCGG TCCACACCTA TACGACCTCC GGGTCATACA CGGTGATGCT GACGGTCTCC
GAGCCCAACT ATGGGAGTGT CTTCGTCCAG AAGCCGGGGT ACATCACGGT CGGCGACCCC
TCGACGATCG GGTTCTCGGC GAACGTGACG ACCGGTCTCT CACCGCTTCC GGTGCAGTTC
AACGAGTCGG TGAACGGATC TGTCCAGTAC TCGTTCTGGC ATTTCGGTGA CGGTTCGACC
TCGATGGATT CCAACCCGGT CCACGTCTAT GATACGCCCG GCCAGTACAC GGTCTCGCTG
ATGACCGTCG GCTCCAATGG AACTGAGACA AAGACGATTG AGGACTTCAT CAACGTCACC
GCCCCAGTAA CCCCGACAGT AACTCCAACA TCGACGGTAA CACCGACACC GACCGGAGCC
CTGCCGGTGG CGAACTTCAC GGTCATGGCC GGTGCCCCCG GCTCGCTGGC GATCCAGGTG
ACCGACACCT CGACGAACGC AACCTCGGTC CGCTACGATC TCGGTGACGG TACGACTACC
GCCTATCGGA ACTTCCAGTT CACCTACTGG CAGGCCGGGA CGTACACGAT CGAACAGACC
GCGACCAATG CTGCGGGATC ATCGATGAAG ACGGTCGAGG TTATCGTGCC AGCGACCACC
CCAACGCCGA CGATCACGTT CCCGCCTATC GGCGGGGATC AGGGATGGTA TACCGTCCAC
TGTAATGTTG ACGGGGCGAC GGTCATCTTC GATCAGACCA CAATGGGTAC AATCGCACAG
GGGATCCTGA AGATACCTGT CTATGTTACT GGATCACCCT ATGGAACCTA CACAGTCCAG
AAAGACGGAT ACTCCACTGT CTCAGGGGTC ATCACAGAAC ATCCCGGCAA GGACCAGAAC
GTGGATATAA CCGTGCACCT GACCCCGACC TCCACCCCGT ACAACGGTCC GCACACGATC
CCCGGAACGC TGCAGGCCGA GGATTATAAC CTCGGCGGTG AGGGTGTCGC CTATCACGAC
ACCACTGCCG GCAATGAGGG TGGTGCGTAC CGGCATGACG ACGTGGATAT CGAGCAGCTC
GACACTGACG GCTCGCCGAA CGTCGGCTGG ATCCGTGCCG GCGAGTGGCT CGCGTACACG
GTGAATGTCA CCACCGCCGG CACTTACAAC GCCGGATTCC GGGTTGCGTC TTCTCACAGT
GGTTCAACCG TCCAGGTCTA CGTTGACGAC GGTACGACCC CGGTCGCGAC GGTGAGCGTC
CCGAACACTG GCGACTGGCC GGCCTTCCAG ACCGTCTCGA TGCCGGTGAC CCTGCCGGCC
GGTCAGCACC GACTCATGTT GAAGTTCCCG ACCGACTACG TCAACATCAA CTGGATCAGT
TTCACCTGA
 
Protein sequence
MTTQLQRLIR PLIVLILVSS LLLIPPVTAM HTETLTGTVV DSYTHQPINH VEITTSQVPG 
PHFYTDSNGV FSIEVDNSQS GISYTLGLTP PEAADYQYRD REITYSFSAA EISARSKDLG
QIELTRKVSG YVTDSVTGQP LSGVHVSLGP SSGTSDGNGY YYIRLQDAGL NQNGLSFRGV
YYLIATKDGY LSSSIPFGIY DGDGTHLYNI QLTRAFTANF YAFGHVGQAP YSVRFMDQSV
GSPTAWKWDF GDNTTSTEQN PTHVYNRTGA YNVALTASND QASDTCTQYR CVIVNDVPVA
NFTTNATSGQ TPFSVQFTDQ STSGADGYQW QFGDGATSTE QNPVHTYTTS GSYTVMLTVS
EPNYGSVFVQ KPGYITVGDP STIGFSANVT TGLSPLPVQF NESVNGSVQY SFWHFGDGST
SMDSNPVHVY DTPGQYTVSL MTVGSNGTET KTIEDFINVT APVTPTVTPT STVTPTPTGA
LPVANFTVMA GAPGSLAIQV TDTSTNATSV RYDLGDGTTT AYRNFQFTYW QAGTYTIEQT
ATNAAGSSMK TVEVIVPATT PTPTITFPPI GGDQGWYTVH CNVDGATVIF DQTTMGTIAQ
GILKIPVYVT GSPYGTYTVQ KDGYSTVSGV ITEHPGKDQN VDITVHLTPT STPYNGPHTI
PGTLQAEDYN LGGEGVAYHD TTAGNEGGAY RHDDVDIEQL DTDGSPNVGW IRAGEWLAYT
VNVTTAGTYN AGFRVASSHS GSTVQVYVDD GTTPVATVSV PNTGDWPAFQ TVSMPVTLPA
GQHRLMLKFP TDYVNINWIS FT