Gene Mpal_1920 details

Gene Information

Locus tag: Mpal_1920
Symbol:
ID: 7272737
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 2038165
End bp: 2040498
Gene Length: 2334 bp
Protein Length: 777 aa
Translation table: 11
GC content: 58%
IMG OID: 643570534
Product: Carbohydrate binding family 6
Protein accession: YP_002466947
Protein GI: 219852515
COG category: [R] General function prediction only
COG ID: [COG3291] FOG: PKD repeat
TIGRFAM ID:
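
The coordinate and length fields above can be cross-checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2038165, 2040498)
print(length)                  # 2334
print(protein_length(length))  # 777
```

Both values agree with the Gene Length and Protein Length fields of this record.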


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.208758
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGTTAA AAGTTCGAGA GATGAATCGG GCGCTGCTGA TCCTGCTGGC AGCGGTCTGT 
CTTCTGCTGT TGATCGTTCC AACTGGGGCG AGTACGCCTG TGTATGGTCC GGATCGGGTG
TTCATCAACG AGTCCGGCAC CTACGAACTC ATGAATGATC TCACCTACGG TTCCGGGGAA
CCGCTGATTG TGATCAATGC ATCCGATGTC GTCCTTGACG GCCAGGGCCA TACGATCCAT
GCGATGTATG ACCAGGAGGG TACCGGTATC GTTGTCAACC CGGAGGGTGA ATCCCCGACC
GTCCAGCTCT CCAACGTCAC CATCAAGAAT GTGAACCTGG ACAACTTGAC CTATGGGATC
GTCTTCTTTA ATTCCCGGGG TTGTACGGTA CAGGGTGTCG CCATGACCGG CAACGGAATG
GATATTTTTC TCTCCAATAA TTGTACGGAG AACACCATCT CGGGCAACAC CTTCAGCGAT
GGAATAGGTA TCTACCTTCT CGATTCATGT CGTTCAAACA CTATCAGTAA GAACACCTTC
ACCGGCTGCC TGCCTGGTAT CTACATGAGC GGAGACTGTG ACCAGAATAC GATCTGTGAG
AACACCTTCA CCGGCAACGT GTATGGTATC CTGATCTCAG ATTCGAACAA TAATATCGTG
AAAGGGAATT CCATCATCAA CAGTATGACC TGTGGGCTGG TCGTAGGTGG TGGATCTGGA
AATACCATCT ATAACAACTA CTTCAACAAC CTTCAGAATG TTTTCGTAGC AGGTCACGAC
AGCACCAACA TCTGGAACAT CGACCGGACA GCAGGTCCCG ACATCATGGG CGGTCCAACG
ATCGGTGGTA ACTTCTGGGG TACGCCGGAT GGGACCGGTT TCTCCCAGAC TGATCCGGAT
ATCAACAACG ACGGCTTCTG CGACGATCCG TATACCATCG GAACAGGCAA TATCGATCGT
TACCCGCTTC GTTTCTGGCC CCCGACAGTC ACCGCCGGCT TCTACGCCTT CGGCCATGTC
GGCCAGGCCC CGTATTCGGT CCGGTTCCTG GATCAGTCCG CCGGCTCGCC GACCGCCTGG
ACGTGGGACT TCGGGGACGG AAGTACTTCA ACCGAGCAGA ACCCGACCCA CATCTACAAC
CAGACCGGTG CCTACAACGT GGCCCTGACC GCATCGAACG GGTGGGGGAG CGATATGGCG
ATCCAGTACC GGTGTATCAT CGTCAACACG GTGCCGGCTG CCAACTTCAC GGCCAATGCG
ACCGCCGGCC GGACGCCGTT CACCGTGCAG TTCACTGATC AATCCACCGG AGCGAGCGGG
TACCAGTGGC AGTTCGGCGA CGGCACGACC TCGACCGAGC AGAACCCGGT CCACACCTAC
ACAACCCCGG GCTCCTACAC CGTGACGCTG GTCGCCTCTG GTGTGAATTA TGGCAGCGTC
TACTCGCAGA AGCCGGGGTA CATTACGGTC ACCGACCCGC CGACGGTCGG GTTCTCGGCG
AACGTGACGG CCGGCCTCGC CCCGCTCGCC GTGCAGTTCA ACGAGTCGGT GAACGGCTCG
GTCCAGTATT ATTACTGGCA GTTCGGGGAC GGGGGGACTT CGTTCGACAA GGAGCCGATC
CATGTTTATA ACAAGGCTGG CCTGTACACC GTCTCTTTCT ATGCGATCGG TTCGAACGGG
TCCCAGGTGA AGACAGTCGA TCAGTACATC AATGTCACCT CCCCGGTCAT GCCGACTCCG
ACAACACCGG CGCCGTTGAA CACCACGACT GTACCGACGA CGATGGTTCC GACTGCAACC
GTCACCCCGG TACCAACGAA CACCACTGGT GCTCCGACAG GTACGACAGT CGCTCCGACC
GCGACTGGCT CGGCATACTA CGGTCCGCAC ACGATCCCCG GGACGCTGCA GGCCGAGGAC
TATGACCTCG GCGGTGAGGG TGTCGCCTAC CATGACACCA CCGCCGGAAA CGAAGGCGGG
ATCTACCGTC ATGACGACGT CGACATCGAG CAGCTCGATA CCGATCACAG TCCGAATGTC
GGCTGGATCC GTTCCGGCGA GTGGCTCGCG TACACGGTGC ACGTCAGCAC CGCCGGCACC
TACGACGCCG GCTTCCGGGT GGCTTCATCC CACGCCGGTT CATCTGTCCT GGTGTATGTC
GACGATGAGA CGACGCCGGT CGCGACCGTG ACCGTCCCGA ACACCGGCGA CTGGCCGATC
TTCCGAACCG TCTCGGTGCC GGTCATCCTG CCGGCCGGCA CCCACCGGCT GAAGCTTTCG
TTCCCGACCG ACTTTGTCAA CATCAACTGG ATCACCTTTG CCCAGAGAGG CTGA
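
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be cleaned before any statistic such as the GC content can be computed. The following is a minimal sketch of that step. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first displayed row, so its result is not the whole-gene figure reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed row of the gene sequence (six blocks of 10 bases).
first_row = "ATGATGTTAA AAGTTCGAGA GATGAATCGG GCGCTGCTGA TCCTGCTGGC AGCGGTCTGT"
print(len(clean_sequence(first_row)))  # 60
print(round(gc_percent(first_row), 1))
```

Passing the full 2334 bp sequence to `gc_percent` would give the value to compare with the GC content field.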
 
Protein sequence
MMLKVREMNR ALLILLAAVC LLLLIVPTGA STPVYGPDRV FINESGTYEL MNDLTYGSGE 
PLIVINASDV VLDGQGHTIH AMYDQEGTGI VVNPEGESPT VQLSNVTIKN VNLDNLTYGI
VFFNSRGCTV QGVAMTGNGM DIFLSNNCTE NTISGNTFSD GIGIYLLDSC RSNTISKNTF
TGCLPGIYMS GDCDQNTICE NTFTGNVYGI LISDSNNNIV KGNSIINSMT CGLVVGGGSG
NTIYNNYFNN LQNVFVAGHD STNIWNIDRT AGPDIMGGPT IGGNFWGTPD GTGFSQTDPD
INNDGFCDDP YTIGTGNIDR YPLRFWPPTV TAGFYAFGHV GQAPYSVRFL DQSAGSPTAW
TWDFGDGSTS TEQNPTHIYN QTGAYNVALT ASNGWGSDMA IQYRCIIVNT VPAANFTANA
TAGRTPFTVQ FTDQSTGASG YQWQFGDGTT STEQNPVHTY TTPGSYTVTL VASGVNYGSV
YSQKPGYITV TDPPTVGFSA NVTAGLAPLA VQFNESVNGS VQYYYWQFGD GGTSFDKEPI
HVYNKAGLYT VSFYAIGSNG SQVKTVDQYI NVTSPVMPTP TTPAPLNTTT VPTTMVPTAT
VTPVPTNTTG APTGTTVAPT ATGSAYYGPH TIPGTLQAED YDLGGEGVAY HDTTAGNEGG
IYRHDDVDIE QLDTDHSPNV GWIRSGEWLA YTVHVSTAGT YDAGFRVASS HAGSSVLVYV
DDETTPVATV TVPNTGDWPI FRTVSVPVIL PAGTHRLKLS FPTDFVNINW ITFAQRG
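
The protein sequence can be compared against a translation of the gene sequence. The sketch below is written under stated assumptions: it builds the codon table by hand from the standard amino acid assignments, which translation table 11 shares with table 1 (the two differ in their permitted start codons, which does not matter here because the first codon is ATG). The `translate` function is illustrative and stops at the first stop codon.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spaces and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and the last displayed row of the gene sequence.
print(translate("ATGATGTTAA AAGTTCGAGA GATGAATCGG"))  # MMLKVREMNR
print(translate("TTCCCGACCG ACTTTGTCAA CATCAACTGG ATCACCTTTG CCCAGAGAGG CTGA"))  # FPTDFVNINWITFAQRG
```

The two outputs match the first 10 and last 17 residues of the protein sequence shown above. The last row can be translated on its own because the preceding 2280 bases are a multiple of three, so the reading frame is preserved.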