Gene Mpal_2066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_2066 
Symbol 
ID7271306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2182621 
End bp2185233 
Gene Length2613 bp 
Protein Length870 aa 
Translation table11 
GC content58% 
IMG OID643570678 
ProductCarbohydrate binding family 6 
Protein accessionYP_002467088 
Protein GI219852656 
COG category[R] General function prediction only 
COG ID[COG3291] FOG: PKD repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTTA TTACAGGATT AATGGGAGGG GTGCTGCTGG TGTTCGCGCT GGTCGCGGTC 
TGCTGCACCC CGGTATCTGC ATCGTCGCTG TTTGTTGGCG CTCCTTCCCC AACCATGGAT
GTACCAAACA CCGTGAATGA GTTCTCTGTA TCCGGTTCTA CCGCCCAGTA TCAGCAGAAC
CTCACCATTT CCCGATCAAA TTCATTCTTT AACATTAAAG GGATGGCTGT GGACGCTTCC
AACCGGACGC TCTTCGTCAC GAATAATCGA TATCATCCTA CAGGGTATAA TATCACAGTC
ATCAATGCTA CGACGATGCA ACCGGTCACG AACCTTTCAC TCTCCGCGAC ATTTCGTGCA
GGTAAGTGTG TGTTTGACCA GGGACATGAC CGTCTGTATG TCTATGATAC AAATGGGGGG
GATCTGGTTG GGTATACGTG GGACTCAGAC AATCTGACTC TGACGCCCGA CGGGATGTCG
ATCCCTCTGT CGGAGGGGTG TAATGGACTT GATATCGATC CGGACGAGTA TCTGCTCTAT
GTGATAACTT ATAATGGTAC GATTCAAAAA TACAGCTCCG CCTCCGGAGT TTTCCTGGGA
AATGTCAGCC TTCCGGTTGA TACAGGTGCA TACTCCCTTG CGGTCGATGA CGTGACCCAT
TGTCTCTATA TGGAGACGGT CTCTTCAGAT CAATGCACCT ACTCCCTGCT CAGACTGGAC
CCCTACGGCA TCCTTCCCAC CAGAAGTATC GTATATCAGC ATCTCGATCA ATACAGTTAT
CACTCCGATC TATTCCTGAA ACTGGCCGTG GATCCCGCAT CTCATTACCT CTATGCAGGG
ACGACCCGTA AAGTCTCAGA TGACGATGAC CCTCTGATGA TTCCTGAGCT CAGAATATAT
GACAGTGATC TGAACCTGCA GAACACGATC CCTGTTGGTC CCCCGGATAT CACTGCCACA
CGGGATGATC ACCTGCTGGT ACCAGGTTAT CTGGCTATCC TCTCTGATGG TATCGTTCCC
CCGACCCCGA CGCCGACATC AGCCATCACG GCGAACTTCT ACGCCTTCGG CCACGTCGGC
CAGGCTCCGT ATTCGGTCCG GTTCCTGGAT CAGTCGACCG GCTCGCCGAC CGCCTGGAAG
TGGGATTTCG GGGATAACAC GACATCGACC GAGCAGAACC CGACGCATGT CTACAACCGG
ACCGGCGCCT ATAACGTTGC CCTGACCGCC TCGAACGACC AGGCGAGCGA CACCTGCACC
CAGTACCGGT GTGTCATCGT GAATGCAGTG CCGGCAGCGA ACTTCACGGC CAATACGACC
GCCGGCAAAA CCCCCCTCAC CGTGCAGTTC ATCGACCAGT CCACCAGCGG TGCGGACGGG
TACCAGTGGC AGTTCGGCGA CGGGACGACC TCGACCGAGC AGAACCCGGT CCACACCTAT
ACGACCCCCG GGTCGTACTC GGTGATGCTG ACAGTCTCCG AGCCCGACTA CGGGAGTGTC
TTCGTCCAGA AGCCCGGGTA TATCACGGTC GGCGACCCCT CGACGATCGG GTTCTCGGCG
AACGTGACCG CCGGTCTCTC ACCGCTCGCC GTCCAGTTCA ACGAGTCGGT GAACGGATCG
GTCCAGTATT CGTTCTGGCA TTTCGGTGAC GGTTCGACCT CGATGGATTC CAACCCGGTC
CACGTCTATG ATACGCCCGG TCAGTACACT GTCTCGCTGA TGACCGTCGG CTCCAATGGA
ACCGAGACAA AGACGGTCGA GGACTACATC AACGTCACCA ACCCGGTGAC ACCGGCTCCC
ACCACGCCGG CACCGGTGAA CACCACCGTG ATTCCGACAC CGACTGAACA GAACTTGCCT
GTAGCGAACT TCACGGTCAC GGCCGGTGCC CCGGGCTCGC TGGCGATCCA GGTGATCGAC
ACCTCGGTGA ACGCGACCTC GGTCAGCTAC GACCTCGGTG ACGGCACCAC CACCTCCTAT
CCGACATTCC GGTACACCTA CTGGCAGGCC GGGACGTACA CGATCGAACA GACCGCGACC
AATGCCGTCG GCTCCTCGCA TAAGACCCTC ACGGTGACGG TGCCGGCTGC GGCGCCTACA
ACGACCCCGA CCGTGACTCC AACGACGACA ACAACGGTCT CACCGACCGT GACGGGTAAC
CCGTACAACG GTCCGCATAC GATCCCAGGA ACGCTGCAGG CCGAGGATTA CGACCTCGGT
GGTGAAGGCG TTGCCTACCA CGACACCACC CCTGGAAACG AGGGCGGCGT CTACCGGCAT
GACGACGTCG ATATCGAGCA GCTTGACACC GACGGGTCGC CGAACGTCGG CTGGATCCGT
GCCGGTGAAT GGCTCGGGTA CACCGTGAAC GTCAGCACGG CCGGCACCTA CAACGCCGGG
TTCCGTGTTG CTTCCTCTCA CTCCGGCTCA TCGATCCAGG TCTATGTCGA CAATGGTACG
ACCCCGGTCG CGACGGTGAA CGTCCCGAAC ACCGGTGACT GGCCGGTCTT CAGGACCGTT
TCGGTGCCGG TGACCCTGCC GGCCGGGCAG CACCGGCTGA GACTTTCGTT CCCGACCGAC
TACGTCAACA TCAACTGGAT CACCTTTGTC TGA
 
Protein sequence
MKVITGLMGG VLLVFALVAV CCTPVSASSL FVGAPSPTMD VPNTVNEFSV SGSTAQYQQN 
LTISRSNSFF NIKGMAVDAS NRTLFVTNNR YHPTGYNITV INATTMQPVT NLSLSATFRA
GKCVFDQGHD RLYVYDTNGG DLVGYTWDSD NLTLTPDGMS IPLSEGCNGL DIDPDEYLLY
VITYNGTIQK YSSASGVFLG NVSLPVDTGA YSLAVDDVTH CLYMETVSSD QCTYSLLRLD
PYGILPTRSI VYQHLDQYSY HSDLFLKLAV DPASHYLYAG TTRKVSDDDD PLMIPELRIY
DSDLNLQNTI PVGPPDITAT RDDHLLVPGY LAILSDGIVP PTPTPTSAIT ANFYAFGHVG
QAPYSVRFLD QSTGSPTAWK WDFGDNTTST EQNPTHVYNR TGAYNVALTA SNDQASDTCT
QYRCVIVNAV PAANFTANTT AGKTPLTVQF IDQSTSGADG YQWQFGDGTT STEQNPVHTY
TTPGSYSVML TVSEPDYGSV FVQKPGYITV GDPSTIGFSA NVTAGLSPLA VQFNESVNGS
VQYSFWHFGD GSTSMDSNPV HVYDTPGQYT VSLMTVGSNG TETKTVEDYI NVTNPVTPAP
TTPAPVNTTV IPTPTEQNLP VANFTVTAGA PGSLAIQVID TSVNATSVSY DLGDGTTTSY
PTFRYTYWQA GTYTIEQTAT NAVGSSHKTL TVTVPAAAPT TTPTVTPTTT TTVSPTVTGN
PYNGPHTIPG TLQAEDYDLG GEGVAYHDTT PGNEGGVYRH DDVDIEQLDT DGSPNVGWIR
AGEWLGYTVN VSTAGTYNAG FRVASSHSGS SIQVYVDNGT TPVATVNVPN TGDWPVFRTV
SVPVTLPAGQ HRLRLSFPTD YVNINWITFV