Gene Mpal_0404 details


Gene Information

Locus tag: Mpal_0404
Symbol:
ID: 7271430
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 420440
End bp: 421846
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 62%
IMG OID: 643569049
Product: Carbohydrate binding family 6
Protein accession: YP_002465501
Protein GI: 219851069
COG category: [R] General function prediction only
COG ID: [COG3291] FOG: PKD repeat
TIGRFAM ID:
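
The start, end, and length fields above can be cross-checked against each other. The sketch below assumes the coordinates are inclusive and 1-based, and that the final codon of the CDS is a stop codon; neither convention is stated in the record itself, although both are consistent with its values. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 420440
END_BP = 421846
GENE_LENGTH_BP = 1407
PROTEIN_LENGTH_AA = 468


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming inclusive 1-based coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons print True for the values in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```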


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.813596
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATCCAC TCCGCTCTAT CCCGGTCTGC CTGCTGGTAA CTGTTCTCTT TCTCCTGCTT 
CTGGTCGCCC CGGTGGTCGG GACGGTCAGG ATCGTTCCGA TCGGCGACTC GATCACCCAG
GGAACCCTCC AGAACGACGA CGGACTCTCC CATCCGACCT ACCGGTACTG GCTCTGGCAG
ACTCTGAAGA GCAGAGGCTA CGACGTCGAC TTTGTCGGCA GTGTCGACCA GCCGCAACTC
CCGTACTCCT TCGACCAGGA TAACGAGGGG CATGACGGTT ACCGGACCCG CGATCTTCTC
GGCACCGATC GGCTGAAGAC CTGGCTCACG GGATACAGCC CCGACATCGC TATCGTCCAC
CTCGGGTCCA ATGATGCGAT GGACGGGGTG CCGGTGGAGA CGACGATCGG AAACCTCAAA
CAGATCGTCA CGATCCTCCG TGCGAAGAAC CCCTCGATCG TGATCCTGAT CGGGACGGTG
ATCCCGCGGG GCGGGTACGG TTCGAACCTG CCGGCCCTGA ACAGCGCCAT CCCGGGGATC
GCCGCCTCGA TGAGCACCCC CTCCTCCCCG ATCGTGATCG TCGACCAGTT CACCGGCTAC
GACGGGTATG ACGACAACCA GCCGAAACGG TACCTGCATC CGAACCAGCA GGGCGAGCAG
AAGATCGCAG CCAGGTATGC CGCGGCTCTG GCCCCGTACC TCGAGAGCCC CGGCCCGTAC
ACTGATCTCT CCATCCCCGG GCGGGTCGAG GCCGAGGCCT ATGATAAAGG AGGCGAAGGT
GTTGCCTACC ATGACACCAC CACCGGCAAC CAGGGGGGTG CCTACCGGCA GGATGATGTG
GACATCGAGT CCGGTGCCAG CGGCTATGAT CTCTGCTTCA TCCGCGATGG AGAGTGGGTG
AACTATACGG TGAATGTGGC CGCCGCCGGG GAGTATACGG CAACCTTCCG GACCGCGGCC
TGGAACGACG GGCATACGAT CACCCTCTCT GTCGACGGCA CCTCGGTCGG ATCGGCCGGA
CTGGCGAACA CCGGTTCCTC TGAAGCATAT GTCGACACCA CCATGTCGGT CTCATTGTCG
GCCGGCACCC ACACGCTGAC CGTTCAATTC TCCGGTGACG GCGAGAACCT GGACTATCTC
TCCTTCGCCG CGGTGCCGGT GACGACCCCG ACGTCGAGAC CGACGCTGCA GCCGATCCCG
CCGTCGACGC TGCAGCCGAC AGACCCTGAT CAGGACGGGC TGTACGAGGA TCTCAACGGC
AACGGGGCAC TCGATTTCAA TGACGTCGTC CTCTTCTTCG ACCGGATGGA CTGGATCGCA
GGCTATGAAC CCGTTGAAGT CTTCGACTTC GACCGGAACG GTCGGATCGA TTTCAATGAT
ATCGTCAGGG TCTTCTCTCT CCTCTGA
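
A GC content figure such as the one reported in the Gene Information fields can be computed from a sequence laid out like the one above. This is a minimal sketch: it strips the 10-base grouping and line breaks, then takes the fraction of G and C bases. How the database itself rounds or computes its value is not stated in the record, and the helper names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Drop whitespace (the 10-base grouping and line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as sample input.
sample = "ATGCATCCAC TCCGCTCTAT CCCGGTCTGC CTGCTGGTAA CTGTTCTCTT TCTCCTGCTT"
print(f"{gc_content(sample):.0%}")
```

Passing the full gene sequence instead of `sample` gives the value for the whole gene.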
 
Protein sequence
MHPLRSIPVC LLVTVLFLLL LVAPVVGTVR IVPIGDSITQ GTLQNDDGLS HPTYRYWLWQ 
TLKSRGYDVD FVGSVDQPQL PYSFDQDNEG HDGYRTRDLL GTDRLKTWLT GYSPDIAIVH
LGSNDAMDGV PVETTIGNLK QIVTILRAKN PSIVILIGTV IPRGGYGSNL PALNSAIPGI
AASMSTPSSP IVIVDQFTGY DGYDDNQPKR YLHPNQQGEQ KIAARYAAAL APYLESPGPY
TDLSIPGRVE AEAYDKGGEG VAYHDTTTGN QGGAYRQDDV DIESGASGYD LCFIRDGEWV
NYTVNVAAAG EYTATFRTAA WNDGHTITLS VDGTSVGSAG LANTGSSEAY VDTTMSVSLS
AGTHTLTVQF SGDGENLDYL SFAAVPVTTP TSRPTLQPIP PSTLQPTDPD QDGLYEDLNG
NGALDFNDVV LFFDRMDWIA GYEPVEVFDF DRNGRIDFND IVRVFSLL