Gene Mpal_1960 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1960 
Symbol 
ID7270764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2077535 
End bp2079895 
Gene Length2361 bp 
Protein Length786 aa 
Translation table11 
GC content56% 
IMG OID643570573 
ProductCarbohydrate binding family 6 
Protein accessionYP_002466986 
Protein GI219852554 
COG category[R] General function prediction only 
COG ID[COG3291] FOG: PKD repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0676284 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0447838 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGTTTC AAGCAGTTCG GATGAGATTA TTCAATAGAA TGCGAATATT TTTTGTAATA 
TGTGTCCTGC TGCTTCTGTT ATCAGGAGTC GGAGTTGTCG GGGCATTAAA CCCACTTCCT
TATTCCCTGC AGTGGAACGG GACTGAAATC GGAAGTCTAG GCGGTGGGTT ACATGCGCCA
GGTTCTGTTG CAGTGGATGA CGCCGGTTCT GTTTACGTCT CCGACTCCGC GAACCATCAG
GTTCAGAAGT TCACATCTGA TGGAATATTC GTCAAGAATT GGAGCAACCC TGTTGGGAAT
ACTGGGGATT TCTGGTATCC GGTTGGAATT GCAGTGGACC CGACAGGTGC CAACGTCTAT
GTGTCTGATT TAGAAAGCAA CCGGATTCTG AAATTCACGT CGAACGGTGA CTTCGTTGAT
GAATGGAACA CCTCTGGTGG GGATCCCTAT GGTGTTGCTG TGGACCGTAG TGGCCATGTC
TTCGTGGCCG AAGTCTATAA TTATGAAGGT GGATTGGGCC AGGTGCAGGA ATTCGCCCCA
TCTGGGGATC TCATCAACAA TTGGAGTCCT GGTGGAAAGA AAAACCCGCT GGGGGTCGCT
GTAGACAGCA ATAATAATGT CTATATCTCA TACTTCAACT CTAACCAGAT CAGAAAGTTT
ACCTCAAACG GTACGTGGCT TGCCACCTGG GGCAGGGCTG ATGGAGTTCA TGGTTCGAAT
GCAACAGAGT TTTGGGCCCC CACTGATATC GGTGTGGATG GCGATGATAA CGTCTACGTC
GCCGATACGG ACAACAACCG GATCCAGATC TTCAACGCGG CCGGTCTTCC CCTTGGGAGT
CTGGGGAACA CGGCTCCCCA TTCAGGCCAG GGATCGGGAG CATTCAATCG CCCTTCTGAT
GTTGCCGTGG ACTCCACAGG CACTGTATTC GCGGCCGATA CCGGTAACAA CAGAATCCAG
AGGTTCGGGG TGGTCAGGGC GCCGACACCA ACCATATCCG CGGGCTTTTA TGCCATCGGC
CATATCGGCC AGGCCCCGTA CCCTGTTCGG TTTCTGGATC AGTCCGTCGG TTCGCCGACC
GCCTGGCACT GGGACTTCGG TGATGGTTCA ACCTCGACCG AGCAGAGCCC GACCCATATC
TACAACAACA CGGGTGCCTA CAATGTGGCG CTGACCGCTT CGAACGATCT GGCAAGCGAC
ACCGCGATTC AATATCGGTG CATCATCGTC AACACCGTGC CGGTGGCTAA CTTCACATCC
AATGCGACGG CCGGTCAGAC GCCGTTCACG GTGCAGTTCA CCGACCAGTC CTCTGATGCG
AGTGGGTACC AGTGGCAGTT CGGCGACGGT ACGACCTCGG CCGACAAAAA CCCGGTCCAT
ACCTATTCGC ACCCGGGTAC CTACTCTGTA ACGCTCACCA TCACCAGTGG AGATTATGGG
AGCGTCTTCA CGGAGAAGTC CGGGTACATC ACGGTGACCG ATCCGCCGAC GGTCGGGTTC
TCTGCGAATG TGACGGCCGG CCTCTTCCCG CTCGCCGTGC AGTTCAACGA GTCGATCACT
GGCTCGGTCC AGTATTATTA CTGGCAGTTC GGTGACGGTG CGACTTCGTT CGACCGGGAA
CCGATCCATG TCTATAACGT AGCCGGCAGG TACACCGTTT CGCTCTACGC GATCGGCTCG
AACGGAACAC AGGAGAAGAC GGTCGAGGAC TACATCAATG TCATCTCACC GATCACCCCA
ACCCCCACAA CCCCGGCGCC GGTGAACACC ACACCAGTGC CGACAACGCA GGTGCCGACC
ATGACCCTGA CTCCTGTGCC GACGAACACC ACACTGGTTC CGACAGTCAC AACGATCGTA
CCGACGGTGA CCGGTAGCCC GTACAACGGC CCGCATAACA TCCCCGGGAC CTTACAGGCC
GAGGACTACG ACCTCGGTGG TGAGGGCGTC GCCTACCACG ACACCACCGC CGGCAACGAG
GGTGGCGTCT ACCGGCATGA CGATGTCGAT ATCGAACAGC TCGACACCGA CGGCTCGCCG
AATGTCGGCT GGATCCGTTC CGGCGAATGG CTAGGATATA CGGTGAACGT CAGCACGGTC
GGCACCTATT CGACCAGTAC CTACGATGCC AGGTTCAGGG TCGCCTCGTC CCACTTCGGG
TCGTCAATTC TGGTATATGT CGACAACGGT ACGACCCCTG TAGCGAATGT TTCTGTCCCG
AACACCGGTG ACTGGCAGAT CTTTAAGACC ATTTTGGTAT CCATACCCCT GCCGGCCGGC
CAGCACCGGC TGGTGTTGAA ATTCCCGACC GACAACGTCA ACATCAACTG GATCACCTTC
ACCTCACGAG GAGTCGAATA A
 
Protein sequence
MRFQAVRMRL FNRMRIFFVI CVLLLLLSGV GVVGALNPLP YSLQWNGTEI GSLGGGLHAP 
GSVAVDDAGS VYVSDSANHQ VQKFTSDGIF VKNWSNPVGN TGDFWYPVGI AVDPTGANVY
VSDLESNRIL KFTSNGDFVD EWNTSGGDPY GVAVDRSGHV FVAEVYNYEG GLGQVQEFAP
SGDLINNWSP GGKKNPLGVA VDSNNNVYIS YFNSNQIRKF TSNGTWLATW GRADGVHGSN
ATEFWAPTDI GVDGDDNVYV ADTDNNRIQI FNAAGLPLGS LGNTAPHSGQ GSGAFNRPSD
VAVDSTGTVF AADTGNNRIQ RFGVVRAPTP TISAGFYAIG HIGQAPYPVR FLDQSVGSPT
AWHWDFGDGS TSTEQSPTHI YNNTGAYNVA LTASNDLASD TAIQYRCIIV NTVPVANFTS
NATAGQTPFT VQFTDQSSDA SGYQWQFGDG TTSADKNPVH TYSHPGTYSV TLTITSGDYG
SVFTEKSGYI TVTDPPTVGF SANVTAGLFP LAVQFNESIT GSVQYYYWQF GDGATSFDRE
PIHVYNVAGR YTVSLYAIGS NGTQEKTVED YINVISPITP TPTTPAPVNT TPVPTTQVPT
MTLTPVPTNT TLVPTVTTIV PTVTGSPYNG PHNIPGTLQA EDYDLGGEGV AYHDTTAGNE
GGVYRHDDVD IEQLDTDGSP NVGWIRSGEW LGYTVNVSTV GTYSTSTYDA RFRVASSHFG
SSILVYVDNG TTPVANVSVP NTGDWQIFKT ILVSIPLPAG QHRLVLKFPT DNVNINWITF
TSRGVE