Gene Mpal_2409 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_2409 
Symbol 
ID7271322 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2550638 
End bp2551684 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content58% 
IMG OID643571011 
ProductCellulase 
Protein accessionYP_002467412 
Protein GI219852980 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAAAAA AACTGTTACA GCAACTATCA GATGCCCATG GCACATCAGG CAACGAAGAG 
AGTATCTATG CATTGATTAA GAAGGAGATC ACCCCCTATG TCGACGAGAT TCTCGAAGAT
ACGATGGGCA ACCTGATAGC CGTCAAACAT GGCGATGACT TCAAGGTGAT GCTGGCCGCC
CACATGGACG AGATCGGGCT GATGGTCAGC TATGTCGATG AGAAGGGGTT CATCCGGTTC
GTCCCGATCG GCGGCTGGTA TGCCCCTACC CTGTACAACC AGCGGGTCCT GATCCATGGA
AAGAACGGGA TGATCGCCGG CGTCGTCGGT GGCAAGCCTC CGCATATGAT GAAGGACGAA
GAACGGAAAC GGGGCGTCAA GATCGAGGAG ATGTTCGTCG ATGTCGGTGC CGCATCGGTT
GAGGAGGTCG CAGCTCTCGG AATTGAAATC GGTGACCCGG TCACCGTCGA CCGTTCGTTT
GCTGAATTAT CGAACAACCG TGTCAGTGGC AAGGCCTTTG ACAACCGGGC CGGCGTTGTG
ATGCTCATTA AGGCCATGCA GCGGATGAAG TCCCCGTCCA CGGTCTATGC CGTATTCACC
GTCCAGGAGG AAGTCGGATG TAAGGGGGCG AAGACCAGCG CCTTTGGGAT CGATCCTGAC
TGTGCAATCG CCACCGACGT GACGATCCCC GGAGACCATC CCGGCATCGA GCAGAAGGAC
GCCCCGGTGA AGATGGGCAA GGGGCCGGTC GTCGCCATCG TCGAGGCGGG CGGACGGGGT
GTAGTCGCCC ATAAACGAAT GATCTCATGG CTTCGGGACG CCGCAGAGAA AGCCCAGATC
CCGGTCCAGT TCGAGGTTGG AAGTGGCGGA GTGACCGACG CTTCATCTAT CTACCTGACC
CGAGCTGGCA TCCCCTGCAC CTCCTTCTCG ATCCCGACCC GGTATATCCA CTCCCCGGTA
GAGGTGCTCG ACATCGCCGA TCTCGAGGCT GCGGTCGATC TCCTGGTCGA GGCGCTGAAG
ACTCGTCCGG ACTTCTCCAC CCACTAA
 
Protein sequence
MVKKLLQQLS DAHGTSGNEE SIYALIKKEI TPYVDEILED TMGNLIAVKH GDDFKVMLAA 
HMDEIGLMVS YVDEKGFIRF VPIGGWYAPT LYNQRVLIHG KNGMIAGVVG GKPPHMMKDE
ERKRGVKIEE MFVDVGAASV EEVAALGIEI GDPVTVDRSF AELSNNRVSG KAFDNRAGVV
MLIKAMQRMK SPSTVYAVFT VQEEVGCKGA KTSAFGIDPD CAIATDVTIP GDHPGIEQKD
APVKMGKGPV VAIVEAGGRG VVAHKRMISW LRDAAEKAQI PVQFEVGSGG VTDASSIYLT
RAGIPCTSFS IPTRYIHSPV EVLDIADLEA AVDLLVEALK TRPDFSTH