Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_2409 |
Symbol | |
ID | 7271322 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 2550638 |
End bp | 2551684 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643571011 |
Product | Cellulase |
Protein accession | YP_002467412 |
Protein GI | 219852980 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAAAAA AACTGTTACA GCAACTATCA GATGCCCATG GCACATCAGG CAACGAAGAG AGTATCTATG CATTGATTAA GAAGGAGATC ACCCCCTATG TCGACGAGAT TCTCGAAGAT ACGATGGGCA ACCTGATAGC CGTCAAACAT GGCGATGACT TCAAGGTGAT GCTGGCCGCC CACATGGACG AGATCGGGCT GATGGTCAGC TATGTCGATG AGAAGGGGTT CATCCGGTTC GTCCCGATCG GCGGCTGGTA TGCCCCTACC CTGTACAACC AGCGGGTCCT GATCCATGGA AAGAACGGGA TGATCGCCGG CGTCGTCGGT GGCAAGCCTC CGCATATGAT GAAGGACGAA GAACGGAAAC GGGGCGTCAA GATCGAGGAG ATGTTCGTCG ATGTCGGTGC CGCATCGGTT GAGGAGGTCG CAGCTCTCGG AATTGAAATC GGTGACCCGG TCACCGTCGA CCGTTCGTTT GCTGAATTAT CGAACAACCG TGTCAGTGGC AAGGCCTTTG ACAACCGGGC CGGCGTTGTG ATGCTCATTA AGGCCATGCA GCGGATGAAG TCCCCGTCCA CGGTCTATGC CGTATTCACC GTCCAGGAGG AAGTCGGATG TAAGGGGGCG AAGACCAGCG CCTTTGGGAT CGATCCTGAC TGTGCAATCG CCACCGACGT GACGATCCCC GGAGACCATC CCGGCATCGA GCAGAAGGAC GCCCCGGTGA AGATGGGCAA GGGGCCGGTC GTCGCCATCG TCGAGGCGGG CGGACGGGGT GTAGTCGCCC ATAAACGAAT GATCTCATGG CTTCGGGACG CCGCAGAGAA AGCCCAGATC CCGGTCCAGT TCGAGGTTGG AAGTGGCGGA GTGACCGACG CTTCATCTAT CTACCTGACC CGAGCTGGCA TCCCCTGCAC CTCCTTCTCG ATCCCGACCC GGTATATCCA CTCCCCGGTA GAGGTGCTCG ACATCGCCGA TCTCGAGGCT GCGGTCGATC TCCTGGTCGA GGCGCTGAAG ACTCGTCCGG ACTTCTCCAC CCACTAA
|
Protein sequence | MVKKLLQQLS DAHGTSGNEE SIYALIKKEI TPYVDEILED TMGNLIAVKH GDDFKVMLAA HMDEIGLMVS YVDEKGFIRF VPIGGWYAPT LYNQRVLIHG KNGMIAGVVG GKPPHMMKDE ERKRGVKIEE MFVDVGAASV EEVAALGIEI GDPVTVDRSF AELSNNRVSG KAFDNRAGVV MLIKAMQRMK SPSTVYAVFT VQEEVGCKGA KTSAFGIDPD CAIATDVTIP GDHPGIEQKD APVKMGKGPV VAIVEAGGRG VVAHKRMISW LRDAAEKAQI PVQFEVGSGG VTDASSIYLT RAGIPCTSFS IPTRYIHSPV EVLDIADLEA AVDLLVEALK TRPDFSTH
|
| |