Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_0173 |
Symbol | |
ID | 7270944 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 200940 |
End bp | 202145 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643568831 |
Product | glycosyl transferase group 1 |
Protein accession | YP_002465288 |
Protein GI | 219850856 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.572056 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATCTC GACGTGCTTC TTCAGATAAG CAACTCTGCG CCCTGCAGAT CTTTCATCAG TACCTCGACC CATCGGAGAA CTGGGTGTAC CGGCTCCTTT CGCACTGCAC CAGAACAAGG ATGGTGATCG GTGCAGAGAC CTACCTGCAC AACGACCTCT ATGATCCCAG GTTCACCTAC ATCCTCTCCC CAATTCACCC GCTCAGCGTC CATCCGGGAT CGCCGATGGT GCAGTGGTTG AACCGGTACA TTAAGAAACT GCGAGGGGTG ACCTACCCCT GGTTTCTGGA ACAGCAGGCC AGGCACCAGG GGTGCAAACT GATCCACTCG CACTTCGCCA ATGTCGGGTG GTACTCGCTG CCGATCGCAA AAAGGCTCCA CCTCCCACAT GTGGTCTCCT TCTACGGACT CGACTATGAG TGGCTGCCGT TCACGCAACC GGAGTGGAGG CCACGGTACC AGGATCTGAT CGATCAGGCA GACCTGTTCA TCTGCGAGGG AGAGCATGGG ATCGGTCTGC TCAGGGCGAT GGGGTGTCCG GAAGAGAAGG CCACCGTGGT CCACCTTGGA GTCGAGGTGG AGACGATCCC CTGGCAGATC CGAAAGAAGC AGGCGGGAGA GCTTCACCTG CTGCAGATCG CACGACTGAT CGAGAAGAAG GGGCATATCG ACACGGTCAG GGCCTTTCTC AAGGCTCTCC AACACTGCCC CAATATGACA CTGACGATAG CAGCCCCCGG CTCGGAGAAA CGCCGGCAAC GGTTGGATGC CGTGGTCCGG AGGGCCGGGG CCGAGGATGC CGTGACGTTT CTGGACTGGG TGGACTTCGC CACCCTGCAC CAGTTCATGC TCAGGTACCA GGTCTTCATC CATCCAAGCC GCCATGCCAG TGATCGTGAC TGTGAGGGGG GAGCCCCGGT GGTGCTGATC GATGCAGAGG CCACCGGGAT GCCAGTGATC GCGACCGACC ACTGCGATAT TCCGGAGGTC GTCGTGAACG ACAGGACCGG CCTGCTGACC CCGGAGCGGG ACGTGGATCA ACTAGCCGCC TCGATCGAGC GGTTCTACTG GATGGACCAG GACGAGTACC GGACGTTCTG CGAGCAGGCG CGGAGGCATG TGGAAGAGGA GTACAGTGCA GCCAGGTCTG CCGAGCGGCT CGAGATGGTG TACGCCAGCC TCATTGAACA GTATATACGG CGATAG
|
Protein sequence | MSSRRASSDK QLCALQIFHQ YLDPSENWVY RLLSHCTRTR MVIGAETYLH NDLYDPRFTY ILSPIHPLSV HPGSPMVQWL NRYIKKLRGV TYPWFLEQQA RHQGCKLIHS HFANVGWYSL PIAKRLHLPH VVSFYGLDYE WLPFTQPEWR PRYQDLIDQA DLFICEGEHG IGLLRAMGCP EEKATVVHLG VEVETIPWQI RKKQAGELHL LQIARLIEKK GHIDTVRAFL KALQHCPNMT LTIAAPGSEK RRQRLDAVVR RAGAEDAVTF LDWVDFATLH QFMLRYQVFI HPSRHASDRD CEGGAPVVLI DAEATGMPVI ATDHCDIPEV VVNDRTGLLT PERDVDQLAA SIERFYWMDQ DEYRTFCEQA RRHVEEEYSA ARSAERLEMV YASLIEQYIR R
|
| |