Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_1512 |
Symbol | |
ID | 7271057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 1556156 |
End bp | 1558846 |
Gene Length | 2691 bp |
Protein Length | 896 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643570127 |
Product | aminotransferase class V |
Protein accession | YP_002466549 |
Protein GI | 219852117 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0175] 3'-phosphoadenosine 5'-phosphosulfate sulfotransferase (PAPS reductase)/FAD synthetase and related enzymes [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0374099 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTCGCG AGCCGCCGGT GAAAAAAGTC CTTTACTGGT GCACCCATTG CAATGTCCCC CTGCTGGCCA GGTCCTGCGC CTGTGGCCGG GACGGAGAGG AACTCCCCCT CCTCCAGCCG TACGATCTCC GGCCGGCCCT TCGTGCCGAT GCAGAACTGA TCCGGGACCT GGTCAGCGCC CGGTTCGGCG ATGTGACGAT CCCGACCATC CTCCTCCTCA ACAAGACCGG TGGGATGGAC CGGAACGACC TGGTCATCGC CAACGGCGCC CGGTTCGGCT GGCTCGTCTT CGACCCCGTC GACCGGCAGT ACCGGTTCGA CATCGCCCCC GAGTCCCTCT CCTGGGTGGT CCCGATGGTG ACGAAAGGGA TCGTCGACCT CTCCACCGCG GCCGACCCGG CCACCCTGGC CGGCCGGCGG CTCGGCGGTA AGAAGGTCGA GGTGACCACC GACGAGCCGG AAGGGACCGT CATCGTCAAG TACCGGCAGC GGTACGGCAC CGGCGTACTC CGGGAAGGGA CGATCCGGGT CAAGGAGCTG TCGCCCTTCG AAGCGAAGAC CTTCGAGAAC CCCGACTGGC AGGAGGCCGT CCACCAGAAC CGGCTCCACC TCAAGAACCT GGAACGGTTC GCGGTCAGGA CGATCAAGCA GCATATGCAC GACCGGCCGA CCATCAACGT CTCCTTCTCG GGCGGCAAGG ACAGCACGGC AGCCCTCGCC CTGGCCCGGC GGGCCGGCGT GACCGACGCC TTCTTCATCA ATACCGGGCT GGAGTTCCCC GAGACCGTCG ACTTCGTCCG GGAGCAGGGG GTCGAGGTGA TCGACAGCGG CGGCGACTTC TGGGCCTCGG TCAGCAAGGC CGGTCCGCCG GGGAAGGACA ACCGCTGGTG CTGCAAGAGC CTCAAACTCC ACCCGCTGAA ACGGTTCCTG GCGAAGACCG GCCCCTGCGT GACCGTGCAG GGCAACCGCT GGTACGAGTC CTGGAACCGG GCCGGCCTCG AAGAGACCAG CCAGAACCCG AACAACCCGC TGCAGCTGAA CATCTCCCCC ATCCGGAACT GGCGGGCGAT CGAGGTCTTC TTCTACCTCT GGTGGCGGAA GGTCCCGTTC AACTCCCTGT ACGAGGAGGG GTTCGAGCGG CTCGGCTGCT ACCTCTGCCC GGCGATGCTC GAAGCCGAGG GCGAACTGAT CAAGCGAACG CACCCCGATT ACGAGGCCCG GTGGCAGAAC TTCCTGGCGG CCTGGGCTGC GCAGAAGGGA TTCCCCGAGG AGTACGCCAC CTGGGGGCTC TGGCGGTGGC GCGAACTGCC GCCGAAGATG TCTGAGATCT GCAGAGAGCA CGGGCTCGCC GTGACCGAGA AGGGGACGCT GGCCACCGGG CCGGCCAGAC CGGTGCCGGT GCCCGTGCAG GTGTCCGAGC CGGTCCTGGA GGCACCGCCG AAGGAACAGC CGGAACCGGT ACAGCAGAAA CTGGCCGGCC GTCAGACCGA AGAGCAGCCT GACCCGTTCT CGGAATACCG AAAGGACTTC CCCCTCCCCG CCGGCCTGAC CTACCTGGAC AGCGCCGGGA CCAGCATCTC GCCAACACCG GTGCTCGACG CCATGATGCA GTACGACCAG ACCTACCGGG CCAACGTCGG CCGCGGGGTC CACCGGCTGA CCCAGGTGGC CACCCAGCGC TACTGGCACG CCCACAAGAA GGTCGCCCGG TTCATCGGCG CCGAGGAGCA GGGCGAGGTG GTCTTCACCA AGAACGCGAC CGAAGCGATC GCGATGGTCG CCTACGGCCT CGGGTTCTGC CCGAAGGAGC GAGTCGTCAC GACCATCCTC GAGCACCACT CCAACCTCCT CCCCTGGATG CGCCTCGCAG AAAAACAGCA GATCGGTGAC CTCACGATCG TCCCGATCGG TGAGGACCTG CTGCTGGACA TGAACGCCCT CGAGGAGGCG ATCACCGACA CCACCCGGCT GGTCACGGTC ACGCAGGCCT CGAATGTGAT CGGCACGATC GTGCCGGTCA AAGAGATCGC AAAGATCTGC CACGACCATG GCGCCCTGCT GCTGGTCGAC GCTGCCCAGT CCGTCCCCCA CATGCCGGTG GACGTCTCGG ACCTGAACTG CGACTTCCTC GCCTTCTCAG GCCACAAGAT GCTCGGTCCG ACCGGCACCG GGGTGCTCTG GATGAAGGAG TCGATCCTCG AACCCCTCCT CCTCGGTGGC GGGGCCGTCA GCTCGGTCAC CGGCACCGGG TATACGCTGG CCGAAGGCTA CGCCCGGTAC GAGGCCGGCA CGCCGCCGAT AGGAGCGGGG ATCGGCCTCG GCGCCGCCGT CGACTACCTG GAGAAGGTCG GGATGGAGAA GGTCCGGTCG CACGAGACCG CCCTGACCAC CCGGATGATC GACGGACTGC GGCGGATCGA CGGGGTCACC GTCTACGCCC CGCAGAACCC GGCCGACCGG ATCGGGGTCG TCTCCTTCAA TGTCGCCGGG TTCGACCCGC ACACCGTCGC CACCTACCTG GACGAGCACG CCGAGGTGCT GGTCAGGTCA GGCCACCACT GCTGCATACC CCTGATGGAG CACCTCGGGA TCCCGGACGG CACGGTCCGG GCGAGCCTGC ACCTCTACTC GAACAGCACC GAGGTCGACA CCCTCCTCGC AGCGGTCGGC GAGATCGCCG GAGGGGTCTG A
|
Protein sequence | MFREPPVKKV LYWCTHCNVP LLARSCACGR DGEELPLLQP YDLRPALRAD AELIRDLVSA RFGDVTIPTI LLLNKTGGMD RNDLVIANGA RFGWLVFDPV DRQYRFDIAP ESLSWVVPMV TKGIVDLSTA ADPATLAGRR LGGKKVEVTT DEPEGTVIVK YRQRYGTGVL REGTIRVKEL SPFEAKTFEN PDWQEAVHQN RLHLKNLERF AVRTIKQHMH DRPTINVSFS GGKDSTAALA LARRAGVTDA FFINTGLEFP ETVDFVREQG VEVIDSGGDF WASVSKAGPP GKDNRWCCKS LKLHPLKRFL AKTGPCVTVQ GNRWYESWNR AGLEETSQNP NNPLQLNISP IRNWRAIEVF FYLWWRKVPF NSLYEEGFER LGCYLCPAML EAEGELIKRT HPDYEARWQN FLAAWAAQKG FPEEYATWGL WRWRELPPKM SEICREHGLA VTEKGTLATG PARPVPVPVQ VSEPVLEAPP KEQPEPVQQK LAGRQTEEQP DPFSEYRKDF PLPAGLTYLD SAGTSISPTP VLDAMMQYDQ TYRANVGRGV HRLTQVATQR YWHAHKKVAR FIGAEEQGEV VFTKNATEAI AMVAYGLGFC PKERVVTTIL EHHSNLLPWM RLAEKQQIGD LTIVPIGEDL LLDMNALEEA ITDTTRLVTV TQASNVIGTI VPVKEIAKIC HDHGALLLVD AAQSVPHMPV DVSDLNCDFL AFSGHKMLGP TGTGVLWMKE SILEPLLLGG GAVSSVTGTG YTLAEGYARY EAGTPPIGAG IGLGAAVDYL EKVGMEKVRS HETALTTRMI DGLRRIDGVT VYAPQNPADR IGVVSFNVAG FDPHTVATYL DEHAEVLVRS GHHCCIPLME HLGIPDGTVR ASLHLYSNST EVDTLLAAVG EIAGGV
|
| |