Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlab_1122 |
Symbol | |
ID | 4795660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanocorpusculum labreanum Z |
Kingdom | Archaea |
Replicon accession | NC_008942 |
Strand | + |
Start bp | 1133723 |
End bp | 1136362 |
Gene Length | 2640 bp |
Protein Length | 879 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640099793 |
Product | hypothetical protein |
Protein accession | YP_001030558 |
Protein GI | 124485942 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0175] 3'-phosphoadenosine 5'-phosphosulfate sulfotransferase (PAPS reductase)/FAD synthetase and related enzymes [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACTAAAT TTCTCTACTG GTGTCCGGCA TGCAATATCC CCCTCCTTGC AAAGACCTGT GCCTGCGGGA ACGAGAGCAT AAAGATCTCC CTTCAGCAGC CGTACGATGT CCGTCCGGCC CTGAAGGCCG ATCATGACCT CATCTCTTCC CTCATCAAAC AACGGTTCGG GGATGCGGTG ACTCTCCCAA AGATCCTCGT TCTCAACAAA GCCGGAGGTC TCGACCGTAA CGATCTCATC ATCGCAAACG GCGTACGGTT TGCCTGGCTC TGGTTCGATC CGGTCGCACG AAAGTTCAAT CTCGATCTCG AAGCCGAGGC GTTGCCCTAT CTGATCGGCA AAGCCGAGAA GGGGATCATC GATCTCGAAA AGGACGCTCC CGGACTGCCT GAAGGAAGAC TCGGCGGCAA AAAGGTCAAG GTCACTACGA CCGGGATCTC TGACGGCGTC GTCATCCTGC GGTACAAAAA TAAATACGGG ACCGGCATTC TTAAAGACGG CTCGGTCAGA ATAAAGGAAC TCATCAGCGT TTCACCGATA AAATCCAAGG CAAACCCGTC GTGGGAGGAC GCGGTCGAGA AGAACGCGTT CCACATCAAA AACATGGAGC GAAATGCCGT CCGCGAGATC AGACAGAATG CACCGCTCAA ACCAAGGGTG AACTGTTCGT TCTCCGGCGG GAAAGACAGC ACGGCCGTCT GGAACATCGC GAAAAAAGCC GGCGTGACCG AGGCGTTTTT CATAGATACC GGCCTCGAGT TCCCGGAAAC GATTGATTTC GTCAAGTCAC AGGACGTCGA ACTCATCCAA AAGGCCGGGG ACTTCTGGCA GGCAGTGGAA AAAGCCGGCC CTCCGGGAAA AGATCACCGG TGGTGCTGCA AACTCTTGAA ACTCAACCCG CTGAAAGTCC ATCTCAACGA TACAGGAGAG TGTCTGACGA TCCAGGGAAA CCGCTGGTAT GAATCATGGA GCCGGGCGTC GCTCGAAGCT CTCAGTCAGA ATCCGACGAA CCCTCTGCAG CTGAACCTCT CGCCGATCAG ATCCTGGCGG GCTCTGGACG TTTTCTTCTA TCTGTGGCTC CGCGAACTTC CCTACAATCC GCTTTACGAA CGCGGGTATG AAAGAATCGG CTGCTATCTT TGTCCAGCGA TGCTTGAGTC GGAACTTGAA ACGCTTCGCG TTACCCACCC GGAGATGGCG GACCGCTGGC ATGAGTTTCT TACCCGATGG GCGGAGGAAC GCGGTCTCCC GCCCGAGTTC GTTACCTGGG GTCTCTGGCG CTGGAAGGAG CTGCCGCCGA AAATGCAGGA GCTCGTCAAA GAAGCGGGCC TCGATCTGAC GGAGAAAAAA ATCAGACGAA GTACGCCCGC CTCGGTCGCT CATCTTATGC CGGCCGGCAA AACAACAGAC ATCGTCGAAG ACCGCGTTAC GCAGGAGCCG GAGCCGGAAA AAACACCGGA ACCTGACTGG GACGCACTTC GCGGCGAGTT TCCCATGATG GGGGATCTGA TGTACTTCGA CAACGGAGCG ACGACCTGGT CGCCAGAGTG CGTTCTTGCG GCGACGGACG AATTTGAACG GGAGTATCGT GCAAACGTAG GACGCGGCGT TCACCGGCTG ACGAGGATCG CGACACAGAA ATACTGGCAT GCTCACGAAA AGGTCGCCGA GTTCATCAAC GGCTCTGCGG GAACGACCGT GTTTGTGAAA AACACCACCG AAGCGGTCAA CACGATCGCC CGCGGTCTTT CGTTCAAGGA AGTGGACGTG ATCGTAACGA CGATCCTCGA GCACCACTCG AATCTTCTTC CCTGGCGGGC GCTCGAAGCA AAAGGAGTCA CACTCCGCAT CATCGGGCTC AACGATGATC TGACGCTGAA TATGGATGAA TTCGAAGCAG CGATGGATCC GTCTGTCCGG CTTGTTGCCG TGACCCATGC TTCGAACGTT ACGGGGACGA TCACGCCGAT CGCTGAAATC TCACGGCTGT GCAAAAAATA CGGCTCGCTG CTTGCGGTCG ATGCAGCTCA GTCGGTCCCG CGAATGCCGG TCGATGTCGA GAAACTCGGC GTGGATTTCC TCTCTTTCTC AGGACACAAG ATGTTTGGTC CCATGGGGAC CGGCGTTCTC TGGATGAGGG AACCGATCCT TGAACCCCTG CTCCTTGGAG GCGGTATGGT CGAGAGCGTC ACCGAAGACG GCTATGTGCC TGCCGAAGGG TATCATAAAT ATGAAGCCGG CACGCCCAAC GTATCAGGCG GTATCGGCCT TGGGGCCGCC GTCGAATTTC TTTCCGGCAT CGTTATGGAC CATATCGAAG CCCATGAACG AAAACTCACC GATATGCTGA TCGACGGACT TGCCGCGATC CCCGGCGTTA CCCTGTACTG CTGTCGAGAT AAAACTCAGC GTATCGGCGT TGTCTCCTTC ACGGTCGAAG GCTTTGCGCC CCACGAAATC GCCGAGTGGC TGGATGACGA ACACGGAATC GAGATCAGAT CGGGTCTCCA CTGCGCCGAG CCGCTTATGC AGTATCTTTC TGCAGAAAAA GGAACGGCAC GTGCGTGCAT CTCTTTCTAT AATACCGAAA GCGAAGTGAA CACATTTATT GCCGTGATCA GGGAACTGAC CGGCAATTAA
|
Protein sequence | MTKFLYWCPA CNIPLLAKTC ACGNESIKIS LQQPYDVRPA LKADHDLISS LIKQRFGDAV TLPKILVLNK AGGLDRNDLI IANGVRFAWL WFDPVARKFN LDLEAEALPY LIGKAEKGII DLEKDAPGLP EGRLGGKKVK VTTTGISDGV VILRYKNKYG TGILKDGSVR IKELISVSPI KSKANPSWED AVEKNAFHIK NMERNAVREI RQNAPLKPRV NCSFSGGKDS TAVWNIAKKA GVTEAFFIDT GLEFPETIDF VKSQDVELIQ KAGDFWQAVE KAGPPGKDHR WCCKLLKLNP LKVHLNDTGE CLTIQGNRWY ESWSRASLEA LSQNPTNPLQ LNLSPIRSWR ALDVFFYLWL RELPYNPLYE RGYERIGCYL CPAMLESELE TLRVTHPEMA DRWHEFLTRW AEERGLPPEF VTWGLWRWKE LPPKMQELVK EAGLDLTEKK IRRSTPASVA HLMPAGKTTD IVEDRVTQEP EPEKTPEPDW DALRGEFPMM GDLMYFDNGA TTWSPECVLA ATDEFEREYR ANVGRGVHRL TRIATQKYWH AHEKVAEFIN GSAGTTVFVK NTTEAVNTIA RGLSFKEVDV IVTTILEHHS NLLPWRALEA KGVTLRIIGL NDDLTLNMDE FEAAMDPSVR LVAVTHASNV TGTITPIAEI SRLCKKYGSL LAVDAAQSVP RMPVDVEKLG VDFLSFSGHK MFGPMGTGVL WMREPILEPL LLGGGMVESV TEDGYVPAEG YHKYEAGTPN VSGGIGLGAA VEFLSGIVMD HIEAHERKLT DMLIDGLAAI PGVTLYCCRD KTQRIGVVSF TVEGFAPHEI AEWLDDEHGI EIRSGLHCAE PLMQYLSAEK GTARACISFY NTESEVNTFI AVIRELTGN
|
| |