Gene Information |
Locus tag | Mboo_0420 |
Symbol | |
ID | 5410141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | + |
Start bp | 405021 |
End bp | 405956 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640867633 |
Product | cysteine synthase A |
Protein accession | YP_001403582 |
Protein GI | 154149964 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases; [TIGR01139] cysteine synthase A |
|
|
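The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the database record. It assumes that the start and end positions are 1-based and inclusive, and that the CDS span includes the stop codon. The function names `gene_length` and `expected_protein_length` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and that the final
    codon is a stop codon, which is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record: Start bp, End bp.
start_bp, end_bp = 405021, 405956

length_bp = gene_length(start_bp, end_bp)
protein_aa = expected_protein_length(length_bp)

print(length_bp)   # 936, matching "Gene Length"
print(protein_aa)  # 311, matching "Protein Length"
```

Under these assumptions the computed values agree with the reported gene length of 936 bp and protein length of 311 aa.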
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.858609 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAAAAA TCTATGACAA TATCACCCAG ACAATCGGCA ACACCCCGCT TGTCCGGCTC AACCGTATCG CCCCGAAGTC CGGTGCAACA ATCCTTGCCA AGATCGAATC CTTCAACCCG ATGAGCAGCG TCAAGGACCG CATCGGGGTT GCAATGATCG ATGCGGCAGA AAAGGCCGGT CTTATCAAAA AAGACACCAT CATCCTTGAA CCCACGAGCG GGAATACCGG CGTTGCACTG GCATTCGTGA GTGCGGCCCG GGGATACAGG TTCACGCTTG TTATGCCCGA GACCATGAGT ATCGAGCGCC GGAAACTCGC AAAGGCATTT GGTGCAGAAC TTGTCCTCAC CCCCGGTGCG GAAGGGATGA AAGGGGCAGT GGCAAAGGCT GAGCAGCTTG CCGCAGAAAA CCCGAACTAC TTCTACATCC CCCAGCAGTT CAAAAACCTG GCAAACCCCG AGATCCACCG GAAGACCACG GCCCAGGAGA TCTGGCGTGA TACCGGCGGA AACGTAGATA TCCTTATTGC AGGGGTGGGT ACCGGTGGGA CGATCACCGG CATTTCTGAA GTCATCAAGA AGAAGAAGCC CTCGTTTTGT GCCATCGCTG TTGAACCGGA AGCATCGCCC GTTCTCTCGG GCGGAAAGCC CGGTCCCCAC CGTATCCAGG GGATCGGTGC CGGCTTTGTC CCGGACGTAC TCAAGCGCGA GCTGGTAGAC GAGATCATCC AGGTCAAAAA CGAGGACGCG TTCGAAACAA CACGAAACCT CGCCAAGCAG GAAGGGATCC TTGCCGGGAT CTCAAGCGGT GCGGCCCTGT ATGCAGCGCT TACGGTTGCA AAACGAAAAG AGAACAAAGG CAAGACCATT GTCGTGATCC TGCCGGATAC CGGTGAACGC TACCTCAGTA TCCCGGACCT CTTTGCAGCG GAGTAA
|
Protein sequence | MVKIYDNITQ TIGNTPLVRL NRIAPKSGAT ILAKIESFNP MSSVKDRIGV AMIDAAEKAG LIKKDTIILE PTSGNTGVAL AFVSAARGYR FTLVMPETMS IERRKLAKAF GAELVLTPGA EGMKGAVAKA EQLAAENPNY FYIPQQFKNL ANPEIHRKTT AQEIWRDTGG NVDILIAGVG TGGTITGISE VIKKKKPSFC AIAVEPEASP VLSGGKPGPH RIQGIGAGFV PDVLKRELVD EIIQVKNEDA FETTRNLAKQ EGILAGISSG AALYAALTVA KRKENKGKTI VVILPDTGER YLSIPDLFAA E
|
| |
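The sequences above are displayed in space-separated blocks of ten characters. Below is a minimal sketch of how such a blocked string might be cleaned and checked before further use. The helper names `clean_sequence`, `gc_fraction`, and `ends_with_stop` are hypothetical, and the stop codon set (TAA, TAG, TGA) is an assumption based on the translation table 11 listed in the record. Only the first two blocks and the final block of the gene sequence are used here; the full sequence can be passed in the same way to compare against the reported GC content.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def ends_with_stop(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop.

    The default stop codons are an assumption for translation table 11.
    """
    return len(seq) % 3 == 0 and seq[-3:] in stops


# First two display blocks and the final block, copied from the record.
head = clean_sequence("ATGGTAAAAA TCTATGACAA")
tail = clean_sequence("GAGTAA")

print(len(head))              # 20
print(head.startswith("ATG"))  # True
print(ends_with_stop(tail))    # True
```

Calling `gc_fraction(clean_sequence(full_gene_sequence))` on the complete gene sequence, where `full_gene_sequence` is a placeholder for the string in the "Gene sequence" row, would give a value to compare with the 56% listed under "GC content".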