Gene Information |
Locus tag | Mbar_A3201 |
Symbol | |
ID | 3627104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosarcina barkeri str. Fusaro |
Kingdom | Archaea |
Replicon accession | NC_007355 |
Strand | + |
Start bp | 4116063 |
End bp | 4117253 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637702040 |
Product | cysteine desulphurase |
Protein accession | YP_306665 |
Protein GI | 73670650 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | [TIGR03402] cysteine desulfurase NifS |
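
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene span includes the stop codon (the listed gene sequence ends in TAA). The function names are illustrative only and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 4116063
END_BP = 4117253


def gene_length(start, end):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len):
    """Residue count for a CDS whose span includes one stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1191
print(protein_length(length_bp))  # 396
```

Both results agree with the Gene Length and Protein Length fields listed in the record.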
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.11568 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0184494 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAGAAA AGCGCTTTGT TTACATGGAC CACGCAGCCA CCACTTTCAC AAAACCTGAA GTGTTTGAAG CTATGCTGCC TTTTTTGAAA GAACATTTCG GAAACCCTTC TTCCCTGTAT TCAATAGGGA GAGAAGGTAA AGAGGCAGTA GAGACCGCAC GTGAGCAGCT TGCAAAGGCT CTGGGAGCTA GTCCTGAGGA AATATATTTC ACCTCCGGAG GAACCGAGTC CGATAACTGG GCTATCAAGG GAACAGCTTT TGCCAGGAGA AAGAAAGGAA AACATATCAT TACAACACCA ATTGAACATC ATGCAGTGCT CTATCCTTGT AAGTACCTGG AAACCCAGGG CTTTGATGTG ACTTACCTGC CTGTAGACAG TTACGGGCTT GTAGACCCTG CAGAGGTTGA AGCTGCAATT AGAGATGATA CTATCCTGCT CTCGGTTATG TATGCGAATA ATGAAATCGG GACAATAGAG CCTATTCATG AGATAGGCGA GATCGCAAGA GAACATGAGA TTCCTTTTCA TACTGATGCT GTTCAGGTAA TTGGTAAAAT TCCTCTTGAG ATGGAAAAGA AAGAAAAGAA TGTTGACATG CTTGCCCTTT CTTCTCACAA GTTCTATGGA CCCAAAGGAA TAGGAGCGCT CTATCTACGG GAAGGGATAG AAATCGACAA TTATATGCAT GGGGGCAGCC AGGAGCGCAA AAAGCGAGCA GGAACTGAGA ATGTGGCAGG TATTGTAGGA TTGGGAAAAG CAATAGAACT TGCAACAGGA AATCTTGAGA AGCATAATGA GAAAATGAAG AGACTGAGAG ACCGTCTCCT TAAAGGAGTC CTGAAAATTT CTGACTGCAG GCTTAACGGA CACCCGGAAA AATGCCTTTC GAACAACCTG AATTTCAGTT TTGAATACAT CGAAGGCGAA TCTCTTCTTC TCATGCTTGA CGAGATGGGG ATCTGCAGTT CCACAGGGAG TGCCTGTTCC TCAGGTTCTC CTGAGCCCTC GCACGTGCTC AGGGCAATAG GGCTGCCTCC AGAAATAGCT CAGGGTTCCC TTCGTCTGAC CCTTGGAGAT GATAATTCCG AAGAAGACAT TGATTATGTA CTTGAGGTTT TGCCTGAGAC CGTCGAAAAG CTAAGGGTTA TGTCTCCTTT CTATAAACCT GAAAATGCAT GTAAGAAATA A
|
Protein sequence | MGEKRFVYMD HAATTFTKPE VFEAMLPFLK EHFGNPSSLY SIGREGKEAV ETAREQLAKA LGASPEEIYF TSGGTESDNW AIKGTAFARR KKGKHIITTP IEHHAVLYPC KYLETQGFDV TYLPVDSYGL VDPAEVEAAI RDDTILLSVM YANNEIGTIE PIHEIGEIAR EHEIPFHTDA VQVIGKIPLE MEKKEKNVDM LALSSHKFYG PKGIGALYLR EGIEIDNYMH GGSQERKKRA GTENVAGIVG LGKAIELATG NLEKHNEKMK RLRDRLLKGV LKISDCRLNG HPEKCLSNNL NFSFEYIEGE SLLLMLDEMG ICSSTGSACS SGSPEPSHVL RAIGLPPEIA QGSLRLTLGD DNSEEDIDYV LEVLPETVEK LRVMSPFYKP ENACKK
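
As a rough illustration of how the sequence fields relate to each other, the sketch below computes GC content and translates a nucleotide sequence written in the space-separated blocks used above. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical, and only the first 30 nucleotides of the gene sequence are used as input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; table 11 shares these
# assignments with the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
prefix = "ATGGGAGAAA AGCGCTTTGT TTACATGGAC"
print(translate(prefix))  # MGEKRFVYMD
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.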