Gene Mbar_A3201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A3201 
Symbol 
ID3627104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp4116063 
End bp4117253 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content45% 
IMG OID637702040 
Productcysteine desulphurase 
Protein accessionYP_306665 
Protein GI73670650 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.11568 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0184494 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAGAAA AGCGCTTTGT TTACATGGAC CACGCAGCCA CCACTTTCAC AAAACCTGAA 
GTGTTTGAAG CTATGCTGCC TTTTTTGAAA GAACATTTCG GAAACCCTTC TTCCCTGTAT
TCAATAGGGA GAGAAGGTAA AGAGGCAGTA GAGACCGCAC GTGAGCAGCT TGCAAAGGCT
CTGGGAGCTA GTCCTGAGGA AATATATTTC ACCTCCGGAG GAACCGAGTC CGATAACTGG
GCTATCAAGG GAACAGCTTT TGCCAGGAGA AAGAAAGGAA AACATATCAT TACAACACCA
ATTGAACATC ATGCAGTGCT CTATCCTTGT AAGTACCTGG AAACCCAGGG CTTTGATGTG
ACTTACCTGC CTGTAGACAG TTACGGGCTT GTAGACCCTG CAGAGGTTGA AGCTGCAATT
AGAGATGATA CTATCCTGCT CTCGGTTATG TATGCGAATA ATGAAATCGG GACAATAGAG
CCTATTCATG AGATAGGCGA GATCGCAAGA GAACATGAGA TTCCTTTTCA TACTGATGCT
GTTCAGGTAA TTGGTAAAAT TCCTCTTGAG ATGGAAAAGA AAGAAAAGAA TGTTGACATG
CTTGCCCTTT CTTCTCACAA GTTCTATGGA CCCAAAGGAA TAGGAGCGCT CTATCTACGG
GAAGGGATAG AAATCGACAA TTATATGCAT GGGGGCAGCC AGGAGCGCAA AAAGCGAGCA
GGAACTGAGA ATGTGGCAGG TATTGTAGGA TTGGGAAAAG CAATAGAACT TGCAACAGGA
AATCTTGAGA AGCATAATGA GAAAATGAAG AGACTGAGAG ACCGTCTCCT TAAAGGAGTC
CTGAAAATTT CTGACTGCAG GCTTAACGGA CACCCGGAAA AATGCCTTTC GAACAACCTG
AATTTCAGTT TTGAATACAT CGAAGGCGAA TCTCTTCTTC TCATGCTTGA CGAGATGGGG
ATCTGCAGTT CCACAGGGAG TGCCTGTTCC TCAGGTTCTC CTGAGCCCTC GCACGTGCTC
AGGGCAATAG GGCTGCCTCC AGAAATAGCT CAGGGTTCCC TTCGTCTGAC CCTTGGAGAT
GATAATTCCG AAGAAGACAT TGATTATGTA CTTGAGGTTT TGCCTGAGAC CGTCGAAAAG
CTAAGGGTTA TGTCTCCTTT CTATAAACCT GAAAATGCAT GTAAGAAATA A
 
Protein sequence
MGEKRFVYMD HAATTFTKPE VFEAMLPFLK EHFGNPSSLY SIGREGKEAV ETAREQLAKA 
LGASPEEIYF TSGGTESDNW AIKGTAFARR KKGKHIITTP IEHHAVLYPC KYLETQGFDV
TYLPVDSYGL VDPAEVEAAI RDDTILLSVM YANNEIGTIE PIHEIGEIAR EHEIPFHTDA
VQVIGKIPLE MEKKEKNVDM LALSSHKFYG PKGIGALYLR EGIEIDNYMH GGSQERKKRA
GTENVAGIVG LGKAIELATG NLEKHNEKMK RLRDRLLKGV LKISDCRLNG HPEKCLSNNL
NFSFEYIEGE SLLLMLDEMG ICSSTGSACS SGSPEPSHVL RAIGLPPEIA QGSLRLTLGD
DNSEEDIDYV LEVLPETVEK LRVMSPFYKP ENACKK