Gene Mbar_A3080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A3080 
Symbol 
ID3625211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp3961766 
End bp3963082 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content49% 
IMG OID637701918 
Productarylsulfatase regulator 
Protein accessionYP_306548 
Protein GI73670533 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.482835 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0699226 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAAA ACCATCTGCC TCCACGCATT CATGTGCTGG CAAAACCAAC AGGAGCTATC 
TGTAACCTTG CTTGCTCCTA CTGCTTCTTC CTGGCTAAAG AAGCACTCTA CCCAGGCAGC
AAGTTCCGCA TGTCAGACAA AGTTCTGGAA AACTACATCC GTCAGCTTAT CGAAGCCCAC
CACAGCCCGC AGGTAACCGT CGCCTGGCAG GGAGGGGAAC CCACACTTAT GGGAATAGAT
TTTTACCGGC GTGCAATCGA ACTGCAGGAA AAATACAGAA AACCAGGCAT ATCTTTCGAA
AATACGATGC AAACAAACGG CACGCTGCTG GATGACGAGT GGTGCCGGTT TTTTAAGGAA
AACAATTTTC TTATAGGGAT CAGCATCGAC GGTCCTCGTG AATTGCACGA TGCTTACCGG
GTGGACAAAA AAGGCAATGG AACTTTCGAC CAGGTCATGA AGGGGCTTCG ACTGTTGCAA
AAACACGGTG TTGAATACAA TGTCCTGACC ACAGTAAACA GGGCCAATGC CGATTATCCA
CTTGAAGTCT ACCGCTTCCT TAGGGATGAA GCAGGAACAG ATTGGATGCA GTTTATTCCG
GTTGTAGAGC GAATCAATGA AGAGGGACAT ACTCTTTATC AGAGAGGAGA CACAGTTTCG
AACCGTTCTG TGCAGCCAGA GCAGTTTGGC AATTTCCTGA ACCACATTTT TGACGAGTGG
GTAAGAAACG ATGTGGGAAA GATATTCGTG CAGACTTTTG AAGCTTCTGC GCGAAGATGG
CTTGGTATGC CTTCAGGAAT GTGCGTTTTT GAGGAAACGT GCGGTACTGG GCTTGCTCTG
GAACACAATG GCGATCTTTA TTCATGCGAC CATTTTGTGG AACCTGACTA TCTGCTTGGC
AATATTATGG AAAAAGAGAT TTCTGAGCTT GCTGCTTTGG AAAAGCAGTA CAGGTTCGGA
CAGGACAAAT GCGACACTCT CCCACAGGTA TGCCGGGAAT GTGAGGTACT CTTTGCCTGC
CAGGGGGAAT GCCCTAAAAA CCGCTTCCTT ACCACTCCTG CCGGGGAAAC CGGACTGAAC
TACCTATGCG AGGGCTGGAA AGCTTTCTTC CGGCATATCG ACTTTCCGAT GCAGATTCTG
GCAGGCTTAA TTCGCAGAGG TTACCCAGCT TCGGAAGTTA TGCGAGTTAT GGCTCTAGAA
GATGCTTTTG CCCGGGCAGG GCGAAATGAT CCCTGTCCCT GCGGCAGCGG CCGAAAGTTT
AAACGTTGCC ATGGTCTCAG GAAAACTAAC GCGAAAGGGG AAACGAGAAT TAGGTAA
 
Protein sequence
MTKNHLPPRI HVLAKPTGAI CNLACSYCFF LAKEALYPGS KFRMSDKVLE NYIRQLIEAH 
HSPQVTVAWQ GGEPTLMGID FYRRAIELQE KYRKPGISFE NTMQTNGTLL DDEWCRFFKE
NNFLIGISID GPRELHDAYR VDKKGNGTFD QVMKGLRLLQ KHGVEYNVLT TVNRANADYP
LEVYRFLRDE AGTDWMQFIP VVERINEEGH TLYQRGDTVS NRSVQPEQFG NFLNHIFDEW
VRNDVGKIFV QTFEASARRW LGMPSGMCVF EETCGTGLAL EHNGDLYSCD HFVEPDYLLG
NIMEKEISEL AALEKQYRFG QDKCDTLPQV CRECEVLFAC QGECPKNRFL TTPAGETGLN
YLCEGWKAFF RHIDFPMQIL AGLIRRGYPA SEVMRVMALE DAFARAGRND PCPCGSGRKF
KRCHGLRKTN AKGETRIR