Gene Mbar_A2700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A2700 
Symbol 
ID3624810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp3424584 
End bp3425726 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content45% 
IMG OID637701554 
Productarylsulfatase regulator 
Protein accessionYP_306184 
Protein GI73670169 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00619293 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.114693 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTTTC ATGTGATGCT AATCCCTACC CTTGGTTGTC CTGCTAACTG CAGCTACTGC 
TGGAGTTCCG AAGAAAAGTC TCCGATAATG AGCATTGAAA CTATAAAAGA AGTGGTTGAG
TGGCTTAAAA CTTTTAGAGG AGATGCCGTA ACTTTCACTT TCCACGGCGG AGAACCACTC
CTGGCAGGAG CGGAATTTTA TCGGGAAGCA TTGCCTCTTC TGGTTGAGGG CTTGAGCCCG
AGGAAAATCG CGTTTGCGAT ACAGACGAAC CTCTGGAAAA TGACCCATGA GATGGCCGAA
ATTTTTGCGG AATACGGGGT TCCGATAGGC TCCAGCCTGG ATGGCCCAAA GGAACTTAAT
GACCTGCAGA GGGGAAAAGG ATACTACGAT AAAACCATGA AAGGCTATGA AATTGCCAGG
GAACATGGGC TTAATGTGAG GTTCATTACC ACTTTCACTT CTCATTCCGT AAAACAAAAG
GAAGAAATTT TCAATTTTTA TCTTGAGAAA GGATTGACTC TCAAACTCCA CCCCTGCCTG
CCTTCTTTAA AAGGTGACAA TCCTGATAAA TGGACTCTTG CTCCTGTGGA GTACGGAGAA
TTATTAATCT ATCTCCTGGA CAAATATCTG GAAAACCTGG GCCGGATTGA CGTTATGAAC
ATTGATCAGC TGTGCAAATG CGTATTCACA GGCAGGGGAA CAGTCTGCAC CTACGTTGAC
TGCATGGGAG ATACCTTTGC AGTCGGCCCT GAAGGAAACA TATATCCCTG CTATCGCTTT
GTTGGGATGC CTGAATATGT TATGGGCAAT GTTTATGACC GCCCGACAAT GGCAGACCTT
GCTAAATCCG AAGCCTGGAA GCAGATGCAC CAGTATAAAG AATATGTGGA TACTGCATGC
AGCAAATGCG CCCATATCAA ATATTGCAGA GGCGGATGCC CGTACAATGC AATAGTGCCC
ACCGATGGCG AGATAAAGGG TGTAGATCCG CACTGCACCG CCTACAAGAT GATTTTTGAT
GAAATAAACA AGCGTGTCAA TGAGGAAATG TTCGGGGGTT CAGGTATGGA TAATATGTTT
ATGCCCCAGA CAATGAAGCC TTCAAAATCA GGAATAATGT CCCTTATGCT TAAGAAACTC
TGA
 
Protein sequence
MPFHVMLIPT LGCPANCSYC WSSEEKSPIM SIETIKEVVE WLKTFRGDAV TFTFHGGEPL 
LAGAEFYREA LPLLVEGLSP RKIAFAIQTN LWKMTHEMAE IFAEYGVPIG SSLDGPKELN
DLQRGKGYYD KTMKGYEIAR EHGLNVRFIT TFTSHSVKQK EEIFNFYLEK GLTLKLHPCL
PSLKGDNPDK WTLAPVEYGE LLIYLLDKYL ENLGRIDVMN IDQLCKCVFT GRGTVCTYVD
CMGDTFAVGP EGNIYPCYRF VGMPEYVMGN VYDRPTMADL AKSEAWKQMH QYKEYVDTAC
SKCAHIKYCR GGCPYNAIVP TDGEIKGVDP HCTAYKMIFD EINKRVNEEM FGGSGMDNMF
MPQTMKPSKS GIMSLMLKKL