Gene Mbar_A3073 details

Gene Information

Locus tag: Mbar_A3073
Symbol:
ID: 3625084
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosarcina barkeri str. Fusaro
Kingdom: Archaea
Replicon accession: NC_007355
Strand:
Start bp: 3954672
End bp: 3957026
Gene Length: 2355 bp
Protein Length: 784 aa
Translation table: 11
GC content: 49%
IMG OID: 637701913
Product: arylsulfatase
Protein accession: YP_306543
Protein GI: 73670528
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
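
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length does not count the terminal stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming one uncounted terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(3954672, 3957026)
print(length, protein_length_aa(length))  # prints: 2355 784
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields of the record.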


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.328282
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACTAA AAGAATACAA ATCAGGAACC ACTTTTCCAG GTGTAATCGG GCGCACGTTT 
GACAAGTCTG AACCAGCCTG GCCGGAACCA CTGCGCGCCA AAGAGGGGGC TCCAAACGTA
CTTTTCATTG TGCTGGACGA CACAGGCTTT GGACAACTTG GGTGCTATGG CAGCCCTATC
CAGACACCAA ACCTGGAAAG CCTGGCTGCA GAAGGCCTCA TTTACAGTAA TATGCATACG
ACTGCGCTTT GCTCTCCTAG TCGTTCATGC ATATTGACCG GCCGAAATCA CCATTCAAAC
AACATGGCCT GTATTACAGA GGGTTCAACA GGCTATCCCG GCTACAACGG CTATATACCT
TTTGAAAATG GCTTTTTGTC AGAGATACTT CTCGAGCATG GATACAACAC TTATGCTATA
GGCAAATGGC ATCTGACGCC TGCAGACCAG ATCTCGGCAG CCGGGCCTTA TGACCGCTGG
CCTTTAGGAA GAGGTTTTGA ATGTTTTTAT GGTTTTTTAG GAGGTGAGAC TCACCAGTAT
TACCCTGAAC TGACCTATGA TAATCATTCT GTAAACCCGC CCAAGACTCC AGAAGAAGGT
TACACCTTGA ATGAGGATCT GGCTGACAGG GCTATCCAGT TCATTGCCGA TGCCAAACAA
GTAGCTCCAA ACAAGCCTTT TTTCATGTAT TTCTGCACAG GAGCCATGCA TGCTCCTCAT
CATGTTCCTA AAGAATGGGC AGACAAATAT AAAGGGAAAT TCGATGACGG TTGGGAGGCT
TACAGGGAGA AAACATTCGC ACAGCAGAAG GAACTGGGCA TCGTTCCAAA AGATGCAAAA
CTATCCCGTC ATGACCCGGA TGTCAAGCCC TGGGAAGAAT GCTCACCGGA AGAAAAGAAG
CTCTATGCCC GCATGATGGA AGTCTTTGCC GGATTTCTGG AACACACCGA TTACCACATC
GGCAGGTTGC TGCAGTTTCT GAAAGACATT GGTGAATTTG AAAACACCTT AATCATGGTG
ATTTCCGACA ATGGTGCCAG CTCCGAAGGC GGCTCTGCGG GATCAGTCAA CGAGAATTTA
TTTTTCAACA ACGTGCCTGA GTCGCTAGAA GAGAACCTCT CATTGCTGGA TAAGCTGGGG
GGCCCGGAGA CCTTCAACCA CTATGCCTGG GGCTGGACCT TTGCTGGAAA CACACCCTTC
CGCCGCTGGA AACGTGAAAC CTACCGTGGC GGTGTCAGCG ACCCCTTCAT AGTCCACTGG
CCCAGGGGGA TAAAGGCAAG AGGTGAAGTC CGAAATCAGT ATGCTCACGT TATCGATATG
ATACCTACAG TTCTGGACTG CCTGGGAATC GAACCTCCGA CTGCTATCAA AGGTGTAACC
CAATCGCCTA TTGAAGGCAT AAGTTTTGCG CATACTCTCG ACAATGCCAG TGTACCTACC
AGACACCACA CCCAGTATTT CGAGATGATG GGACACCGCT CCCTGTATCA TGATAGCTGG
CGTGCCGTAT GCCCATGGCC TGGCCCTTCA TTCACTGAAG CAGGAAAACC CTTTGGTGAA
CCGATTACTG CAGAAAAGTT GACCGATCTG GACGCAAAAG GCTGGGAACT TTACAATGTT
CAGAAGGATT GGACAGAGAA TGAGAATGTA GCTGCAGAAA ACCGGCCGAA GTTAATCGAG
ATGATTGCTA CCTGGTATGC AGAGGCAGGA AAATATAATG TGCTGCCTAT CGATGCCCGA
GGAGTGCTGC GCCTTGCTGA TGAACGGCCA CAGATTGCAG CCGATCGGAC TAACTACGTT
TATTATCCCG GAACTCAGCC AGTTCCCGCA AACGCAACTG TCAATGTTCT TAACCGCGCA
CACAGCATCA CTGCCGATGT AGAGATTCCA CCAGAGGGTG CTGAAGGAAT CTTGCTCGCC
CATGGTGGTA TCGACGCCGG CTATTCATTT TATATTAAAG GTGGAAAGCT GCACTGGGTA
CATAACTATG TAGCCAAGGC CCTCTATCAT GTAGAATCCG GAGAAAATGT TCCAGAAGGA
CGGCATCAAC TGCGCTTCGA GTTTGAGGTG ACCGGTAAGC CAGATGTTGC CAATGGTAAG
GGTACACCTG GAAAGGCTCA GCTCTATATA GACGGAAAAT TAGTCGGCCA GGCTGAGATC
CCTGTAACGA CTCCGCTTAT ACTCGGGCTA ACCAGTGGGA TTACCTGTGG CTCAGCTCAT
GGATCACCGG TTACACCTGA TTACGAGCCT CCTTTTGAGT TCACAGGCAA GATCTACAGC
GTAAATGTGG ACGTGAGCGG CAAACTGATC GAGGACAAGG AAGCTGAAAC ACGCATGGTC
ATGGCAAGGC AATAA
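
The GC content field under Gene Information is the percentage of G and C bases in a sequence such as the one above. A minimal sketch of that calculation follows, assuming the sequence is pasted as text with the spacing shown here; `gc_content` is a hypothetical helper name, and how the source database rounds the value is not stated.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters are ignored, so the
    10-base blocks used in this record can be pasted unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found in input")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence, used here only as a short example.
first_row = "ATGCCACTAA AAGAATACAA ATCAGGAACC ACTTTTCCAG GTGTAATCGG GCGCACGTTT"
print(round(gc_content(first_row), 1))
```

Running the function on the complete gene sequence gives a figure that can be compared with the reported GC content.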
 
Protein sequence
MPLKEYKSGT TFPGVIGRTF DKSEPAWPEP LRAKEGAPNV LFIVLDDTGF GQLGCYGSPI 
QTPNLESLAA EGLIYSNMHT TALCSPSRSC ILTGRNHHSN NMACITEGST GYPGYNGYIP
FENGFLSEIL LEHGYNTYAI GKWHLTPADQ ISAAGPYDRW PLGRGFECFY GFLGGETHQY
YPELTYDNHS VNPPKTPEEG YTLNEDLADR AIQFIADAKQ VAPNKPFFMY FCTGAMHAPH
HVPKEWADKY KGKFDDGWEA YREKTFAQQK ELGIVPKDAK LSRHDPDVKP WEECSPEEKK
LYARMMEVFA GFLEHTDYHI GRLLQFLKDI GEFENTLIMV ISDNGASSEG GSAGSVNENL
FFNNVPESLE ENLSLLDKLG GPETFNHYAW GWTFAGNTPF RRWKRETYRG GVSDPFIVHW
PRGIKARGEV RNQYAHVIDM IPTVLDCLGI EPPTAIKGVT QSPIEGISFA HTLDNASVPT
RHHTQYFEMM GHRSLYHDSW RAVCPWPGPS FTEAGKPFGE PITAEKLTDL DAKGWELYNV
QKDWTENENV AAENRPKLIE MIATWYAEAG KYNVLPIDAR GVLRLADERP QIAADRTNYV
YYPGTQPVPA NATVNVLNRA HSITADVEIP PEGAEGILLA HGGIDAGYSF YIKGGKLHWV
HNYVAKALYH VESGENVPEG RHQLRFEFEV TGKPDVANGK GTPGKAQLYI DGKLVGQAEI
PVTTPLILGL TSGITCGSAH GSPVTPDYEP PFEFTGKIYS VNVDVSGKLI EDKEAETRMV
MARQ
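
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as starts), and that translation stops at the first stop codon; `translate` and the table constants are illustrative names.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    dna = "".join(seq.split()).upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First and last rows of the gene sequence in this record.
print(translate("ATGCCACTAA AAGAATACAA ATCAGGAACC ACTTTTCCAG GTGTAATCGG GCGCACGTTT"))
# prints: MPLKEYKSGTTFPGVIGRTF
print(translate("ATGGCAAGGC AATAA"))
# prints: MARQ
```

Both outputs match the start and end of the protein sequence listed above.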