Gene Mbar_A2518 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A2518 
Symbol 
ID3625613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp3176615 
End bp3178315 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content44% 
IMG OID637701380 
Productsulfate transporter 
Protein accessionYP_306011 
Protein GI73669996 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.440075 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.566321 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCTG CGCAGAATTC CAGAAGATCT TTTCATTATT CCCTCTTTCA GGGAATATTA 
CCCCTCAAAT CCAAGCAAAT ACCCCTGGAA ATCGCCGCAG GGATAACATT CGCAGCGTTT
GCCATCCCCG AAGTGATGGG ATACACTAAA ATTGCAGGCA TGCCTGTAGT TACTGGAATC
TACACGATCC TGTTTCCAAT GTTAGCTTTT GCCATTTTTG GTTCATCCCG TCATCTTGTC
GTTGGCGCCG ATTCCGCAAC CGCGGCAATT ATAGCAAGTG GACTAACAAC GATAGCCGTG
CCGGGATCTT CCCAATATGT CGCATATGCG AGCGCAATCG CTTTACTCGT AGCAATCTTG
CTTTTCCTTG GAGGCCTGCT ACAACTTGGT TTTCTCGCGG ACTTCCTTTC CTATACGATT
TTGATAGGAT TTCTCACAGG TGTTGGTATC TATATCTCTA TATCACAGAT TAGCGGGATG
CTTGGGATAC CTTCAGAACC GGATGGAACA TCTGTGCAGA TTCTCTCCTT AGTAAGAAAC
CTCTCGCTTA CGAATACTTC CACACTCCTA GTCTCATTAT CCGTAATCGG GGTTATTGTT
CTTGGCGAAA AGCTCAGCCA CAAATTTCCA GGTTCATTAA TAGCGATTAT TGGCGCAATT
GCTGCAAGCC GGATATTAGA TCTTTCTTCT TACGGGATTA GCATCCTTGG CGCAGTACCA
CAAGGTCTGC CGCAAATTTC TTTACCTCAA ATCCCGCTTT CAAATCTCCC AGAGATCTTT
AACATATCCA TTTCTTGTTT TATAATTATC CTTGCCCAGA GTGCTGCAAC ATCTCGTGCT
TATGCCATTA AGTTTTCCGA CACTTTTAAT GAAAATACAG ATCTCATCGG ATTGAGCCTT
GCTAATGCGG CGGCAGGGAT ATCAGGGACT TTCGTTGTCA ACGGCAGTCC AACCAAGACT
GAAATGATCA AAAATGCCGG AGGTCGGACA CAGCTTACTC AGCTCACAAC GGTCTTCACT
GTTATATTAG TCCTATTGTT CTTTACCAGG CCATTCGCCT ATTTACCCAC AGCTGTACTT
TCGTCTATGG TATTCCTCAT TGGTTTGCAT CTTATCGATA TCAAAGGGAT GACCTCCCTT
CACAGACAAC GACCTGTCGA GTTTGCTGTC GCCTTGATAA CGGCTATAAC GGTCGTAGTT
ATAGGTGTCG AGCAGGGAAT CCTTATAGCT ATTGTACTCT CGATCATTGC TCACCTGCGC
CATAGTTACA GGCCACTTAA TTTGCTGCTT GTTCCAAAGG CCGGAGGGGC CATGAAGACG
TTTCCACTAG AAAGTGGGCA GCAAGCTGTT GAAGGATTGT TAATCTACCG TTTTGGTTCA
AACCTCTACT TTGCTAACGA AGGCCGCTTC GCGGAAGAAA TAATAGATCT TGCCATAAAA
AATGGTTCAC TTAAATGGTT TTGCATTTCT GCCACAAACA TTGGAGATAT CGATTTCACC
TCTGCAGAGA CACTCAAAAA AGTGTATACA GAACTGCAAA AACTGGACAT TACTCTTGTA
TTAAGTGAAG TAGTACAGCC TGTGATGAAT GAGCTGGATA GAGACGGCAT AACTCAAATG
ATTGGTAAAG ACCATATCTT TGAATCAGTT CAGGATGTTA TAGAAGAGTA TAAGAGATCA
ACAGATAGTT CGTTACGCTA G
 
Protein sequence
MKPAQNSRRS FHYSLFQGIL PLKSKQIPLE IAAGITFAAF AIPEVMGYTK IAGMPVVTGI 
YTILFPMLAF AIFGSSRHLV VGADSATAAI IASGLTTIAV PGSSQYVAYA SAIALLVAIL
LFLGGLLQLG FLADFLSYTI LIGFLTGVGI YISISQISGM LGIPSEPDGT SVQILSLVRN
LSLTNTSTLL VSLSVIGVIV LGEKLSHKFP GSLIAIIGAI AASRILDLSS YGISILGAVP
QGLPQISLPQ IPLSNLPEIF NISISCFIII LAQSAATSRA YAIKFSDTFN ENTDLIGLSL
ANAAAGISGT FVVNGSPTKT EMIKNAGGRT QLTQLTTVFT VILVLLFFTR PFAYLPTAVL
SSMVFLIGLH LIDIKGMTSL HRQRPVEFAV ALITAITVVV IGVEQGILIA IVLSIIAHLR
HSYRPLNLLL VPKAGGAMKT FPLESGQQAV EGLLIYRFGS NLYFANEGRF AEEIIDLAIK
NGSLKWFCIS ATNIGDIDFT SAETLKKVYT ELQKLDITLV LSEVVQPVMN ELDRDGITQM
IGKDHIFESV QDVIEEYKRS TDSSLR