Gene Mbar_A0416 details

Gene Information

Locus tag: Mbar_A0416
Symbol:
ID: 3626979
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosarcina barkeri str. Fusaro
Kingdom: Archaea
Replicon accession: NC_007355
Strand:
Start bp: 501106
End bp: 502146
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 39%
IMG OID: 637699310
Product: hypothetical protein
Protein accession: YP_303979
Protein GI: 73667964
COG category: [S] Function unknown
COG ID: [COG3391] Uncharacterized conserved protein
TIGRFAM ID:
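
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 501106
end_bp = 502146
gene_length_bp = 1041
protein_length_aa = 346


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming a terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values against the lengths reported in the record.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)
```

Under those assumptions, both comparisons hold for the values listed here.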


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0322809
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.695963
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
ATGAAGTGTC AAAAAGTTAA TATTTTTAGG AAAATATTGT TTTGTTTGGT TTTATTATTT 
CTTTGTTTGA TGAGCGCTTC TGCTACATAT GCTGAGACTT ATAATTTTGT TACTAAATGG
GGTTCTTATG GCAGTGGCAG TGGACAATTT GCATGTCCAA CTGGTGTTGC TGTAGATTCT
TCGGGTAACG TTTATGTTAC CGATACTGGC AATCACCGCA TTCAGAAGTT TAATAGCACA
GGCGGTTACC TCACTCAATG GGGTTCTAAT GGAACCGACA ACAGACAATT TTTTTTACCA
TATGGTGTTG CTGTCGATTC TTCGGGTAAT GTTTATGTTG CCGATAAGGG TAATAAATGC
ATTCAGAAGT TTAACAGCAA CGGCGGACAC CTCACTCAAT GGGGTTCTTC AGGCAATGGA
AACGGACAAT TTTATTTCCT AAATGGTGTT GCTGTAGATT CTTCGGGTAA TGTTTATGTT
GCCGATAGTG GTAATAATCG CATTCAGAAG TTTAACAGCA ACGGCGGATA CCTCACTCAA
TGGGGTTCTT ATGGTAGCGG CAACGGACAA TTTAATGATC CTGAGGGCGT TGCTGTAGAT
TCTTCGGGTA ATGTTTATGT TGCCGATAGT GGTAATAATC GCATTCAAAA ATTTAACAGC
ACAGGCGGAT ACCTCACTCA ATGGGGTTCT TATGGTAGCG GCAACGGACA ATTTGAATTT
CCGTTGAGTA TTGCTGTAGA TTCTTCGGGT AATGTTTATG TTGCCGATAA ATATAATCAG
CGCATTCAGA AGTTTAACAG CATAGGCAGA TACCTCACTC AATGGGGTTC TAATGGAACC
GACAACAGAC AAATTTATGA CCCAAATGGT ATTTATGACC CAAATGGTGT TGCTGTAGAT
TCTTCGGGTA ATGTTTATGT TGCTGAAACA GGATATTCAC GCATTCAGAA GTTTGCTCCA
AATTTCGTAG ATTTTCCTTC AATTATTGTA CCTGTTGCTG CAATGCTTGT TTTAACAGTA
ATATTTAGAC GTAAAAAATA G
 
Protein sequence
MKCQKVNIFR KILFCLVLLF LCLMSASATY AETYNFVTKW GSYGSGSGQF ACPTGVAVDS 
SGNVYVTDTG NHRIQKFNST GGYLTQWGSN GTDNRQFFLP YGVAVDSSGN VYVADKGNKC
IQKFNSNGGH LTQWGSSGNG NGQFYFLNGV AVDSSGNVYV ADSGNNRIQK FNSNGGYLTQ
WGSYGSGNGQ FNDPEGVAVD SSGNVYVADS GNNRIQKFNS TGGYLTQWGS YGSGNGQFEF
PLSIAVDSSG NVYVADKYNQ RIQKFNSIGR YLTQWGSNGT DNRQIYDPNG IYDPNGVAVD
SSGNVYVAET GYSRIQKFAP NFVDFPSIIV PVAAMLVLTV IFRRKK