Gene Mbar_A3194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A3194 
Symbol 
ID3627136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp4108213 
End bp4109262 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content45% 
IMG OID637702033 
ProductO-GlcNAc transferase, p110 subunit 
Protein accessionYP_306658 
Protein GI73670643 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.756541 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0088423 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAGATTG ATGCTATAGA TGAGCTGGTT TTTCGTGTAA TCGAACAGGT TGCAGCTGAT 
AACTGTGACT TCGATGAGGT CGTTAAGTTT GTCGTTTCCT TCTCAAATGA GCATTCGCTT
TCCCAGGAAA TCCTACTGAA ACTTTCATCT ATTTTTGATA AAGATTCAAT GTACAGGGAG
AAGTATGTAA TTACCAGAGC CTGCGCTTCG TTATTTTCAG GAAAAGCCAG GGAAGACACC
CTTATGGAGG CAGGAAAAAC AGCTTTTCTA CTGGGACTTG ATGAACTTGC TGTACGTGAA
TTCAAAGAAA TCCTTCAGCA GAACCCCCAA AATGTCGAAG CACTTAGTGG GTATGGAACA
GTTCTGGCTA AGGAAGGAAA AAACGCTGCT GCACGGATTC AATACGAAAA AGCCCTGGAA
TTCAATCCTT ACCACGTAGA TACGCTTTGC AATTATGGCT ACCTGCTTTA CAGGCTCAAA
AAACTGGATG AGGCCGAAGA AGTATATAGC CGTGCTTTGA TTCTTGACCG GGAGAATGTA
AGCGCACACT GTGGGTATGG GATTCTACTT TCCAAACGCG GACAAAAGAA TGAAGCAAGT
TACCATTATA CTCGAGCTCT TGAGCTTGAT CCTGGCCATG TGGAATCAAA TTTTCGCTAT
GCCCGTCTCC TTGAAGAAAA GGGAGAACCC CTTGATGCCG AGAAACATTA CATAGTAGCC
CTCAAAGCTG AGTCTGCTGA TCCGCGCCCG CACATTTTCT ATGCCCGTTT GCTTGCAGAA
CACGGTTTTA TTCATGGGGC AAGGGTGCAC TTCAGGTGTG CTCTTAAACT GAACCCTGAA
GATGTTGAAG CTCATTGTGA GTATGCCAGG CTGCTTGCCA GGTTTGGGCA CAGGCACGAA
GCTGAGGTAC AGTACAAAAA AGCCCTTGAG CTTGACCCCG GGCATTTCGG GACTCTGAAA
GGTTACGCGG AGTTACTAAA AGAAAAAGGG CAGTATGCAG CCGCCGAAGA AATTTACAGA
CAGGTTGAGT TATTTAAACG GAGTGCCTGA
 
Protein sequence
MEIDAIDELV FRVIEQVAAD NCDFDEVVKF VVSFSNEHSL SQEILLKLSS IFDKDSMYRE 
KYVITRACAS LFSGKAREDT LMEAGKTAFL LGLDELAVRE FKEILQQNPQ NVEALSGYGT
VLAKEGKNAA ARIQYEKALE FNPYHVDTLC NYGYLLYRLK KLDEAEEVYS RALILDRENV
SAHCGYGILL SKRGQKNEAS YHYTRALELD PGHVESNFRY ARLLEEKGEP LDAEKHYIVA
LKAESADPRP HIFYARLLAE HGFIHGARVH FRCALKLNPE DVEAHCEYAR LLARFGHRHE
AEVQYKKALE LDPGHFGTLK GYAELLKEKG QYAAAEEIYR QVELFKRSA