Gene Mpal_2356 details


Gene Information

Locus tag: Mpal_2356
Symbol:
ID: 7272077
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 2499689
End bp: 2500729
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 58%
IMG OID: 643570959
Product: glycosyl transferase family 2
Protein accession: YP_002467362
Protein GI: 219852930
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
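
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2499689, 2500729)
print(length)                  # 1041
print(protein_length(length))  # 346
```

Under those assumptions the values agree with the record: 1041 bp is 347 codons, of which 346 encode residues.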


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.934543
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.35695
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAGATG AACCTGCAGT CTCGATCGTA CTGGTAAACT GGAACGGCTG GAAGGACACC 
GTGGAGTGTC TGACCTCCCT CTCTCGCCTG CAGTACCACA GGATCCAGAT CGTGATCGTA
GACAATGGGT CGACCGATGG CTCAGTGGAG AAGATCGGTG ATTACTGTCA GGGTCGTCTC
AACGTCACCT CCCCGTTCTT CCCGGACCAG CCGGCAGTAG TAGCAGTCTC ATTCTCTCTA
CTGACTGCAG AAGAGGCCAG GTCGACCGGA GCCGTAAACG CCGAAACGGG CACGGTCACC
GTCATCACCA ACCAGAGGAA CCTCGGATTC GCCGAGGCGA ACAATCAGGG GACCCGGTTC
GCTCTTCGGG CGTTCGAGTC CGATTATGTC CTCTTTCTGA ACAACGACAC CATTGTGGAC
CCCGGATTCC TCACCGCATT CATTGCGGTC GCCAAAGAGG ATCCGTCGAT CGGGTTCCTC
GGCCCCAAGA CCTGCTACTA TGACTACCAG GGGCGACGGG ATGTGATCAA CTTCGCTGGT
GGGGAACTGA GTCTCCTCAC CGGGAACACC GTGCACATCG GCCAGAACCA GCCGGACCAG
GGGCAGTTCG ACACCCAGAG GACCGTCGAC TATGTCGAGG GGTCCTGCCT TCTGGCCCGT
TCCTCGATGC TCCGGCAGAT CGGTCTCCTC GACCCCGGCT ACTTCGTCTA CTATGAGGAG
AACGACCTTG TTATGCGGGG GAGAGAAGCA GGATTCTCGG CCGTCTATGT CCCGACAGCG
GTGATCTGGC ACAAGGTCTC GGCCTCCTCC AAGAAGACCC CTATCAAGAC CTACTACATG
GCCAGGAATC GATTTTGGTT CATGAAGCGG CATGCCGGGT GGCATTATCC GCTCTTTCTG
ATCGTCTTCT TCCTCAGTTC ATTCTGGCTC TCGACCGGGA TCCATCTCCT CTACTACAAG
AGCCCAGACG CTTTTCGGGC TTACGCACGA GGAATCAGGG ACGGCCTCAG AGGACCGGCC
CCTCTCCCGG AGACCCTCTA A
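
The gene sequence is printed in space-separated blocks of ten bases, so the spacing has to be removed before any computation. As a minimal sketch, the helpers below (hypothetical names, not part of the source database) strip whitespace and compute GC content as the percentage of G and C bases. Whether the record's GC content field was computed in exactly this way is an assumption.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of bases that are G or C."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, used only as a small example.
fragment = "ATGCGAGATG AACCTGCAGT"
print(len(clean_sequence(fragment)))  # 20
print(gc_percent(fragment))           # 50.0
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field in the Gene Information section.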
 
Protein sequence
MRDEPAVSIV LVNWNGWKDT VECLTSLSRL QYHRIQIVIV DNGSTDGSVE KIGDYCQGRL 
NVTSPFFPDQ PAVVAVSFSL LTAEEARSTG AVNAETGTVT VITNQRNLGF AEANNQGTRF
ALRAFESDYV LFLNNDTIVD PGFLTAFIAV AKEDPSIGFL GPKTCYYDYQ GRRDVINFAG
GELSLLTGNT VHIGQNQPDQ GQFDTQRTVD YVEGSCLLAR SSMLRQIGLL DPGYFVYYEE
NDLVMRGREA GFSAVYVPTA VIWHKVSASS KKTPIKTYYM ARNRFWFMKR HAGWHYPLFL
IVFFLSSFWL STGIHLLYYK SPDAFRAYAR GIRDGLRGPA PLPETL
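
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a simplified illustration with hypothetical names. It builds the codon table from the conventional TCAG ordering and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. It does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    dna = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final 21 bases of the gene sequence.
print(translate("ATGCGAGATG AACCTGCAGT CTCGATCGTA"))  # MRDEPAVSIV
print(translate("CCTCTCCCGG AGACCCTCTA A"))           # PLPETL
```

The two fragments match the start and end of the protein sequence listed above.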