Gene Mpal_1998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1998 
Symbol 
ID7270804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2124771 
End bp2125910 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content56% 
IMG OID643570613 
Productglycosyl transferase group 1 
Protein accessionYP_002467024 
Protein GI219852592 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.312706 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTT TGCGTGTTGC CAGTGATCTC TACCCTACCA CGGTGGGTGG GTACGGGATC 
CATGTGCACG AGATGTCAAA GATGCAGGCA ACACTCGGCC ATGATGTAAC GGTCGTCACC
TCCAATCCCA ATGGCCTACC GGAGGAGGAG TACGTCGATG GGTATCGGGT GCTCAGGTTC
AACCATGCGA TCAAGATGGT TGGGAACACG ATCAGCCCGA CGATGATGTT TCGGCTCCTC
GAGATGAGGA ACGACTATGA TATCATCCAC GCCCATTCGC ACCTCTTCTT TCCGACCAAC
GTCTGTGCCC TGGTGAAACG GTACGGTTCC TCGCCGCTGG TGATCACCAA TCATGGGATC
ATGTCGGCCA GTGCTCCGCA GTGGTTGAAC ATCTCGTACA TGTCGACGAT CGGGAAGTGG
ACGCTGAACT CTGCCGACCG TGTTATCTGT TACACTGACA TCGAGCGGAG ACGGTTGATC
GAGGAGTTTG GAATCAATAA GCCAGAAGTG GTCGTGATCC CGAACGGGGT GAACACCGAG
GTCTTTCACC CGGACGATAG CAAGGTGGAC GAGCGCTACT TCACGCTGCT CTGGGTGGGC
AGGATCGTCA AGGGCAAGGG GGTTGAGTTT CTGATCCACG CCGCACACCG GGTGCTTGAG
AAGATTCCGA ATCTGCGGAT CCTGGTGATC GGGGAGGGAC CTGAACGGGA TGAGATCTGG
AAACTGGTCA ATGAGTATCA TATGACCAAT ATCGTGACGA TCCTGCCGTT CATGAGTTAC
GATGCCATCC CTCTGCTGTA CCAGCGCTCT GACGTGCTGG TGCTGCCGAG TCTGCAGGAG
GGGGTACCCC GGACGATGCT GGAAGCGATG GCATCCGGAA AGCCGGTGAT TATTTCAGAG
TTCGATCATC TGAAGGATCT CGCCGAGGGG GCTGCGCTGA TGTTTCCCAA AGGAGACGTG
GCGGCCCTTG CAGAAAAGAT CCTGCTGCTC GAACAGGACC GTGAGCGTGT TCATCGGATG
GGCGTCTGTG GAAGAGAGCG GATATTGGCG CAGAACTCGT GGAAGAACAC GGTTCTTCGG
ACGATCACCC TGTACCAGGA ACTGCTCTCT CAAGATCAGT CGCCGGTGAA GCGTCACTGA
 
Protein sequence
MKILRVASDL YPTTVGGYGI HVHEMSKMQA TLGHDVTVVT SNPNGLPEEE YVDGYRVLRF 
NHAIKMVGNT ISPTMMFRLL EMRNDYDIIH AHSHLFFPTN VCALVKRYGS SPLVITNHGI
MSASAPQWLN ISYMSTIGKW TLNSADRVIC YTDIERRRLI EEFGINKPEV VVIPNGVNTE
VFHPDDSKVD ERYFTLLWVG RIVKGKGVEF LIHAAHRVLE KIPNLRILVI GEGPERDEIW
KLVNEYHMTN IVTILPFMSY DAIPLLYQRS DVLVLPSLQE GVPRTMLEAM ASGKPVIISE
FDHLKDLAEG AALMFPKGDV AALAEKILLL EQDRERVHRM GVCGRERILA QNSWKNTVLR
TITLYQELLS QDQSPVKRH