Gene Mpal_0149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0149 
Symbol 
ID7270920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp176322 
End bp177455 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content62% 
IMG OID643568808 
ProductCBS domain containing protein 
Protein accessionYP_002465265 
Protein GI219850833 
COG category[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG1994] Zn-dependent proteases
[COG3448] CBS-domain-containing membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0113637 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGGAT CGCTCAAAAT CGGACGTATT TTAGGAATTC CGATCTATCT TCACTTCACA 
TTTCTGCTGG TTATCCCCCT CTTCGCCTGG ATCATCGGCA GCCAGATCGA AACGACCGTC
CAGCTCCTGG CCCAGATCTT CTCGGTCACG ATCGATTCAT CCCTGATCAC GACGGGGCCG
GCCCCGTACC TGCTCGGGGC CCTGATCGCC CTCGGTCTCT TTGCCGGGGT GCTGGTCCAT
GAAGTCGCCC ACTCAGTGAT CGCAAAGAAG CGGGGGATCC GGATCAACAA CATCACCCTC
TTCCTCTTCG GCGGAGTCTC ATCGATGGAG GAGGGGACGC CCGACCCGAA GGTCGAACTC
CCGATGGCCC TGGCCGGGCC ACTCACCTCC CTCGGTCTCG GGATCCTCTC GATCGGGATC
ATCTACCTGA TCCCGCTGAT CGTCGAGTCG CCGGCGATCG CCGGGGTGCT GATCTTCCTC
TTCGCTTACA CCGGGCTGCT CAATGTGATC CTCTTCGCGT TCAACCTGCT GCCGGCCTTC
CCCATGGACG GCGGCCGGGT ACTGCGGGCC TTCCTTGCCC AGCGGATGCC GGCGACCAGG
GCGACCAGGA TCGCCAGCGA GGTCGGCAAA GGTTTTGCGG TCTTCTTCGG GATCTTCGGG
TTTCTGGCAT TCAACCCGAT CCTGATCATC ATCGCGTTCT TCATCTATAT CGGAGCCAGC
CAGGAGTCGT CAGCCGTCCG CTACACCTCC CTGCTGCAGG ACCTGACCCT CGGGGCCGTG
ATGTCCACTG CAGTGATGAC GGTCTCTCCA CAGACCCCGG TCCTGGAGAT GCTCGACCAG
ATGTATGCCA CCAAGCATCT CGGGTTCCCG GTGGTCGACC GGGGGATCGT GGTCGGGATG
ATCACCCTCT CCGACCTGCA CCGGGCCTCG CCGATCGACC GGGACGCCCT GCAGGTGCGG
GACCTGATGA CCAGGGAGGT GGTCTCACTG CCGCCACAGG CACCAGTGGC AGAGGCGCTC
AGAGTGATGT CCGAGCGGAA CATCGGGCGG ATCCCGGTGC TGGAGAACAC CGAACTGGTC
GGGCTTGTAA CCAGGACCGA TATCATCAAA GTGATGCAGC TCCGTGAAGT CTGA
 
Protein sequence
MEGSLKIGRI LGIPIYLHFT FLLVIPLFAW IIGSQIETTV QLLAQIFSVT IDSSLITTGP 
APYLLGALIA LGLFAGVLVH EVAHSVIAKK RGIRINNITL FLFGGVSSME EGTPDPKVEL
PMALAGPLTS LGLGILSIGI IYLIPLIVES PAIAGVLIFL FAYTGLLNVI LFAFNLLPAF
PMDGGRVLRA FLAQRMPATR ATRIASEVGK GFAVFFGIFG FLAFNPILII IAFFIYIGAS
QESSAVRYTS LLQDLTLGAV MSTAVMTVSP QTPVLEMLDQ MYATKHLGFP VVDRGIVVGM
ITLSDLHRAS PIDRDALQVR DLMTREVVSL PPQAPVAEAL RVMSERNIGR IPVLENTELV
GLVTRTDIIK VMQLREV