Gene Mpal_2369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_2369 
Symbol 
ID7272090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2514661 
End bp2516400 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content57% 
IMG OID643570970 
ProductNHL repeat containing protein 
Protein accessionYP_002467373 
Protein GI219852941 
COG category[S] Function unknown 
COG ID[COG3391] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.547997 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACAATC GGTACATGCT GGAAGAGCTT GAAGTGATGC TCGTCCTGTT GCTGTTCGGG 
TGCAGCGTTC AGGTCGCAAC GGCGGCGGAT ACGTATCAGT TCGTGGCGCA GTGGGGGACC
AGTGGCTCAG GGGACGGGCA ATTCAACTCC CCGTATGGAG TCGCGCTCGA CGGTGTGGGG
ACCGTTTATG TCACCGACAA GAACCTCGAC CGAGTCCAGC AGTTCAACGC CACCGGCGGT
TTTATTAGAA CATGGGGAAC TCCCGGATCT GGAGACGGAC TGCTCTGGAA CCCCAAGGGA
ATCGCAATCA ACAGCGCGGG AAACGTCTAT ATTGTCAACA ACTGGAACGA TAGAGTTCAG
CGGTTCACCT CGACCGGCAT CTTCCTCGCA CGGTGGGGGA CCGGCGGCAC CGGGGACGGG
CAGTTCAAAT CCCCATCTGG GGTCGCGGTC GACAGCGCAG GAAACGTCTA TGTCGCCGAC
ATGTACAACT ACCGGGTTCA GAAATTCTCC TCAGCCGGCA CTCTCCTCGC GAAGTGGGGA
ACCGAAGGAG GGGGAGACGG GCAGTTCGAT TACCCGACCG GGATCGCGGT TGACAGCGAG
AACAACGTCT ACGTCGTTGA CTCCTATAAT AACCGGGTCC AGAAGTTCAC CTCGAACGGT
ACCTTCCTTG CGAAGTGGGG AGCCAGGGGA TCTGGAGACG GAGAGTTTGC GGATTTCCCA
GAAGAGATCG CAGTTGACAG CACAGGTAAC GTCTTTGTCA CCGACACCGG AAACAACAGA
ATTGAGAAAT TCACCTCGAA CGGCACCTTC CTCGCCAAGT GGGGAGGACG CGGCTCAGGG
GACGGGCTGT TCGAATCACC CACCGGGATC GCCGTCGACA GCGCGGGAAG GATCTATATC
GCCGATACCG GCAACCACCG GATCCAGATG TTTGCTTATC CAACACCGAC TGAAATTCCA
ACCGTCGTTG TGCCGACCAC TATACCAACC GGGATTCCGA CTGTCGTTGA GCCAACCGTC
ACGATCACCG CCACGCCTTC GGCAACGACC ACCGTTCCAA CCGAAATTCC CTCGGTGATC
GTGCCGACCG TCACAGTCAC CCCAAGCATC GAGATACCGG TCTTCCGATT CACCCCGCCT
TCGCTCGCCA TCCCACGTGG GTCAACAAAC ACGACCACCC TCTTCCTCGA TGGTTTAAAC
ATTACCTCGG GCTACAACGT GACGCTCGGT CTCACCCTCC TCAATCCGGC AATCAGCGAG
ATAGTCGGTG TATCCCTCCC TGGGTGGACG ATGGCGAAGA ATCTGACCCT CCCCTCTGAT
ACGGTATGGT TGACCACACT CAATACAGCG GGCCTGGGCG AACCCACTGC TTCAGGCGCC
AGGATCGGAA CGATCACTAT CAGGGGTGAC CAGGGTGGAG AGACCGCGCT CTGCGTCATT
CAGGCGTATG TCAGCGATGA AAAAGGAAAA CCTGTCGCAC CCCAACTGGC AGTCTGCCCG
ATCGAGGTAA CGATCCCCTA TCGTCCGCTC CCTGGCACTT CGTCCAATCC AAATGATCTC
AACGGGGACG GCCTCTATGA AGATATCAAC GGGGACGGGG TGCTCGACTT CAATGACGTG
ATTCTCTTCT TCAACCAGAT GGACTGGATC GCTGATAATG AGCCTGTCAT CGGCTTCGAC
TTCAATGAAA ACGGGCAGAT CGACTTTAAT GACGTGGTGC TTCTCTTCAA CCAACTCTGA
 
Protein sequence
MHNRYMLEEL EVMLVLLLFG CSVQVATAAD TYQFVAQWGT SGSGDGQFNS PYGVALDGVG 
TVYVTDKNLD RVQQFNATGG FIRTWGTPGS GDGLLWNPKG IAINSAGNVY IVNNWNDRVQ
RFTSTGIFLA RWGTGGTGDG QFKSPSGVAV DSAGNVYVAD MYNYRVQKFS SAGTLLAKWG
TEGGGDGQFD YPTGIAVDSE NNVYVVDSYN NRVQKFTSNG TFLAKWGARG SGDGEFADFP
EEIAVDSTGN VFVTDTGNNR IEKFTSNGTF LAKWGGRGSG DGLFESPTGI AVDSAGRIYI
ADTGNHRIQM FAYPTPTEIP TVVVPTTIPT GIPTVVEPTV TITATPSATT TVPTEIPSVI
VPTVTVTPSI EIPVFRFTPP SLAIPRGSTN TTTLFLDGLN ITSGYNVTLG LTLLNPAISE
IVGVSLPGWT MAKNLTLPSD TVWLTTLNTA GLGEPTASGA RIGTITIRGD QGGETALCVI
QAYVSDEKGK PVAPQLAVCP IEVTIPYRPL PGTSSNPNDL NGDGLYEDIN GDGVLDFNDV
ILFFNQMDWI ADNEPVIGFD FNENGQIDFN DVVLLFNQL