Gene Mpal_0531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0531 
Symbol 
ID7271947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp532541 
End bp533806 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content54% 
IMG OID643569178 
Productprotein of unknown function DUF21 
Protein accessionYP_002465627 
Protein GI219851195 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0811523 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTCAGT TAGACCTTTT TAACGCTTCG ATTTTTGGAC TCTGTATCAT TCTTTCGGCC 
TTTTTCTCCA GTTCTGAAGT GGCGCTGATC TCGATCACAC GAGCCAAGGT CCGGGCATTG
GTGAATGATG GGCGGTCAGG CGCAGCCCAA CTCTTTAAAC TCAAGCAGAA TCAGGATCAT
ATCCTGATCA TCCTCCGGAT CGGAAACACC ATCGCGATCG TGGCCGCGGC CGCGGTGGCC
ACCTCCATCG CGATCGAGGC ATTTGGGGAT CCCGGTCTGG GGATCGCGAT CGGAGGTACA
GTGCTGATTC TGCTGATCTT CGGTGAGATC GGACCGAAAC TCTTTGCGAC CCGGTACACC
GAACCCCTGG CCCTCAGGGT GGCTCCCCCG ATTCTCTTCC TCTCCCGGGT CGTCGGTCCG
TTCCTCTGGT TATCAGATAA GGTCAGCCGT TCACTGGTCC CTGGAGATGT CTCTACTGAA
CCAACGGTGA CTGAGGATGA GATCCGGGAA TGGATCGATG TCGGTATGGA GGAGGGGACG
ATCGAGCAGG AGGAGCAGGA GATGCTCTAC AATGTTCTGG AACTCGGGGA CACAACCGCT
CGCGAGGTGA TGACCCCCCG CGTCGACGTC GCGATGATCG AGGACACCAG CACCCTTGAG
AGTTCCCTCA CTATCTTCCA TGAGACTGGT TTCTCCAGGC TCCCGGTTTA TCACGAGAAG
ATCGATAACC TGACAGGGGT CCTCAACATC AAGGATGTCT ACACGGTGAT CGTCGGACAT
AAGAAGGATG TAAAGATCTC GGATCTGATG TACGATCCGT ATTTCGTCCC TGAAACCAAG
AAGATCGACG ACCTCTTAAA GGAGTTGCAG CTCAGAAAAG TTCCGATGGC GATGGTGATG
GATGAATATG GTGGTTTTGT TGGGGTTGTG ACGGTCGAGG ACATCCTTGA GGAACTGGTT
GGTGACATCC TCGATGAGTT CGATGATGAG GAACCCGAAC TGTCCCGGAT CGGCGAGGGG
ATCTATATGC TGGATGCGCG TATGTGGGTC GATGATCTGA ACGAACAACT GGATATTGCG
CTTCCGACCT CCGATACCTA CGAGACGATC GGGGGGCTGT TGATCGAGCA GCTCGGCCAT
ATTCCGCATC CTGGTGAGAC AGTCAGGGTC GAGGAGAGCA ATGCGACGCT GGTCGTTATG
CAGATGCGGG GCAAACGGAT CGTCAAGGTG AAGATGATCC TCTCCGATGA TCGGGAGAAG
CGTTAA
 
Protein sequence
MLQLDLFNAS IFGLCIILSA FFSSSEVALI SITRAKVRAL VNDGRSGAAQ LFKLKQNQDH 
ILIILRIGNT IAIVAAAAVA TSIAIEAFGD PGLGIAIGGT VLILLIFGEI GPKLFATRYT
EPLALRVAPP ILFLSRVVGP FLWLSDKVSR SLVPGDVSTE PTVTEDEIRE WIDVGMEEGT
IEQEEQEMLY NVLELGDTTA REVMTPRVDV AMIEDTSTLE SSLTIFHETG FSRLPVYHEK
IDNLTGVLNI KDVYTVIVGH KKDVKISDLM YDPYFVPETK KIDDLLKELQ LRKVPMAMVM
DEYGGFVGVV TVEDILEELV GDILDEFDDE EPELSRIGEG IYMLDARMWV DDLNEQLDIA
LPTSDTYETI GGLLIEQLGH IPHPGETVRV EESNATLVVM QMRGKRIVKV KMILSDDREK
R