Gene Mpal_0738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0738 
Symbol 
ID7270472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp736901 
End bp738784 
Gene Length1884 bp 
Protein Length627 aa 
Translation table11 
GC content58% 
IMG OID643569381 
Producthypothetical protein 
Protein accessionYP_002465825 
Protein GI219851393 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.140724 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTCCC ATACTCTCTT CACTCACTCT CTTTTCCGTG TCTGCAGTCT CCTGATCTGC 
CTTGTCCTGC TGGTCAGCGC AGTTTCAGCC GCCACAACCG AGGTTCATGT GGTGAAGACC
GCGGCCGACG GCAGCACCGT CCTCGCCGAA CGGACCGTCG ACTACCAATG GCTCGAATCA
AACCTGCCCG TGCTCGGTGA CGGCGTCACC CACTACTACC TCCAGGGCCC CGTCTTCGAC
GGCGATGTGT GGGATCCCAC CGAGTCGGTG AACGTCGAGG AGAAGGATAT GGGCGCCCTG
AAAGGGACCG ATCTCCGCGA TATCTGCGAC CTCGTGGGGG GAATGCAATC CGACGATACG
CTTGTCGTCA AGGCCTCAGA CGGCTTCTCG AAGCGATTCT CGTACAATAC CATCTATACT
CCCCCAGCTC GGGAGGGTCC GCTCGTCCTG ACCTGGTACA GGGGCGACGA AGGGTACGTC
CCGTCGTACT CGAACGGCAT GCGGATCGTA TTCTTCGCCG ACACCTCCGT GAACCCCTGG
GGCTATCACG TCTTCGGCCT CAGCGACATG AAGGCTTCGA TGCCCTCGAA CGAATGGAAC
TTCTTTGACA TCTACCCGAC CACGACCGGG CTCTCGGCCA TGTATGTGAA CGAACTCCGG
ATCGTCTCGA CTCAGCAGGC GCCCACAACC ACTGAGACGA CCACCTCCGC GACACCGACA
CTAACGCCCA CAACAACCCC AACCGACACC CAGACCGAGC TGCCAACGGT GGTCGTTACG
ACTATAAACA CCACGCCGAC GTCGACAACC ACAGCAACCC CGACTGACAC CCAGACCGAG
CAGCCGACAG TGGTCGTCAC GACCATAAAC ACCACACCAA CGTCGACAAA CACTGCAACC
CCATCCGTCA CTCAGACCCA GACACCCACA GCGACCACCA CGACACTCAC GATCATCCCG
ACGGTCTCTG TCACCGGAAC GACGGTGCCG ACGACTGTGG GGACGATCGA ACAGGTCTCC
CCCACGACGG TGGCGACCAC CCCGTCGATA ACCACAGTCA GAACCATCGA TACCGGAGTG
ACTACCAGAT CCACCCCGGT CGTAACAGCC AGCACCGCCC CTGCGGTATC CACAATACCA
CAGACCACAA CAGCACTCGC CCCGGTCATA TCGTCCCCCT CAACCACGAG TCCAGTACCC
TCGATCTCCC TATCAGGTAT GAATACGCTC TTACCCTCCC ACGAGCAGAC CGTCGTTCCA
GCTCCAGACA TCACCAGCAC CGGTATCCAG TCCTCGACGA CATCCAGTGC AGGAACCATC
GCACCGATCA TCACAGCCAC GAGCAGGTCT TCAGTTCAGA CAGCCAGTTC TTCTGGAGGA
GCCCCCGTCT CAACCGGAGA TTCATCTTCA GACTCATCGA GCGACGACTA TACCGGCGTC
GGTGGGACAA ACACAACGAC CGCAACGACG GCGATCACAA CTGTAACTAC ACCTCATACA
CCGACTGGAA ATGCAACGCC CACTCCGACC CAGGTAAATA CCAGCACTCC AGTGACGACA
ACGTCGACCC CACCAGTTCT CGACACAGGA AGTCAGGACG AATACTCGCC GCTGATCCTT
CCCGGAGGTT CGAACTCAAA ATCCTCGTCT GGAGGACAGA CCTCTACCAA GAACTTCCTC
TCCACGATAC AGTCGGCCAT CGACCGCCTC TCGTCGTCAG ACCTGAGTAT CTTCCTGCTG
ATCGCGGTGG CACTGCTCTT CATCCTGCTG ATCTTTGCAG GGCTGATCAT CATAGTTCTC
CTCCTGCTCC TGGCACTGGC CGGGATCCTG TACCTGCGGC AACGAAGGGA GAAGGAACAG
AAGGATGAAC AGAACAGAGA ATGA
 
Protein sequence
MESHTLFTHS LFRVCSLLIC LVLLVSAVSA ATTEVHVVKT AADGSTVLAE RTVDYQWLES 
NLPVLGDGVT HYYLQGPVFD GDVWDPTESV NVEEKDMGAL KGTDLRDICD LVGGMQSDDT
LVVKASDGFS KRFSYNTIYT PPAREGPLVL TWYRGDEGYV PSYSNGMRIV FFADTSVNPW
GYHVFGLSDM KASMPSNEWN FFDIYPTTTG LSAMYVNELR IVSTQQAPTT TETTTSATPT
LTPTTTPTDT QTELPTVVVT TINTTPTSTT TATPTDTQTE QPTVVVTTIN TTPTSTNTAT
PSVTQTQTPT ATTTTLTIIP TVSVTGTTVP TTVGTIEQVS PTTVATTPSI TTVRTIDTGV
TTRSTPVVTA STAPAVSTIP QTTTALAPVI SSPSTTSPVP SISLSGMNTL LPSHEQTVVP
APDITSTGIQ SSTTSSAGTI APIITATSRS SVQTASSSGG APVSTGDSSS DSSSDDYTGV
GGTNTTTATT AITTVTTPHT PTGNATPTPT QVNTSTPVTT TSTPPVLDTG SQDEYSPLIL
PGGSNSKSSS GGQTSTKNFL STIQSAIDRL SSSDLSIFLL IAVALLFILL IFAGLIIIVL
LLLLALAGIL YLRQRREKEQ KDEQNRE