Gene Mpal_1799 details


Gene Information

Locus tag: Mpal_1799
Symbol:
ID: 7270345
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 1892021
End bp: 1893982
Gene Length: 1962 bp
Protein Length: 653 aa
Translation table: 11
GC content: 58%
IMG OID: 643570414
Product: hypothetical protein
Protein accession: YP_002466828
Protein GI: 219852396
COG category: [R] General function prediction only
COG ID: [COG3889] Predicted solute binding protein
TIGRFAM ID:
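
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes a single terminal stop codon. Both are assumptions about how the record was produced, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1892021
end_bp = 1893982
reported_gene_length = 1962      # bp
reported_protein_length = 653    # aa


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


gene_length = span_length(start_bp, end_bp)
protein_length = expected_protein_length(gene_length)

print(gene_length == reported_gene_length)        # coordinates agree with Gene Length
print(protein_length == reported_protein_length)  # Gene Length agrees with Protein Length
```

Under these assumptions both comparisons hold: 1893982 - 1892021 + 1 = 1962 bp, and 1962 / 3 - 1 = 653 aa.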


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAAACC AGATGACTGA TCCGAACTAT CGTGCATCGC TGTCAGTACT GACCAGTATC 
GTCCTGATCC TGCTCGCAAT CACCATTACT GTTCCTGTTG GAGCAGAGAG TGTACTTTCC
GATCACTATC ATATCAATAT CGCCACGGTG AATGGTGGAG AGCGATACTT CAAGCTTGAC
GATCCCGGGA TGAATGCGCT TCACATCTCC GGCGACTCCT CGAATTATTT TGGAAATTCG
ATCGCGTCTG ACTCCCAGTC GGGCTCCTTC TTTATGACCG ATACCGGGGG GAGAGGGTTT
GATGACGACG GGATCCTGAT GCTCGCCGTA AAGGGTGACG TCCCTGACGA CTTCAAGGTC
CATATCCGGG CGAGCGGCTA CACCTGGACG CCGTCCTCTG TGGTTAATCA AAAACCCAGT
ATCTATGCTC ATGAGGATGG ATCTGTCGAC AGTACCTTCT CAAAGGGAGA TTTTGCCTAC
GGACCTCAGA ACTGGAAGCC GAGCCCTGCG TCGAACTATC CGATCTATGA TGGCCAGGAT
ATGGGGGATG CTTCGCAGCA GTTCCATCTG GTTTTTATCG ATCTGAATGC AGGGATTCTG
GGCCAGAAAT CAGGCATCGA TCCTTCGACT CTTCAGGATA ATGGTGCGAT CAGGGTCGAG
TACTCGTTCG AGAATTTGAA TACCTTCGCC GCGTTCGACG CCTATGCCTG GTGTCTGAAC
TCCAACCAGG GGGAAGGGGT CACATGGACC AATCGTCTCT CTGCAGCGGG CAGCAGCGGT
TTCACGGTGA AGGGGACTGA ACCCACCCCC ACGACAGAAC CGACGACAAC AGAACCGACA
ACGACGGTTG CAACCCCGAA CGAGACGGTA TCGGTCACGC CGACAGAGAC CCCAACGTCT
GTCGAGACGA CCGTGAATGT TACATCAACC GTCACCATCA ATGAAACCAC ACCGACAACG
ACGGCGGTCC CCTCCACCAC GGTATCTCCC ACCTCCGCTC CGACCACGCC GACTGCAGCG
ACCGGTACGA CAGTGACGTC CACGCAGCCG GTCAGCACCG CCACTACTGC TGGCGTTCTG
GTGACTCCGA CCTCCCTGCC AACCGGCACG ACCCCCGGAG TCATCGTCTC CCAGACGACA
GTCCCGGCAT CGCTGACCAC CGTTCCATCG GCCACGGAGA CCCAGAGTCC CGAACCGTCG
GTTTCAGTTC CTGTTCTGAG CACACTCTTC CCTTCACATG CCCAGACCGC CGTTCCGACC
CAGGCTGCAG TGACCAGTGC TCCTCAGACG CAGTCTCCAA CAGTCGGTAC AGGGACGGTC
GTATCAGATG TCACTGTACA GGCAACCGCG ACCCCGCAGA ATGCACAGAC GGTCGCGTCC
GGCGCCACCA CTGTGCAGGC GACTGTTTCC TCAGGGAGTA CCCAGACGAT CGCTACCCAA
TCCTCTTCAC AGCCGGCCGG TAGCAATTCC GCAGAAGGGC CGGTTTCGAC CGGAGACTCC
TCCTCCGGTT CATCGAGCGA CGACTACACC GGAGTCGGAG GGACGGTTGG GACGACGGCG
ACGCCGGCCC CCACGACCGC CACGCCGACC CAGACGCAGT CAGCCGATAC AACGCCCACC
CCGACTCAGG CGAATTCCAG CACCCCGGTG ATGACGACGC CGACGCTCCC CCTTATCGAT
ACCGGGAACC AGAACGAGTA CTCGCCGGTC TCTATCTCCG GTTCGTCGGG TGCCGCCTCG
TCCCGCGATC GGAGTTCATC TCAGAGTTTC CTCTCGACGA TCCAGTCGAC CATCGATCGG
CTATCATCGT CGGACTTGAG TCTGCTTCTG CTGATCGGAG CGGTCCTGTT CTTCTTCCTG
CTCGTCTTTG CAGGGTTGAT CATCATGGTC CTGCTCCTGC TGCTGCTTCT GGTCGGGATC
CTGTATCTCC GGCAGAGGAG GGAGGAACAG CATGAGAATT GA
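
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be stripped before anything is computed from it. The sketch below shows one way to recompute the GC percentage of such a block-formatted sequence. The helper names are hypothetical, and how the record rounds its reported GC content is not stated, so no particular rounding is assumed here.

```python
def clean_sequence(block_text: str) -> str:
    """Remove the spaces and newlines used for 10-base block formatting."""
    return "".join(block_text.split()).upper()


def gc_percent(block_text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(block_text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage on the first line of the gene sequence only (not the whole gene).
first_line = "ATGACAAACC AGATGACTGA TCCGAACTAT CGTGCATCGC TGTCAGTACT GACCAGTATC"
print(len(clean_sequence(first_line)))   # number of bases on that line
print(round(gc_percent(first_line), 1))  # GC percentage of that line
```

Running `gc_percent` on the full 1962 bp sequence gives a value that can be compared with the GC content field of the record.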
 
Protein sequence
MTNQMTDPNY RASLSVLTSI VLILLAITIT VPVGAESVLS DHYHINIATV NGGERYFKLD 
DPGMNALHIS GDSSNYFGNS IASDSQSGSF FMTDTGGRGF DDDGILMLAV KGDVPDDFKV
HIRASGYTWT PSSVVNQKPS IYAHEDGSVD STFSKGDFAY GPQNWKPSPA SNYPIYDGQD
MGDASQQFHL VFIDLNAGIL GQKSGIDPST LQDNGAIRVE YSFENLNTFA AFDAYAWCLN
SNQGEGVTWT NRLSAAGSSG FTVKGTEPTP TTEPTTTEPT TTVATPNETV SVTPTETPTS
VETTVNVTST VTINETTPTT TAVPSTTVSP TSAPTTPTAA TGTTVTSTQP VSTATTAGVL
VTPTSLPTGT TPGVIVSQTT VPASLTTVPS ATETQSPEPS VSVPVLSTLF PSHAQTAVPT
QAAVTSAPQT QSPTVGTGTV VSDVTVQATA TPQNAQTVAS GATTVQATVS SGSTQTIATQ
SSSQPAGSNS AEGPVSTGDS SSGSSSDDYT GVGGTVGTTA TPAPTTATPT QTQSADTTPT
PTQANSSTPV MTTPTLPLID TGNQNEYSPV SISGSSGAAS SRDRSSSQSF LSTIQSTIDR
LSSSDLSLLL LIGAVLFFFL LVFAGLIIMV LLLLLLLVGI LYLRQRREEQ HEN
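
The protein sequence can be checked against the gene sequence by translating the gene codon by codon. Below is a minimal sketch, not the pipeline that produced this record. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in the set of permitted start codons). The function name `translate` is hypothetical, and the sequence is assumed to be given 5' to 3' on the coding strand.

```python
# NCBI-style compact codon table: bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(block_text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(block_text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":   # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 42 bases of the gene sequence above.
print(translate("ATGACAAACC AGATGACTGA TCCGAACTAT"))
print(translate("CTGTATCTCC GGCAGAGGAG GGAGGAACAG CATGAGAATT GA"))
```

The first call yields `MTNQMTDPNY` and the second `LYLRQRREEQHEN`, matching the start and end of the protein sequence listed above. The final codon `TGA` is the stop codon and is not counted in the 653 aa protein length.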