Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_1799 |
Symbol | |
ID | 7270345 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 1892021 |
End bp | 1893982 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643570414 |
Product | hypothetical protein |
Protein accession | YP_002466828 |
Protein GI | 219852396 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAACC AGATGACTGA TCCGAACTAT CGTGCATCGC TGTCAGTACT GACCAGTATC GTCCTGATCC TGCTCGCAAT CACCATTACT GTTCCTGTTG GAGCAGAGAG TGTACTTTCC GATCACTATC ATATCAATAT CGCCACGGTG AATGGTGGAG AGCGATACTT CAAGCTTGAC GATCCCGGGA TGAATGCGCT TCACATCTCC GGCGACTCCT CGAATTATTT TGGAAATTCG ATCGCGTCTG ACTCCCAGTC GGGCTCCTTC TTTATGACCG ATACCGGGGG GAGAGGGTTT GATGACGACG GGATCCTGAT GCTCGCCGTA AAGGGTGACG TCCCTGACGA CTTCAAGGTC CATATCCGGG CGAGCGGCTA CACCTGGACG CCGTCCTCTG TGGTTAATCA AAAACCCAGT ATCTATGCTC ATGAGGATGG ATCTGTCGAC AGTACCTTCT CAAAGGGAGA TTTTGCCTAC GGACCTCAGA ACTGGAAGCC GAGCCCTGCG TCGAACTATC CGATCTATGA TGGCCAGGAT ATGGGGGATG CTTCGCAGCA GTTCCATCTG GTTTTTATCG ATCTGAATGC AGGGATTCTG GGCCAGAAAT CAGGCATCGA TCCTTCGACT CTTCAGGATA ATGGTGCGAT CAGGGTCGAG TACTCGTTCG AGAATTTGAA TACCTTCGCC GCGTTCGACG CCTATGCCTG GTGTCTGAAC TCCAACCAGG GGGAAGGGGT CACATGGACC AATCGTCTCT CTGCAGCGGG CAGCAGCGGT TTCACGGTGA AGGGGACTGA ACCCACCCCC ACGACAGAAC CGACGACAAC AGAACCGACA ACGACGGTTG CAACCCCGAA CGAGACGGTA TCGGTCACGC CGACAGAGAC CCCAACGTCT GTCGAGACGA CCGTGAATGT TACATCAACC GTCACCATCA ATGAAACCAC ACCGACAACG ACGGCGGTCC CCTCCACCAC GGTATCTCCC ACCTCCGCTC CGACCACGCC GACTGCAGCG ACCGGTACGA CAGTGACGTC CACGCAGCCG GTCAGCACCG CCACTACTGC TGGCGTTCTG GTGACTCCGA CCTCCCTGCC AACCGGCACG ACCCCCGGAG TCATCGTCTC CCAGACGACA GTCCCGGCAT CGCTGACCAC CGTTCCATCG GCCACGGAGA CCCAGAGTCC CGAACCGTCG GTTTCAGTTC CTGTTCTGAG CACACTCTTC CCTTCACATG CCCAGACCGC CGTTCCGACC CAGGCTGCAG TGACCAGTGC TCCTCAGACG CAGTCTCCAA CAGTCGGTAC AGGGACGGTC GTATCAGATG TCACTGTACA GGCAACCGCG ACCCCGCAGA ATGCACAGAC GGTCGCGTCC GGCGCCACCA CTGTGCAGGC GACTGTTTCC TCAGGGAGTA CCCAGACGAT CGCTACCCAA TCCTCTTCAC AGCCGGCCGG TAGCAATTCC GCAGAAGGGC CGGTTTCGAC CGGAGACTCC TCCTCCGGTT CATCGAGCGA CGACTACACC GGAGTCGGAG GGACGGTTGG GACGACGGCG ACGCCGGCCC CCACGACCGC CACGCCGACC CAGACGCAGT CAGCCGATAC AACGCCCACC CCGACTCAGG CGAATTCCAG CACCCCGGTG ATGACGACGC CGACGCTCCC CCTTATCGAT ACCGGGAACC AGAACGAGTA CTCGCCGGTC TCTATCTCCG GTTCGTCGGG TGCCGCCTCG TCCCGCGATC GGAGTTCATC TCAGAGTTTC CTCTCGACGA TCCAGTCGAC CATCGATCGG CTATCATCGT CGGACTTGAG TCTGCTTCTG CTGATCGGAG CGGTCCTGTT CTTCTTCCTG CTCGTCTTTG CAGGGTTGAT CATCATGGTC CTGCTCCTGC TGCTGCTTCT GGTCGGGATC CTGTATCTCC GGCAGAGGAG GGAGGAACAG CATGAGAATT GA
|
Protein sequence | MTNQMTDPNY RASLSVLTSI VLILLAITIT VPVGAESVLS DHYHINIATV NGGERYFKLD DPGMNALHIS GDSSNYFGNS IASDSQSGSF FMTDTGGRGF DDDGILMLAV KGDVPDDFKV HIRASGYTWT PSSVVNQKPS IYAHEDGSVD STFSKGDFAY GPQNWKPSPA SNYPIYDGQD MGDASQQFHL VFIDLNAGIL GQKSGIDPST LQDNGAIRVE YSFENLNTFA AFDAYAWCLN SNQGEGVTWT NRLSAAGSSG FTVKGTEPTP TTEPTTTEPT TTVATPNETV SVTPTETPTS VETTVNVTST VTINETTPTT TAVPSTTVSP TSAPTTPTAA TGTTVTSTQP VSTATTAGVL VTPTSLPTGT TPGVIVSQTT VPASLTTVPS ATETQSPEPS VSVPVLSTLF PSHAQTAVPT QAAVTSAPQT QSPTVGTGTV VSDVTVQATA TPQNAQTVAS GATTVQATVS SGSTQTIATQ SSSQPAGSNS AEGPVSTGDS SSGSSSDDYT GVGGTVGTTA TPAPTTATPT QTQSADTTPT PTQANSSTPV MTTPTLPLID TGNQNEYSPV SISGSSGAAS SRDRSSSQSF LSTIQSTIDR LSSSDLSLLL LIGAVLFFFL LVFAGLIIMV LLLLLLLVGI LYLRQRREEQ HEN
|
| |