Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_0715 |
Symbol | |
ID | 4463488 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 751807 |
End bp | 754926 |
Gene Length | 3120 bp |
Protein Length | 1039 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 639699725 |
Product | Na+/solute symporter |
Protein accession | YP_843145 |
Protein GI | 116754027 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.723376 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGCGCAC CTCTCTTCGC GTTCTCCATC ATACTTACGT ACCTCCTCCT CCTGAGCATC ATAGCATATT ACGCCGACAG GCAGAGGCAG GCGGGCAGGA GCATCGTCTC TAATCCATAT GTCTATGCTT TATCCCTTGC AGTATACTGT ACAGCATGGA CATTCTATGG AAGTATTGGA AGGGCAGCAA CAAGCGGCCT CGGTTTTCTC ACGATATACA TCGGACCAAC ACTTGCAATG CTCCTCGGAT GGGTGATGAT ACGCAAGATA GTCCGCATAT CAAAGGAGTA TCGACTCACG TCTATCAGCG ATTTCATAAG CTTCAGGTAC GGCAGAAGCT ATGCAATAGG CGCGATAGTG ACCATTGTGA GCATGATGGT AGTCATACCG TATGTCGCAC TCCAGTTGAT CGCGATATCA AGCTCAATAC AGATAATAAG CGGAGGAGAT ACATTCTGGG GTACAAAGCT CTCCGTGGCG GTCCTGCTCG CTGTATTCGC CATAATATTC GGAGCGCGCC ACCTGGATCC GATGGAGCGG CACGAGGGTT TGGTGGCAGC GGTTGCGTTT GAATCAATTG TCAAGCTGGC CGCATTTGTT GTGGCAGGTG TTTACATAAC ATGGGGGATA TTCAACGGAT ACTCTGAGAT AATAGACCGC GTCCTCTCAA CAGAGAATTT CTCTCATCTC ATAAACATAG ATTACACATC CTGGTTCTCG CTCACCCTGA TATCCTTCTT CGCTGCGTTT CTTCTCCCCA GACAGTTCCA CGTCATGGTT GTTGAGAATG CTGATGAATC GCATATACGA AAGGCGATGT GGCTCTTCCC GCTTTACCTC CTTCTGATTA ATCTTTTCGT TCCAGCGATA GCCGGAGCAG GCCTGCTTCT GGGAGTTCCT GGCGTGAAAG ATATGTTCGT GATAGAGATC CCGTACTCCG CGGGCAACAT ACCTCTCGCA GTGCTCGTGT TCATAGGAGG AGCGTCTGCT GCCACTGCGA TGGTGCTGGT GGACGGTGTT GCAGTTGGCC ACATGATGCT GAACGAGCTG GAGCTGCCGT ACCTCATGAG GTACTTGGGG AGGGGAAGAG GTCTCCCCGG GCTTCTGCTC AACGCTAAGC GCATCAACAT CATTCTTGTG GTGATGCTCG GATACCTTTA CTCCAGGGTC GTCGAGTACC AGAGCCTCGT GGATATAGGT TTGATATCAT TCGTGGCAGC AAGCCAGATG GGGCCAGCGG TGATCGGGGG CCTTTACTGG AGAAAGGGCA GCAGGGAGGG GGCGATCGCA GGCATGAGCG CAGGGTTTGT GCTCTGGCTC TACACTGCAC TCATCCCCAC AGTTGTCAAG GCCGGCTGGC TTCCGCAGTC CATTCTCGAA TCAGGACCCT TCGGGATATC TGCGCTCACT CCGACTAACC TCTTTGGTGT GGATCTGGAT CCCTGGACTA ACTCGGTGTT CTGGAGCATC TTCGCAAATG CGAGCCTGTA TGTGCTATTC TCCCTGATGA GCAGCCCGAC GCCTGAGGAG AGGGTCCAGG CAGAGGGGTT CGTTGAGATA TTCAGCGAGA GAAGGGAAGC AGTTCCGATC GAGAGACCTG CCATAAGACT GGGTACGGTC GACGAGGTTG AGTCGATGCT CGCCAGGTAT ATAGGAGCGG AGAAGGCCAG GATGCTGATA GATGCGGATC TTGCGAGGCT TGGAGTCTCA AGAGATAAAG TCGATGCCAG GCACCTCCTG GACCTCTGGG ATCATGTGGA GAGGGTTCTC ACAGGCTCGC TTGGCACATC AACAACAAGG ATAATAGTTG AGGAGCATAT CACACCCAGG CCAGTTGTTG AGAGGGTGGA AGCAGCTCCG CAGAAATTCA GTCTCGAGCC TGGAAAGATC TACTTCTCAG CTCAGAATGC CTATGAGGTG TTCACGGATC AGGTAACGCA TGGATTTGAG GGGCTGTGTG TCACACGCAG ACCGCCGGAG GATGTGAGAA GCAGGTACAG TCTCAGGAAG ACGCCGATCA TATGGCTTAA CCAAAAGAGA GAGGGTGGTG AGAAGCAGAT ATCTCCCACA AATCTTCCAC TGCTCTTCCT CACGATAAAG ACATTCGTCG AGACCTGCAG GAAGGGTATA ATACTTCTGG ACAATCTCGA GCATCTTGTT CTTGTCAATG AGAATGTGAT ACCTGCGGAG GATCTGCTGG ATTTTGTCAA CCAGCTGGAG AATCTAGTCC ACAGAACGAA CACCAGACTC ATCCTGGCGG ATTCGTCTGA TTTCATGGGG TTCTGTGCTG TATCTGAGGC GGAGCCTGTG GAAGTAAGGG GTCTGATATT CACGGCAGGC CCTCTGCCGT CTTATCTCCT CAGGGTTTTT ATACTTGCCA TAATCGGCGG GACGAGATCT CCAGAGGCTG CAATGGACAT CGCAAATTCC GTTCTCAGCG AACAATCAGA GGTCGCAGAG GGCGCATCAT GCGATCCCGA CATGCGCGGC AGGGGATTGA TCGAGATAGA TACGCGGTGC AAGATCACGA GAAGGCATTT CTTCACGATA ATAAGACGCA TCTGCACCTC CGTGAGCAGG GTTGATCCCG ACTTCGATTC TGTGAAAGTA TTGAGGCCGC TCATCGAAAA GTACGGCTTC AGCATCTATG AGCTGATACT GAATCCAGGA ACGACATATG CGATCGAGGA GGACAAGCCG GTCCGGTGCT TCGAGATATT CAGCGAGCTG GTACACGCTG GGTTAGAGGG GTTGTGCATA TCCAGGTACA ACCCTGAGAG CCTCCGTGAG AAGTACGGGA TCTCACCTGA AACAGTCATA TGGCTCACAC AGAAGACAGA GGAGGGGAAG TTCAGATCCG TGGATCCAAC GAACTTCCCG AGGCTCAGCT CAATGATATC AGATTTTCTG AGGAGGACCG AGTACCCTGT GATACTCCTG GAGGGTATTG GATACCTGAT AACGCAGAGC AACTATGAGA CCGTGCTGAG GTTTATACAG TCGCAGAGGG ATGAGGTCTC GCTGAGAGGA GCTGTGATGC TCGTGCATAT CGATCCTCTC TCTCTTGACA CAAAGGAGCT GCACAGGCTG GAGGGAGAGA TGGAACAGCT GGAGATCTGA
|
Protein sequence | MSAPLFAFSI ILTYLLLLSI IAYYADRQRQ AGRSIVSNPY VYALSLAVYC TAWTFYGSIG RAATSGLGFL TIYIGPTLAM LLGWVMIRKI VRISKEYRLT SISDFISFRY GRSYAIGAIV TIVSMMVVIP YVALQLIAIS SSIQIISGGD TFWGTKLSVA VLLAVFAIIF GARHLDPMER HEGLVAAVAF ESIVKLAAFV VAGVYITWGI FNGYSEIIDR VLSTENFSHL INIDYTSWFS LTLISFFAAF LLPRQFHVMV VENADESHIR KAMWLFPLYL LLINLFVPAI AGAGLLLGVP GVKDMFVIEI PYSAGNIPLA VLVFIGGASA ATAMVLVDGV AVGHMMLNEL ELPYLMRYLG RGRGLPGLLL NAKRINIILV VMLGYLYSRV VEYQSLVDIG LISFVAASQM GPAVIGGLYW RKGSREGAIA GMSAGFVLWL YTALIPTVVK AGWLPQSILE SGPFGISALT PTNLFGVDLD PWTNSVFWSI FANASLYVLF SLMSSPTPEE RVQAEGFVEI FSERREAVPI ERPAIRLGTV DEVESMLARY IGAEKARMLI DADLARLGVS RDKVDARHLL DLWDHVERVL TGSLGTSTTR IIVEEHITPR PVVERVEAAP QKFSLEPGKI YFSAQNAYEV FTDQVTHGFE GLCVTRRPPE DVRSRYSLRK TPIIWLNQKR EGGEKQISPT NLPLLFLTIK TFVETCRKGI ILLDNLEHLV LVNENVIPAE DLLDFVNQLE NLVHRTNTRL ILADSSDFMG FCAVSEAEPV EVRGLIFTAG PLPSYLLRVF ILAIIGGTRS PEAAMDIANS VLSEQSEVAE GASCDPDMRG RGLIEIDTRC KITRRHFFTI IRRICTSVSR VDPDFDSVKV LRPLIEKYGF SIYELILNPG TTYAIEEDKP VRCFEIFSEL VHAGLEGLCI SRYNPESLRE KYGISPETVI WLTQKTEEGK FRSVDPTNFP RLSSMISDFL RRTEYPVILL EGIGYLITQS NYETVLRFIQ SQRDEVSLRG AVMLVHIDPL SLDTKELHRL EGEMEQLEI
|
| |