Gene Information |
Locus tag | Mthe_0672 |
Symbol | |
ID | 4463312 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 708154 |
End bp | 710055 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639699682 |
Product | Na+/solute symporter |
Protein accession | YP_843102 |
Protein GI | 116753984 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | |
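The coordinate and length fields in this table can be cross-checked against each other. The sketch below is a minimal consistency check, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names `span_length` and `expected_protein_length` are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 708154
END_BP = 710055
GENE_LENGTH_BP = 1902
PROTEIN_LENGTH_AA = 633


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Both assertions hold for this record: 710055 - 708154 + 1 = 1902 bp, and 1902 / 3 - 1 = 633 aa.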
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGATCCAT ATCACATATT CCTCCTCATC CTGGGCGTAT ATATTTCTGT GCTCGTCGGC ATAGGCTTCA TCTCATCGAA GCGGCAGGGA TCTGTGACTG AGTTCTGGCT GGCAAGCCGG GAGCTGGGGG CAGCCTCCTT GGGATTCTCG GCTGCGGCTT CATGGCTCAC GGCCAGCGCT TTGCTGCTTT CCACAGGTCT TTTCATGCTC ATAGGCGTCA GCTCGATATG GGTGTGGGTA TTTCCGAACA TAGCTGCGCT GGCGATCATA GCGCTGATCG CAGGCAGGAT CAAGCGCATA CCTGCGATGA CCCAGCCTGA GCTTCTGGAG ATAAGATACG ATCCAATGGT CAGAGCTCCG GTCGCGGTGG CGATAACGAT AATGATGATT CTCTTCTCAG TGGTTGATTT CATCGGGTTC AAGCTCGTTC TGGGGACTTT CTTCGGGGTC GAGCCACTGT ACGCGGTTCT GATCATGGCG GTGTCCGTGG CGCTCTACGT CAGCCTCGGT GGGTTCAGAG CGGTTGTCTG GACCGACAGG GTCCAGTACA TCTTCCTGGC AGGGCTCGCG GTTGTTGTGG CGATTCTCTC CCTGAAGCTC TCGGCAGATC GGGGCGCATC CATCATCGCG GAGAGCGCTG CCCTCGGCGG GGAGTGGTGG AACCCGTTCA TGATGGGGGG CGTGCTCGGC GCCCTTGTCT TCCAGCTCGC CCTGCTCCCC GGCTGGATCG CGGAGCAGGA CCCGTGGCAG AGGGTCTGGG CCGCGCGCGA TGAGAGAAGC GCGAGGATGG GCTTGCTCCT GGGGTCGATG CTTCTCGCGA TCGTTTACGG CGCATGCTTC CTCACGGCGA TAGCCCTGCG CGCGATCTAC CCGCTCCCTG AGGGCGAGGT GGAGGCTGAG ATGCTGTACC TCAGGTTCAT CTCCGAGAAC GTGCCACCTG CAGCAGTCGC GCTGCTCACA ATAGGATTCG CTGCCGCGTC CATGTCGTGC ACAGACACCT TTGCGACATC TGGCGCGTCC TGCATCGTCA GGGACATCTT CCAGAGGTTC GTCAAACCGG ATGCCACCAT GCGGGAGATG CTGATAATCA GCAGGGTGCT TGTGGTCATG ATGATCTCCA TCTCAGCGTT CATCGCGCTG AATGTGGAGA GCATCATGGA GGCTGTGATA ATAGCCACCG TTATAGGGAC AACATCCTAC TTCTTCCCGA TCGTCGGCGG GCTTTACTGG AAGAGGGCCA CATCCTGGGG CGCGCTTGCA GCTGTCATCG TCGGTGGCGT GACACAGATC GTGATGATCG CGTATGAGAA GTTCATTTTT GGGGAGCCAC TCGACACGAT CTCTCCACTC CTCACAGAGC ACGGCGTGCT CGTCGGCCTG ACTCTGAGCG CATCGGTATT CGCGATCGTC TCCCTTCTCA CGCCGCCCAC CGACGACAGG AGGCTCGCGC CATTCTTCTC AGATATCGCA GAGAGGCTCT TTGGTGGAGC CATGATCACA GTTGACAGGA AGAACTCCAG GTATCCTATC ATCATGAGAA TGATAGAGGA GAGGGGTTTC GGAGAGAGAA CGCATCTCGA TCTGGCGCTG AGGGTGAACC CGCTGAAGGC GGATGGAGCT CAGATCAGGG GAGAGATAAG CTGGGATAGA CTCATCGAGG ATCTGAGATC CAGGCATCCG GAGTGGTACA CGCCGACCGG GAGCCATATC GCGTACAGGC TCTCACAGCA TGACATGCTT GCGTGCGTGA AGCTCTACAG GGGGGATGAG ATGGAGATCA GACTCTCTGC TGAACCAAGG 
CTCTCCCAGA GGGAGAGGTT CAGGGACGAG ATGTATCTCG CGGTAGAGGA GATAGAGGAG TCGCTTCTGA GCATGGGATA CACGACATCA CTGCCCGCAT GA
|
Protein sequence | MDPYHIFLLI LGVYISVLVG IGFISSKRQG SVTEFWLASR ELGAASLGFS AAASWLTASA LLLSTGLFML IGVSSIWVWV FPNIAALAII ALIAGRIKRI PAMTQPELLE IRYDPMVRAP VAVAITIMMI LFSVVDFIGF KLVLGTFFGV EPLYAVLIMA VSVALYVSLG GFRAVVWTDR VQYIFLAGLA VVVAILSLKL SADRGASIIA ESAALGGEWW NPFMMGGVLG ALVFQLALLP GWIAEQDPWQ RVWAARDERS ARMGLLLGSM LLAIVYGACF LTAIALRAIY PLPEGEVEAE MLYLRFISEN VPPAAVALLT IGFAAASMSC TDTFATSGAS CIVRDIFQRF VKPDATMREM LIISRVLVVM MISISAFIAL NVESIMEAVI IATVIGTTSY FFPIVGGLYW KRATSWGALA AVIVGGVTQI VMIAYEKFIF GEPLDTISPL LTEHGVLVGL TLSASVFAIV SLLTPPTDDR RLAPFFSDIA ERLFGGAMIT VDRKNSRYPI IMRMIEERGF GERTHLDLAL RVNPLKADGA QIRGEISWDR LIEDLRSRHP EWYTPTGSHI AYRLSQHDML ACVKLYRGDE MEIRLSAEPR LSQRERFRDE MYLAVEEIEE SLLSMGYTTS LPA
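The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with TTG and ends with TGA, even though the gene lies on the minus strand) and that translation table 11 uses the standard codon assignments with TTG, CTG, ATT, ATC, ATA, ATG and GTG as permitted start codons, each read as methionine at the first position. The function `translate` is a hypothetical helper written for illustration, and only the first 30 nucleotides of the record are used.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 shares these assignments and differs
# from the standard table only in its set of permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop the 10-base spacing
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the gene sequence in this record.
first_30_nt = "TTGGATCCAT ATCACATATT CCTCCTCATC"
print(translate(first_30_nt))  # MDPYHIFLLI
```

The output matches the first ten residues of the listed protein sequence, which is why the leading TTG codon corresponds to M rather than L.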
|
| |