Gene Mthe_0715 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0715 
Symbol 
ID4463488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp751807 
End bp754926 
Gene Length3120 bp 
Protein Length1039 aa 
Translation table11 
GC content52% 
IMG OID639699725 
ProductNa+/solute symporter 
Protein accessionYP_843145 
Protein GI116754027 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.723376 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGCGCAC CTCTCTTCGC GTTCTCCATC ATACTTACGT ACCTCCTCCT CCTGAGCATC 
ATAGCATATT ACGCCGACAG GCAGAGGCAG GCGGGCAGGA GCATCGTCTC TAATCCATAT
GTCTATGCTT TATCCCTTGC AGTATACTGT ACAGCATGGA CATTCTATGG AAGTATTGGA
AGGGCAGCAA CAAGCGGCCT CGGTTTTCTC ACGATATACA TCGGACCAAC ACTTGCAATG
CTCCTCGGAT GGGTGATGAT ACGCAAGATA GTCCGCATAT CAAAGGAGTA TCGACTCACG
TCTATCAGCG ATTTCATAAG CTTCAGGTAC GGCAGAAGCT ATGCAATAGG CGCGATAGTG
ACCATTGTGA GCATGATGGT AGTCATACCG TATGTCGCAC TCCAGTTGAT CGCGATATCA
AGCTCAATAC AGATAATAAG CGGAGGAGAT ACATTCTGGG GTACAAAGCT CTCCGTGGCG
GTCCTGCTCG CTGTATTCGC CATAATATTC GGAGCGCGCC ACCTGGATCC GATGGAGCGG
CACGAGGGTT TGGTGGCAGC GGTTGCGTTT GAATCAATTG TCAAGCTGGC CGCATTTGTT
GTGGCAGGTG TTTACATAAC ATGGGGGATA TTCAACGGAT ACTCTGAGAT AATAGACCGC
GTCCTCTCAA CAGAGAATTT CTCTCATCTC ATAAACATAG ATTACACATC CTGGTTCTCG
CTCACCCTGA TATCCTTCTT CGCTGCGTTT CTTCTCCCCA GACAGTTCCA CGTCATGGTT
GTTGAGAATG CTGATGAATC GCATATACGA AAGGCGATGT GGCTCTTCCC GCTTTACCTC
CTTCTGATTA ATCTTTTCGT TCCAGCGATA GCCGGAGCAG GCCTGCTTCT GGGAGTTCCT
GGCGTGAAAG ATATGTTCGT GATAGAGATC CCGTACTCCG CGGGCAACAT ACCTCTCGCA
GTGCTCGTGT TCATAGGAGG AGCGTCTGCT GCCACTGCGA TGGTGCTGGT GGACGGTGTT
GCAGTTGGCC ACATGATGCT GAACGAGCTG GAGCTGCCGT ACCTCATGAG GTACTTGGGG
AGGGGAAGAG GTCTCCCCGG GCTTCTGCTC AACGCTAAGC GCATCAACAT CATTCTTGTG
GTGATGCTCG GATACCTTTA CTCCAGGGTC GTCGAGTACC AGAGCCTCGT GGATATAGGT
TTGATATCAT TCGTGGCAGC AAGCCAGATG GGGCCAGCGG TGATCGGGGG CCTTTACTGG
AGAAAGGGCA GCAGGGAGGG GGCGATCGCA GGCATGAGCG CAGGGTTTGT GCTCTGGCTC
TACACTGCAC TCATCCCCAC AGTTGTCAAG GCCGGCTGGC TTCCGCAGTC CATTCTCGAA
TCAGGACCCT TCGGGATATC TGCGCTCACT CCGACTAACC TCTTTGGTGT GGATCTGGAT
CCCTGGACTA ACTCGGTGTT CTGGAGCATC TTCGCAAATG CGAGCCTGTA TGTGCTATTC
TCCCTGATGA GCAGCCCGAC GCCTGAGGAG AGGGTCCAGG CAGAGGGGTT CGTTGAGATA
TTCAGCGAGA GAAGGGAAGC AGTTCCGATC GAGAGACCTG CCATAAGACT GGGTACGGTC
GACGAGGTTG AGTCGATGCT CGCCAGGTAT ATAGGAGCGG AGAAGGCCAG GATGCTGATA
GATGCGGATC TTGCGAGGCT TGGAGTCTCA AGAGATAAAG TCGATGCCAG GCACCTCCTG
GACCTCTGGG ATCATGTGGA GAGGGTTCTC ACAGGCTCGC TTGGCACATC AACAACAAGG
ATAATAGTTG AGGAGCATAT CACACCCAGG CCAGTTGTTG AGAGGGTGGA AGCAGCTCCG
CAGAAATTCA GTCTCGAGCC TGGAAAGATC TACTTCTCAG CTCAGAATGC CTATGAGGTG
TTCACGGATC AGGTAACGCA TGGATTTGAG GGGCTGTGTG TCACACGCAG ACCGCCGGAG
GATGTGAGAA GCAGGTACAG TCTCAGGAAG ACGCCGATCA TATGGCTTAA CCAAAAGAGA
GAGGGTGGTG AGAAGCAGAT ATCTCCCACA AATCTTCCAC TGCTCTTCCT CACGATAAAG
ACATTCGTCG AGACCTGCAG GAAGGGTATA ATACTTCTGG ACAATCTCGA GCATCTTGTT
CTTGTCAATG AGAATGTGAT ACCTGCGGAG GATCTGCTGG ATTTTGTCAA CCAGCTGGAG
AATCTAGTCC ACAGAACGAA CACCAGACTC ATCCTGGCGG ATTCGTCTGA TTTCATGGGG
TTCTGTGCTG TATCTGAGGC GGAGCCTGTG GAAGTAAGGG GTCTGATATT CACGGCAGGC
CCTCTGCCGT CTTATCTCCT CAGGGTTTTT ATACTTGCCA TAATCGGCGG GACGAGATCT
CCAGAGGCTG CAATGGACAT CGCAAATTCC GTTCTCAGCG AACAATCAGA GGTCGCAGAG
GGCGCATCAT GCGATCCCGA CATGCGCGGC AGGGGATTGA TCGAGATAGA TACGCGGTGC
AAGATCACGA GAAGGCATTT CTTCACGATA ATAAGACGCA TCTGCACCTC CGTGAGCAGG
GTTGATCCCG ACTTCGATTC TGTGAAAGTA TTGAGGCCGC TCATCGAAAA GTACGGCTTC
AGCATCTATG AGCTGATACT GAATCCAGGA ACGACATATG CGATCGAGGA GGACAAGCCG
GTCCGGTGCT TCGAGATATT CAGCGAGCTG GTACACGCTG GGTTAGAGGG GTTGTGCATA
TCCAGGTACA ACCCTGAGAG CCTCCGTGAG AAGTACGGGA TCTCACCTGA AACAGTCATA
TGGCTCACAC AGAAGACAGA GGAGGGGAAG TTCAGATCCG TGGATCCAAC GAACTTCCCG
AGGCTCAGCT CAATGATATC AGATTTTCTG AGGAGGACCG AGTACCCTGT GATACTCCTG
GAGGGTATTG GATACCTGAT AACGCAGAGC AACTATGAGA CCGTGCTGAG GTTTATACAG
TCGCAGAGGG ATGAGGTCTC GCTGAGAGGA GCTGTGATGC TCGTGCATAT CGATCCTCTC
TCTCTTGACA CAAAGGAGCT GCACAGGCTG GAGGGAGAGA TGGAACAGCT GGAGATCTGA
 
Protein sequence
MSAPLFAFSI ILTYLLLLSI IAYYADRQRQ AGRSIVSNPY VYALSLAVYC TAWTFYGSIG 
RAATSGLGFL TIYIGPTLAM LLGWVMIRKI VRISKEYRLT SISDFISFRY GRSYAIGAIV
TIVSMMVVIP YVALQLIAIS SSIQIISGGD TFWGTKLSVA VLLAVFAIIF GARHLDPMER
HEGLVAAVAF ESIVKLAAFV VAGVYITWGI FNGYSEIIDR VLSTENFSHL INIDYTSWFS
LTLISFFAAF LLPRQFHVMV VENADESHIR KAMWLFPLYL LLINLFVPAI AGAGLLLGVP
GVKDMFVIEI PYSAGNIPLA VLVFIGGASA ATAMVLVDGV AVGHMMLNEL ELPYLMRYLG
RGRGLPGLLL NAKRINIILV VMLGYLYSRV VEYQSLVDIG LISFVAASQM GPAVIGGLYW
RKGSREGAIA GMSAGFVLWL YTALIPTVVK AGWLPQSILE SGPFGISALT PTNLFGVDLD
PWTNSVFWSI FANASLYVLF SLMSSPTPEE RVQAEGFVEI FSERREAVPI ERPAIRLGTV
DEVESMLARY IGAEKARMLI DADLARLGVS RDKVDARHLL DLWDHVERVL TGSLGTSTTR
IIVEEHITPR PVVERVEAAP QKFSLEPGKI YFSAQNAYEV FTDQVTHGFE GLCVTRRPPE
DVRSRYSLRK TPIIWLNQKR EGGEKQISPT NLPLLFLTIK TFVETCRKGI ILLDNLEHLV
LVNENVIPAE DLLDFVNQLE NLVHRTNTRL ILADSSDFMG FCAVSEAEPV EVRGLIFTAG
PLPSYLLRVF ILAIIGGTRS PEAAMDIANS VLSEQSEVAE GASCDPDMRG RGLIEIDTRC
KITRRHFFTI IRRICTSVSR VDPDFDSVKV LRPLIEKYGF SIYELILNPG TTYAIEEDKP
VRCFEIFSEL VHAGLEGLCI SRYNPESLRE KYGISPETVI WLTQKTEEGK FRSVDPTNFP
RLSSMISDFL RRTEYPVILL EGIGYLITQS NYETVLRFIQ SQRDEVSLRG AVMLVHIDPL
SLDTKELHRL EGEMEQLEI