Gene Mthe_0672 details


Gene Information

Locus tag: Mthe_0672
Symbol:
ID: 4463312
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 708154
End bp: 710055
Gene Length: 1902 bp
Protein Length: 633 aa
Translation table: 11
GC content: 58%
IMG OID: 639699682
Product: Na+/solute symporter
Protein accession: YP_843102
Protein GI: 116753984
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID:
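
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length includes one terminal stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one stop codon at the end."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(708154, 710055)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values in this record, the computed lengths agree with the listed Gene Length (1902 bp) and Protein Length (633 aa).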


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGATCCAT ATCACATATT CCTCCTCATC CTGGGCGTAT ATATTTCTGT GCTCGTCGGC 
ATAGGCTTCA TCTCATCGAA GCGGCAGGGA TCTGTGACTG AGTTCTGGCT GGCAAGCCGG
GAGCTGGGGG CAGCCTCCTT GGGATTCTCG GCTGCGGCTT CATGGCTCAC GGCCAGCGCT
TTGCTGCTTT CCACAGGTCT TTTCATGCTC ATAGGCGTCA GCTCGATATG GGTGTGGGTA
TTTCCGAACA TAGCTGCGCT GGCGATCATA GCGCTGATCG CAGGCAGGAT CAAGCGCATA
CCTGCGATGA CCCAGCCTGA GCTTCTGGAG ATAAGATACG ATCCAATGGT CAGAGCTCCG
GTCGCGGTGG CGATAACGAT AATGATGATT CTCTTCTCAG TGGTTGATTT CATCGGGTTC
AAGCTCGTTC TGGGGACTTT CTTCGGGGTC GAGCCACTGT ACGCGGTTCT GATCATGGCG
GTGTCCGTGG CGCTCTACGT CAGCCTCGGT GGGTTCAGAG CGGTTGTCTG GACCGACAGG
GTCCAGTACA TCTTCCTGGC AGGGCTCGCG GTTGTTGTGG CGATTCTCTC CCTGAAGCTC
TCGGCAGATC GGGGCGCATC CATCATCGCG GAGAGCGCTG CCCTCGGCGG GGAGTGGTGG
AACCCGTTCA TGATGGGGGG CGTGCTCGGC GCCCTTGTCT TCCAGCTCGC CCTGCTCCCC
GGCTGGATCG CGGAGCAGGA CCCGTGGCAG AGGGTCTGGG CCGCGCGCGA TGAGAGAAGC
GCGAGGATGG GCTTGCTCCT GGGGTCGATG CTTCTCGCGA TCGTTTACGG CGCATGCTTC
CTCACGGCGA TAGCCCTGCG CGCGATCTAC CCGCTCCCTG AGGGCGAGGT GGAGGCTGAG
ATGCTGTACC TCAGGTTCAT CTCCGAGAAC GTGCCACCTG CAGCAGTCGC GCTGCTCACA
ATAGGATTCG CTGCCGCGTC CATGTCGTGC ACAGACACCT TTGCGACATC TGGCGCGTCC
TGCATCGTCA GGGACATCTT CCAGAGGTTC GTCAAACCGG ATGCCACCAT GCGGGAGATG
CTGATAATCA GCAGGGTGCT TGTGGTCATG ATGATCTCCA TCTCAGCGTT CATCGCGCTG
AATGTGGAGA GCATCATGGA GGCTGTGATA ATAGCCACCG TTATAGGGAC AACATCCTAC
TTCTTCCCGA TCGTCGGCGG GCTTTACTGG AAGAGGGCCA CATCCTGGGG CGCGCTTGCA
GCTGTCATCG TCGGTGGCGT GACACAGATC GTGATGATCG CGTATGAGAA GTTCATTTTT
GGGGAGCCAC TCGACACGAT CTCTCCACTC CTCACAGAGC ACGGCGTGCT CGTCGGCCTG
ACTCTGAGCG CATCGGTATT CGCGATCGTC TCCCTTCTCA CGCCGCCCAC CGACGACAGG
AGGCTCGCGC CATTCTTCTC AGATATCGCA GAGAGGCTCT TTGGTGGAGC CATGATCACA
GTTGACAGGA AGAACTCCAG GTATCCTATC ATCATGAGAA TGATAGAGGA GAGGGGTTTC
GGAGAGAGAA CGCATCTCGA TCTGGCGCTG AGGGTGAACC CGCTGAAGGC GGATGGAGCT
CAGATCAGGG GAGAGATAAG CTGGGATAGA CTCATCGAGG ATCTGAGATC CAGGCATCCG
GAGTGGTACA CGCCGACCGG GAGCCATATC GCGTACAGGC TCTCACAGCA TGACATGCTT
GCGTGCGTGA AGCTCTACAG GGGGGATGAG ATGGAGATCA GACTCTCTGC TGAACCAAGG
CTCTCCCAGA GGGAGAGGTT CAGGGACGAG ATGTATCTCG CGGTAGAGGA GATAGAGGAG
TCGCTTCTGA GCATGGGATA CACGACATCA CTGCCCGCAT GA
 
Protein sequence
MDPYHIFLLI LGVYISVLVG IGFISSKRQG SVTEFWLASR ELGAASLGFS AAASWLTASA 
LLLSTGLFML IGVSSIWVWV FPNIAALAII ALIAGRIKRI PAMTQPELLE IRYDPMVRAP
VAVAITIMMI LFSVVDFIGF KLVLGTFFGV EPLYAVLIMA VSVALYVSLG GFRAVVWTDR
VQYIFLAGLA VVVAILSLKL SADRGASIIA ESAALGGEWW NPFMMGGVLG ALVFQLALLP
GWIAEQDPWQ RVWAARDERS ARMGLLLGSM LLAIVYGACF LTAIALRAIY PLPEGEVEAE
MLYLRFISEN VPPAAVALLT IGFAAASMSC TDTFATSGAS CIVRDIFQRF VKPDATMREM
LIISRVLVVM MISISAFIAL NVESIMEAVI IATVIGTTSY FFPIVGGLYW KRATSWGALA
AVIVGGVTQI VMIAYEKFIF GEPLDTISPL LTEHGVLVGL TLSASVFAIV SLLTPPTDDR
RLAPFFSDIA ERLFGGAMIT VDRKNSRYPI IMRMIEERGF GERTHLDLAL RVNPLKADGA
QIRGEISWDR LIEDLRSRHP EWYTPTGSHI AYRLSQHDML ACVKLYRGDE MEIRLSAEPR
LSQRERFRDE MYLAVEEIEE SLLSMGYTTS LPA
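
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the method used to produce this record: it assumes translation table 11 shares the standard amino acid assignments and differs only in its set of alternative start codons, and it reads an initial start codon (here TTG) as methionine. The names `translate`, `CODON_TABLE`, and `START_CODONS` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard codon ordering used by NCBI genetic code tables (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed start codons for translation table 11; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring the whitespace used to group bases."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon encodes methionine
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence above.
first_line = "TTGGATCCAT ATCACATATT CCTCCTCATC CTGGGCGTAT ATATTTCTGT GCTCGTCGGC"
print(translate(first_line))
```

Applied to the first line of the gene sequence, this yields the first 20 residues of the protein sequence listed above (MDPYHIFLLI LGVYISVLVG).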