Gene Mthe_0468 details

Gene Information

Locus tag: Mthe_0468
Symbol:
ID: 4462625
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 484494
End bp: 486080
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 56%
IMG OID: 639699470
Product: Na+/solute symporter
Protein accession: YP_842899
Protein GI: 116753781
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family
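
The coordinates and lengths in this record are mutually consistent, which a short sketch can check. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(484494, 486080)
print(length)                     # 1587
print(protein_length_aa(length))  # 528
```

Both values agree with the Gene Length and Protein Length fields of the record.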


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.280127
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCTGT TACTTCTCAA CGTATTGGTG GTCGTCTATC TGCTTGTCAC GCTCTACCTG 
GGATACAGGG GCTGGGTGAC GACGAGGGAC ACCGAAGGAT ACATGGTGGC GGGCAGAAAG
ATCCATCCTT ACATAATGGC GATGAGCTAC GGGGCAACAT TCATCAGCAC CTCCGCGATC
GTGGGGTTCG GCGGCATGGC AGGTCTCTTC GGCATGGGGC TTCTGTGGCT GACATTCCTC
AACATCTTTG TGGGGATATT CATAGCGTTC ATAATTTACG GAAAGCGCAC GAGGAGGATG
GGTTACAACC TGGGAGCTAT GACGCTCCCG GAGCTGATGG GCCGGAGGTT CGACAGCCAG
TTCATCCAGT GGTTCAGCGG GCTTGTGATC TTTCTCGGGA TGCCTTTATA TGCATCCGTC
GTTTTAATTG GTGCTGCCAG GTTCATGGAG ACCACGCTCT CAATAGACTT CAACCTCGCG
CTGCTTATCT ACTCGCTCAT AATCGCTGCC TACGTGGTCT GGGGCGGCCT GAAGGGGGTC
ATGTACACAG ATGCGATGCA GGGCACGGTA ATGTTTCTGG GGATGATATT TCTTCTCATC
ATGACCTATC TCCACCTTGG AGGTGTGACA GCTGCCCACC AGGCGCTCAC AGACATGGCG
AACCTGGTTC CTCAGAGCCT TGCGGCACAG GGTCACAGAG GATGGACGAG CATGCCAGCG
CTCGGAAGCG CATGGTGGTG GACACTGGTC TCGACCGTGG TCCTCGGTGT CGGGATTGGC
GTCCTCGCGA TGCCGCAGCT CGCGGTGAGA TTCATGACGG TGAGATCCGG CAGGGAGCTG
AACCGCGGTG TCCTGATCGG CGGCGTGTTC ATACTCGCAA TGACCGGCGT TGCGTTCGAT
GTCGGGGCTC TGTCAAACGT CTTCTTCCAT GCCACACAGG GCAAGATAGC GATAGAGGTG
GCTAAGGGGA ACGCGGACAG CATAATACCT GTTTACATCA ATGCGGCGAT GCCGGAGTGG
TTTGTGTACC TCTTCATGCT CACACTCCTC GCAGCAGCAA TGTCCACATC GAGCTCTCAG
TTCCATGCCC AGGGCACCGC GATCGGCAGG GATATCTATG AGACGCTGAC AGGGAAAAAG
GGAACAGGGT CAATACTGGT GACGAGGGCA GGGATCATAG TGGCTGTGGT CATCGCGGTC
GTCCTCGGAT ACATACTTCC GGAGAACATC ATCGCCAGGG GCACATCCAT ATTCTTCGGG
CTCTGTGCAG CCGCGTTCCT CCCGATGTAC ACATGCGCTC TCTTCTGGAG GCGGGCCACA
AGAGCGGGCG CGATCGCGGG GCTCGTGACC GGCACGTTCT CGAGCGTGCT CTGGATGGTC
TTCGTCCATA AAAAAGAGGC AGAGGCCCTC GGCATATCCA GATTCATATT CGGCAGGGAT
GTGCTCATAA CCCAGATGCC CTGGCCTGTA GTGGATCCAA TAGTGGTTGC GCTGCCGCTG
GCGTTTTTAG TCACGATCGT TGTGAGCATA CTCACGAGAC CACCGGACAC CGCGCATCTT
GATCGATGCT TTAAGGGCAT CAAATAA
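
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of that cleanup together with a GC fraction calculation. The helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed fraction is not the 56% reported for the full gene.

```python
def clean_sequence(raw):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw):
    """Fraction of G and C bases in a sequence (0.0 for empty input)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence from this record, used as sample input.
first_line = "ATGAATCTGT TACTTCTCAA CGTATTGGTG GTCGTCTATC TGCTTGTCAC GCTCTACCTG"
print(len(clean_sequence(first_line)))  # 60
print(gc_fraction(first_line))
```

Passing the whole gene sequence to `gc_fraction` would give the value to compare against the GC content field.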
 
Protein sequence
MNLLLLNVLV VVYLLVTLYL GYRGWVTTRD TEGYMVAGRK IHPYIMAMSY GATFISTSAI 
VGFGGMAGLF GMGLLWLTFL NIFVGIFIAF IIYGKRTRRM GYNLGAMTLP ELMGRRFDSQ
FIQWFSGLVI FLGMPLYASV VLIGAARFME TTLSIDFNLA LLIYSLIIAA YVVWGGLKGV
MYTDAMQGTV MFLGMIFLLI MTYLHLGGVT AAHQALTDMA NLVPQSLAAQ GHRGWTSMPA
LGSAWWWTLV STVVLGVGIG VLAMPQLAVR FMTVRSGREL NRGVLIGGVF ILAMTGVAFD
VGALSNVFFH ATQGKIAIEV AKGNADSIIP VYINAAMPEW FVYLFMLTLL AAAMSTSSSQ
FHAQGTAIGR DIYETLTGKK GTGSILVTRA GIIVAVVIAV VLGYILPENI IARGTSIFFG
LCAAAFLPMY TCALFWRRAT RAGAIAGLVT GTFSSVLWMV FVHKKEAEAL GISRFIFGRD
VLITQMPWPV VDPIVVALPL AFLVTIVVSI LTRPPDTAHL DRCFKGIK
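
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 uses the same amino acid assignments as the standard genetic code and differs only in which start codons are permitted. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order for each codon position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence in this record.
print(translate("ATGAATCTGT TACTTCTCAA CGTATTGGTG"))  # MNLLLLNVLV
print(translate("GATCGATGCT TTAAGGGCAT CAAATAA"))     # DRCFKGIK
```

The two outputs match the first and last residues of the protein sequence listed here.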