Gene Mthe_0520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0520 
Symbol 
ID4463433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp535529 
End bp536500 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content55% 
IMG OID639699525 
Productaliphatic sulfonate ABC transporter periplasmic ligand-binding protein 
Protein accessionYP_842951 
Protein GI116753833 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGTCT TCCTGATCGT CGGAATGGTG GTGGTTGCCT CGCTGGGCTG CATAACAAAA 
ACACCATCCG AGAATATAAC ACTCAGAATC GGCTACCAGC CGAGCACCCA TCAGATAGCG
GAGATGGTCG CGATGGAGAA GGGTTGGTGG CTCGAGGATC TGAAGCCGTT TGGCGTTACG
GCAGTCGAGG AGTACGAGTT CCCCTCCGGC CCACCTGAGA TGCAGGCGAT GCTTGCCGGC
AGCCTGGATG TCGCTTACGT TGGAACAGCG CCGCCAATAT CAGCGATATC AGGCGGTCTC
GATGCAAAGA TAGTTGCAGG CGTCAACACC AACGGCTCTG CTCTTGTACT CGCACCTGAT
AAGGAATACA GTGGCCCCGA GTCGCTGAAG GGCATGAGCA TAGCTACGTT CCCGCCAGGC
TCGATACAGG ATACGGTGCT CAAAAAATGG CTGAGGGAGA ACGGCGTCGA TACATCCGAG
GTGAAAGTGC TTCCTATGGG GCCGGGTGAT GCTGTGACAG CGATGTTCGC CGGCCAGGTA
GACGGCACGT TCCTGCCTGA GCCATCGCCA TCGGTAATTG AGATGTCCAA TAAAGGAAAG
GTCGTCGTAT ACTCTGGAGA GATGTGGCCG AACCATGCCT GCTGCAGCCT GGTCGTCAGC
GGCAAGCTCA TCAGGGAGCA TCCGGAGCTT GTCGAGCAGA TCGTAAAGAC GCATATCAAG
GCAACAGAGT ATGTGTATGC TCATCCTGAT GAGGCAGCGA GGATCTATGC CAACCGGACG
AAGCAGGATC TGAGCGTTGT GGAGTACTCG ATGAAGAACT GGGATGGGAG GTGGATAAGC
GATCCTCATG TGCAGATCCC ATCCACAATG GAGTACGCCA GGGTCAACTA CGAGCTGAAT
TACATAAGCA GAATGCCATC TGAAGAGGAG CTCTTTGATG TGAGCTTCTA CGATAAGGCG
AGGGGTGAGT GA
 
Protein sequence
MPVFLIVGMV VVASLGCITK TPSENITLRI GYQPSTHQIA EMVAMEKGWW LEDLKPFGVT 
AVEEYEFPSG PPEMQAMLAG SLDVAYVGTA PPISAISGGL DAKIVAGVNT NGSALVLAPD
KEYSGPESLK GMSIATFPPG SIQDTVLKKW LRENGVDTSE VKVLPMGPGD AVTAMFAGQV
DGTFLPEPSP SVIEMSNKGK VVVYSGEMWP NHACCSLVVS GKLIREHPEL VEQIVKTHIK
ATEYVYAHPD EAARIYANRT KQDLSVVEYS MKNWDGRWIS DPHVQIPSTM EYARVNYELN
YISRMPSEEE LFDVSFYDKA RGE