Gene Mthe_0510 details

Gene Information

Locus tag: Mthe_0510
Symbol:
ID: 4463423
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 521350
End bp: 522882
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table: 11
GC content: 57%
IMG OID: 639699513
Product: extracellular solute-binding protein
Protein accession: YP_842941
Protein GI: 116753823
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
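
The coordinates and lengths listed in this record agree with one another, and the relationship can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(521350, 522882)
print(length)                  # 1533
print(protein_length(length))  # 510
```

Both results match the Gene Length and Protein Length fields of the record.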


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0306064
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAGGT TGGTAAAGAT CCTCTGCACT GCCATGCTGA TATCGCTTCT TCTGCCGGCA 
GTTGCAGAGG AGATACCGGT ATACACGATA GCGGACACGA CCGGGGACTG GGGGTACCCA
TCGCCGTACC TCCACTACTC CAGGGGTCCC GGATATGTGC GGATGAGCTT CATCTTCGAT
ACCTTGGTCT GGAAGGACCA GAACGGCTTC GTGCCTGCGC TCGCAGAGAG CTGGGAGTAT
CTGAAGGACG AGAACGCGTA CATCTTCAAT CTGAATCCGA ACGCGAAATG GCATGACGGC
GAGCCATTCA CAGCGGACGA TGTGGTCTTC ACGATCAACT ACATAAAAGA GCATCCGTAC
CAGTGGGTCG ACAGCAGCAC TGTGGATAGG GCTGAGAAGA TCGATGATCA CACGGTTAAG
ATCTACCTGG CGAAGCCCTA CGCCCCGTTC CTGGACCAGG TCGCAGGTAC ACTGCCGATC
CTCCCGGAGC ACATCTACAT CGATGTGACG AAGCCGGAGG ATTTCCAGGA TGCGAAGGCG
CTTACAGGCA CAGGACCGTT CAAGCTTGTC GATTACAACA AGGCCCAGGG CACCTATCTG
TACGAGGCTA ATGAGGATTA CTACCAGGGC GCTCCGAAGG TGAAGCAGCT CAAGTTCGTC
AAGCTGAGCG AGGAGATGGC AGGGCCGGCG CTGAAGCGAG GCGAGGTCGA TGCTGCGGCT
GTGCCTCCAG AGGTCGCTGA GGGGCTCAGG GGCAGCTTCG TGGTTCTCGA GGGCGCGCAC
GACTGGCTCG CGAAGCTCAT GATAAACCAC AAGAAGGAGC CGTTCTCCGA TGTCAGGTTC
AGACAGGCTC TTGCGTATTC CATAGACAGG GAGAAGCTCG TTGAGATAGC CCAGCGCGGG
TATGGTGTTC CTGCGAGCCC CGGCCTCTTC GCGCCTGACA GCGAGTGGTA CAATCCGGAT
GTTGAGCAGT ATGCCCATGA TCCGGAGAAG GCGGGCGAGC TCCTGAAGGA GCTGGGGTAT
GAGAAGAAGG GGGTCTTCTT CGAGAAGGAC GGAAAGACGC TGGAGGTTGA GCTTCTCGTC
ACATCGAGCA ATGAGCGCGC TGGCGAGCTC ATAAAAAGAG ATCTGGAGGA GGCGGGCATA
AAGGTCAACA TGCGCAGCGT CGACTCGAAG ACTCTTGATA ACGCGGTCAA CGAGTGGAAC
TTCGATCTCG CGCTCAGCGG GCACGGCGGA ATGGGCGGAG ATCCGGCGAT ACTGAACAAG
GTGATCACAG GGTCTGGCTT CAACAGCGCC AGATACGATA ACCAGGAGCT GAGTGATCTT
CTCAAGAGGG AGATCTCGGA GATGGATCCG GAGAAGAGAA GAGAGCTGGT GAACGAGATA
CAGGAGGTCT ACGCGAGAGA ACTTCCAGCG CTCCCGCTCT ACTACCCGAC CAACTACTGG
GCACACGACG GGAAGGTCGA TCTCTTCTAC ACAAAGAACG GCGTTGGGAG CGGCGTGCCA
ATTCCGCTGA ACAAGATGGC CTTCCTGGCC TGA
 
Protein sequence
MSRLVKILCT AMLISLLLPA VAEEIPVYTI ADTTGDWGYP SPYLHYSRGP GYVRMSFIFD 
TLVWKDQNGF VPALAESWEY LKDENAYIFN LNPNAKWHDG EPFTADDVVF TINYIKEHPY
QWVDSSTVDR AEKIDDHTVK IYLAKPYAPF LDQVAGTLPI LPEHIYIDVT KPEDFQDAKA
LTGTGPFKLV DYNKAQGTYL YEANEDYYQG APKVKQLKFV KLSEEMAGPA LKRGEVDAAA
VPPEVAEGLR GSFVVLEGAH DWLAKLMINH KKEPFSDVRF RQALAYSIDR EKLVEIAQRG
YGVPASPGLF APDSEWYNPD VEQYAHDPEK AGELLKELGY EKKGVFFEKD GKTLEVELLV
TSSNERAGEL IKRDLEEAGI KVNMRSVDSK TLDNAVNEWN FDLALSGHGG MGGDPAILNK
VITGSGFNSA RYDNQELSDL LKREISEMDP EKRRELVNEI QEVYARELPA LPLYYPTNYW
AHDGKVDLFY TKNGVGSGVP IPLNKMAFLA
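
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how that cleanup and a GC-fraction calculation might look; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first line of the gene sequence, so its GC fraction is not expected to equal the 57% reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (blocks or plain)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as it appears in the record.
first_line = "ATGAGCAGGT TGGTAAAGAT CCTCTGCACT GCCATGCTGA TATCGCTTCT TCTGCCGGCA"
joined = clean_sequence(first_line)
print(len(joined))                # 60
print(joined[:3])                 # ATG
print(round(gc_fraction(joined), 2))
```

Pasting the full gene sequence into `gc_fraction` in place of `first_line` would give a value to compare against the GC content field.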