Gene Mthe_0785 details


Gene Information

Locus tag: Mthe_0785
Symbol:
ID: 4462424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 834346
End bp: 835884
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 54%
IMG OID: 639699796
Product: extracellular solute-binding protein
Protein accession: YP_843214
Protein GI: 116754096
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
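
The coordinate and length fields above agree with each other, which can be checked with a little arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. All variable names are illustrative.

```python
# Values copied from the Gene Information fields.
start_bp = 834346
end_bp = 835884

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that encodes no residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")
print(protein_length_aa, "aa")
```

Under these assumptions the script reproduces the listed Gene Length (1539 bp) and Protein Length (512 aa).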


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0600833
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGGAT TGAAAGACGT TTTGTGCACT GCAATACTGA TATCGCTTCT TCTGCCGGCA 
GTTGCAGAGG AGATACCGGT ATACACGATA GCGGACACGA CCGGGGACTG GGGGTACCCA
TCGCCGTACC TCCACTACTC CAGGGGTCCG GGTTACGTGC GGATGAGTTT CATCTTCGAT
ACACTGGTCT GGAAGGACCA GAACGGCTTC GTGCCTGCGC TCGCGGAGAG CTGGGAGTAT
CTGGAGGGCG AGAACGCGTA TCTCTTCAAT CTGAATCCGA ACGTGAAATG GCATGACGGT
GAGCCATTCA CCGCAAACGA CGTTGTATTC ACGATCAACT ACATAAAAGA TCATCCGTAC
CAGTGGGTCG ACAGCAGCAT TGTGGATAGG GCTGAGAAGA TCGATGATCA CACGGTGAAG
ATCTACCTGG CGAAGCCCTA CGCGCCGTTT TTGGACCAGG TGGCTGGCAC TCTGCCGATT
CTCCCAGAGC ACATCTACAG CGATGTGACA AACCCGGAGG ATTTCCAGGA TCCAAAGGCG
CTTACAGGCA CAGGACCGTT CAAGCTTGTC GATTACGACA AGGCCCAGGG GACATATCTG
TATGAGGCGA ACGAGGATTA CTACCAGGGC GCTCCGAAGG TTAAGCAGCT GAAGTTTGTA
AAGCTGAGCT CAGAGATGTC CGGGCCGGCG CTGAAGCGAG GCGAGGTCGA TGCCACGGCA
GTTCCACCAG AGGTCGCTGA CGAGCTCAGG GGCAGCTTCG TCGTTCTCGA AGGCGCTCAT
GACTGGCTCG CGAAGCTGAT GATAAACCAC AAGAAGGAGC CGTTCTCCGA TGTGAGGTTC
AGGCAGGCTC TCGCATACGC CATAGATAGA GAGAAGCTCG TCGAGATAGC CCAGCGCGGA
TATGGCGTTC CTGCGAGCCC CGGACTATTC GCGCCTGACA GCGAATGGTG CAACCCGGAT
GTGGAGCAGT ACGAGCATGA TCCTGAGAAG GCCGGGGATC TGCTGAAGGA GATGGGGTAT
GTGAAGAAGG GAGATTTCTT CGAGAAGGAT GGAAATACAC TGGAGGTGGA GCTTCTCGTC
ACATCGAGCA ATGAGCGCGC GGGCGAGATC ATAAAGAACG ACCTGGAGGA GGCTGGCATA
AAGGTCAACA TGCGCAGCGT CGACTCGAAG ACACTTGATA ACGCGGTCAA CGAGTGGAAC
TTCGATCTCG CGCTCAGCGG CCACGGCGGA ATGGGTGGAG ATCCCGAGAT ACTCAATCGC
ATAATAGGCG AGGGATACAC ATTTTGCAGC GATCGATACA TCGTGAACCA AACGCTAAAC
GATCTGCTGG ACCAAGAGGT TGCTGAAATG AACCCTGACA GGCGAAAGCA GATTGTAAAA
GAGATTCAGC TGGTCTACTC GCAGCAGTTG CCTTCGTTGC CATTGTACTA TCCTGAATCT
TTCTGGGCGC ATAATGGCAA GGTCGATCTG TATTACACTA AGAGAGGGAT CGCAAACGGC
ATACCCATAC CGCTGAACAA GCTATTATTT GCGAAGTAA
 
Protein sequence
MAGLKDVLCT AILISLLLPA VAEEIPVYTI ADTTGDWGYP SPYLHYSRGP GYVRMSFIFD 
TLVWKDQNGF VPALAESWEY LEGENAYLFN LNPNVKWHDG EPFTANDVVF TINYIKDHPY
QWVDSSIVDR AEKIDDHTVK IYLAKPYAPF LDQVAGTLPI LPEHIYSDVT NPEDFQDPKA
LTGTGPFKLV DYDKAQGTYL YEANEDYYQG APKVKQLKFV KLSSEMSGPA LKRGEVDATA
VPPEVADELR GSFVVLEGAH DWLAKLMINH KKEPFSDVRF RQALAYAIDR EKLVEIAQRG
YGVPASPGLF APDSEWCNPD VEQYEHDPEK AGDLLKEMGY VKKGDFFEKD GNTLEVELLV
TSSNERAGEI IKNDLEEAGI KVNMRSVDSK TLDNAVNEWN FDLALSGHGG MGGDPEILNR
IIGEGYTFCS DRYIVNQTLN DLLDQEVAEM NPDRRKQIVK EIQLVYSQQL PSLPLYYPES
FWAHNGKVDL YYTKRGIANG IPIPLNKLLF AK
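
The gene and protein sequences can be cross-checked by translating the nucleotide sequence. The sketch below is illustrative only and is not the database's own tooling. It builds a codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in the set of permitted start codons, which this sketch ignores). To stay short, it uses only the first and last lines of the gene sequence as printed above; the helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    nt = clean(seq)
    usable = len(nt) - len(nt) % 3
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the given sequence."""
    nt = clean(seq)
    return (nt.count("G") + nt.count("C")) / len(nt)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGCTGGAT TGAAAGACGT TTTGTGCACT GCAATACTGA TATCGCTTCT TCTGCCGGCA"
last_line = "ATACCCATAC CGCTGAACAA GCTATTATTT GCGAAGTAA"

print(translate(first_line))
print(translate(last_line))
```

The first fragment translates to MAGLKDVLCTAILISLLLPA and the last to IPIPLNKLLFAK followed by a stop, matching the start and end of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.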