Gene Moth_0762 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0762 
Symbol 
ID3831475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp798717 
End bp800606 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content53% 
IMG OID637828693 
Producthypothetical protein 
Protein accessionYP_429623 
Protein GI83589614 
COG category[S] Function unknown 
COG ID[COG2604] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.211386 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCCAT CTACTAATCT GGTCTATCGC AAAAACGCCA GGGTGTTGCA GCGGTATAGC 
CCGGAACTTT TCCGGGACCT GGAGGCCACA GCCCTGCCCC TGGATCGGCA GCTTGCCCCG
GCCCAAAATG GCGAACCAAC ACTAATAGCC ATCACAGGTG GTAAGGAGAT AGCCCTGCAC
AGCCGCTATG ATCCCCGGCG CGAGGCTGTA ACCTGGGCCC GGGGCGTTGA TGAAAACGCA
GATATGGTAG TTGTCCTGGG GATGGGCCTG GGTTACCATC TGGAAGCCCT GAAGGACTTA
TACCCCCATA AAGCAGTTCT AGTACTGGAG CCGGAACTGG CGGCGGTAAA GCTGGCCTTC
GCCGCCAGGG ACATGACCCA CTTGCTGAAG AGCGGGCAGT TTTACCTGCT AGCGGTGGCC
GACCCCGAGG ATGCGGCCGC CCAACTCAGC AACATTCTGG CCGAAAACGC CGGTAAAAGA
ATAGCCCTGC ATACTTTGCC GGCTTATGAG CAACTCTATG CTGGCTACTG GCAACGAGTG
TGCCAGGGGG TTACCGACAG GCTGCGCCAG CGCCGGGTCA ACTGGGCCAC CACCGAAAAG
TTCATGATGC AGTGGTTATG TAACTTTCGT GATAACTTTT TACCCTACAT AAAAGCCCCT
GGGGTTATCC ACCTTTTCGA CGCTTTCAGC GGTAAACCGG CTCTGATTGT GGCCGCGGGA
CCCTCTCTAG AAAAAAATAT CCATCTCTTA CCTTCCCTTA AGGGAAGGGT ACTGATCATG
GCTGCCGGTT CAGCCATCAG GATTCTAGAA AAAAACGGCA TCAAACCGGA TCTGCTGGTT
TCTTTTGATC CTGGAGATGC CAATTACCAG CACTTTGCCG GATTTGACGG GAGGGGAGTA
CCTCTGGTTT ACGCTCCGGT CATCTTTCCC CGCATCGTTC AGGAGTACCA GGGGCCCACT
TTTAGCTGTG AATTGAATGT TTCACCATTT ATTGAATGGT TTGATGAAAA GCTGGGCGAG
AAAAAGGGTG TTCTGATCAG CGGTCCCTCA GTGGCCAATG TCTGCCTGGA CCTGGCGGTG
AAGATGGGCG CTAACCCCAT TATCCTTATA GGCCAGGATC TGGCCTTCAC CAACAACAAG
ACCCACGCCG ACGGCGCCAG GCACCAGCAG AGGATAGACC CCAGCCAGGG GAACTACATT
TGGGTAGAAG ATATATATGG CGATAGAGTG CCGACCACTA CAGCCTTTTA TTCCATGCTC
GTCTGGTATG AACAGTACTT GGGCAACCTC AAGGGAAAGC GCCTGGTCAT AGATGCCACA
GAAGGGGGTG CCCGTATTCG GAGTACCGAA ATTATGTCTT TGCAGGAAGT GAGAGATAAG
TACCTTCGGG AAACGTTTTC ACCAGGGGAA ATTATTGCAG CAAAGCATGA CGTTTATGCA
GTACCAGATG GGGAACAACT TCGGCGTCTT GAAGAGGCCT TTAGCGAGCT TAGTTCCCGG
CGGGAGGATT TGCGGGCCTG CTTTGAGGAA GGTATAGAAG TGGCCCGGCA ACTGCTGGAA
AAATGCCACA AGAAAACAGT AAAGTTAACT AACTACGAGC GTGCCCGCCG GAAATTCATG
GGCCTGGACC GGCGCATCAC CGGCAATATC CTTTATCGAC TTTTCCTAGA GCAGGGTCTG
GCCGCCCGTA TTGACGCCAT CAACCGTATA TTGGGCGAAA GAGTAAACGA CGAGCAGGAA
TTGCCTGCGC GAGGAGAAAA ACTAGCTTCC CTGTACCTGT CCTTCTTTAC CGAGGTTGAG
CGCTATGCCG AATTTACAAC AGAGATACTG AAAGAAATAG AAGAAAAGAT TCGGCGTGAA
AGCGCCTCAA CTAGCTGTTC AAAAGCTTAG
 
Protein sequence
MSPSTNLVYR KNARVLQRYS PELFRDLEAT ALPLDRQLAP AQNGEPTLIA ITGGKEIALH 
SRYDPRREAV TWARGVDENA DMVVVLGMGL GYHLEALKDL YPHKAVLVLE PELAAVKLAF
AARDMTHLLK SGQFYLLAVA DPEDAAAQLS NILAENAGKR IALHTLPAYE QLYAGYWQRV
CQGVTDRLRQ RRVNWATTEK FMMQWLCNFR DNFLPYIKAP GVIHLFDAFS GKPALIVAAG
PSLEKNIHLL PSLKGRVLIM AAGSAIRILE KNGIKPDLLV SFDPGDANYQ HFAGFDGRGV
PLVYAPVIFP RIVQEYQGPT FSCELNVSPF IEWFDEKLGE KKGVLISGPS VANVCLDLAV
KMGANPIILI GQDLAFTNNK THADGARHQQ RIDPSQGNYI WVEDIYGDRV PTTTAFYSML
VWYEQYLGNL KGKRLVIDAT EGGARIRSTE IMSLQEVRDK YLRETFSPGE IIAAKHDVYA
VPDGEQLRRL EEAFSELSSR REDLRACFEE GIEVARQLLE KCHKKTVKLT NYERARRKFM
GLDRRITGNI LYRLFLEQGL AARIDAINRI LGERVNDEQE LPARGEKLAS LYLSFFTEVE
RYAEFTTEIL KEIEEKIRRE SASTSCSKA