Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0762 |
Symbol | |
ID | 3831475 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 798717 |
End bp | 800606 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637828693 |
Product | hypothetical protein |
Protein accession | YP_429623 |
Protein GI | 83589614 |
COG category | [S] Function unknown |
COG ID | [COG2604] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.211386 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCCAT CTACTAATCT GGTCTATCGC AAAAACGCCA GGGTGTTGCA GCGGTATAGC CCGGAACTTT TCCGGGACCT GGAGGCCACA GCCCTGCCCC TGGATCGGCA GCTTGCCCCG GCCCAAAATG GCGAACCAAC ACTAATAGCC ATCACAGGTG GTAAGGAGAT AGCCCTGCAC AGCCGCTATG ATCCCCGGCG CGAGGCTGTA ACCTGGGCCC GGGGCGTTGA TGAAAACGCA GATATGGTAG TTGTCCTGGG GATGGGCCTG GGTTACCATC TGGAAGCCCT GAAGGACTTA TACCCCCATA AAGCAGTTCT AGTACTGGAG CCGGAACTGG CGGCGGTAAA GCTGGCCTTC GCCGCCAGGG ACATGACCCA CTTGCTGAAG AGCGGGCAGT TTTACCTGCT AGCGGTGGCC GACCCCGAGG ATGCGGCCGC CCAACTCAGC AACATTCTGG CCGAAAACGC CGGTAAAAGA ATAGCCCTGC ATACTTTGCC GGCTTATGAG CAACTCTATG CTGGCTACTG GCAACGAGTG TGCCAGGGGG TTACCGACAG GCTGCGCCAG CGCCGGGTCA ACTGGGCCAC CACCGAAAAG TTCATGATGC AGTGGTTATG TAACTTTCGT GATAACTTTT TACCCTACAT AAAAGCCCCT GGGGTTATCC ACCTTTTCGA CGCTTTCAGC GGTAAACCGG CTCTGATTGT GGCCGCGGGA CCCTCTCTAG AAAAAAATAT CCATCTCTTA CCTTCCCTTA AGGGAAGGGT ACTGATCATG GCTGCCGGTT CAGCCATCAG GATTCTAGAA AAAAACGGCA TCAAACCGGA TCTGCTGGTT TCTTTTGATC CTGGAGATGC CAATTACCAG CACTTTGCCG GATTTGACGG GAGGGGAGTA CCTCTGGTTT ACGCTCCGGT CATCTTTCCC CGCATCGTTC AGGAGTACCA GGGGCCCACT TTTAGCTGTG AATTGAATGT TTCACCATTT ATTGAATGGT TTGATGAAAA GCTGGGCGAG AAAAAGGGTG TTCTGATCAG CGGTCCCTCA GTGGCCAATG TCTGCCTGGA CCTGGCGGTG AAGATGGGCG CTAACCCCAT TATCCTTATA GGCCAGGATC TGGCCTTCAC CAACAACAAG ACCCACGCCG ACGGCGCCAG GCACCAGCAG AGGATAGACC CCAGCCAGGG GAACTACATT TGGGTAGAAG ATATATATGG CGATAGAGTG CCGACCACTA CAGCCTTTTA TTCCATGCTC GTCTGGTATG AACAGTACTT GGGCAACCTC AAGGGAAAGC GCCTGGTCAT AGATGCCACA GAAGGGGGTG CCCGTATTCG GAGTACCGAA ATTATGTCTT TGCAGGAAGT GAGAGATAAG TACCTTCGGG AAACGTTTTC ACCAGGGGAA ATTATTGCAG CAAAGCATGA CGTTTATGCA GTACCAGATG GGGAACAACT TCGGCGTCTT GAAGAGGCCT TTAGCGAGCT TAGTTCCCGG CGGGAGGATT TGCGGGCCTG CTTTGAGGAA GGTATAGAAG TGGCCCGGCA ACTGCTGGAA AAATGCCACA AGAAAACAGT AAAGTTAACT AACTACGAGC GTGCCCGCCG GAAATTCATG GGCCTGGACC GGCGCATCAC CGGCAATATC CTTTATCGAC TTTTCCTAGA GCAGGGTCTG GCCGCCCGTA TTGACGCCAT CAACCGTATA TTGGGCGAAA GAGTAAACGA CGAGCAGGAA TTGCCTGCGC GAGGAGAAAA ACTAGCTTCC CTGTACCTGT CCTTCTTTAC CGAGGTTGAG CGCTATGCCG AATTTACAAC AGAGATACTG AAAGAAATAG AAGAAAAGAT TCGGCGTGAA AGCGCCTCAA CTAGCTGTTC AAAAGCTTAG
|
Protein sequence | MSPSTNLVYR KNARVLQRYS PELFRDLEAT ALPLDRQLAP AQNGEPTLIA ITGGKEIALH SRYDPRREAV TWARGVDENA DMVVVLGMGL GYHLEALKDL YPHKAVLVLE PELAAVKLAF AARDMTHLLK SGQFYLLAVA DPEDAAAQLS NILAENAGKR IALHTLPAYE QLYAGYWQRV CQGVTDRLRQ RRVNWATTEK FMMQWLCNFR DNFLPYIKAP GVIHLFDAFS GKPALIVAAG PSLEKNIHLL PSLKGRVLIM AAGSAIRILE KNGIKPDLLV SFDPGDANYQ HFAGFDGRGV PLVYAPVIFP RIVQEYQGPT FSCELNVSPF IEWFDEKLGE KKGVLISGPS VANVCLDLAV KMGANPIILI GQDLAFTNNK THADGARHQQ RIDPSQGNYI WVEDIYGDRV PTTTAFYSML VWYEQYLGNL KGKRLVIDAT EGGARIRSTE IMSLQEVRDK YLRETFSPGE IIAAKHDVYA VPDGEQLRRL EEAFSELSSR REDLRACFEE GIEVARQLLE KCHKKTVKLT NYERARRKFM GLDRRITGNI LYRLFLEQGL AARIDAINRI LGERVNDEQE LPARGEKLAS LYLSFFTEVE RYAEFTTEIL KEIEEKIRRE SASTSCSKA
|
| |