Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2512 |
Symbol | |
ID | 3832784 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 2618215 |
End bp | 2619405 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637830435 |
Product | hypothetical protein |
Protein accession | YP_431337 |
Protein GI | 83591328 |
COG category | [S] Function unknown |
COG ID | [COG1641] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00299] conserved hypothetical protein TIGR00299 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0156591 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAGATAG CCTACTTTGA TTGTTTTTCC GGAATTAGCG GTGATATGTG CCTGGGAGCG TTAATAGCCT GTGGCCTCAG CCAGGACGAG CTAACCTCTG GCTTAAAAGG ACTGGGGCTC GAGGGATGGG AATTAAGGGT TAGGGAGGTA AAGCAACACA GCATTGCCGC CACCGATGTC GCTGTCCAGG TGACAGGGAG CCAGCCCCAC CGCCACCTGG CGGATATCCT GGGGCTGATC AATAACAGTT CTTTGCCTGC CCCGGTTAAG GAAAAATCTG CGGCGGTATT TAAAAACCTG GCCCGGGCCG AAGGCCAGGT ACATGGCATC GACGCCAGCC AGGTCCACTT TCATGAAGTC GGGGCGGTAG ACGCCATTAT CGACATCGTC GGCAGTATCC TGGGGTTGCA CCTCCTGGGT ATAGAGAAAG TCATCTCCTC CCCCTTACCT GCTGGTTCCG GCTGGGTGGA CTGCCGGCAC GGCAAATTAC CAGTTCCCGC CCCGGCAACC CTTTACCTTC TCCAGGGCTA CCCGGTTTAT GGTACTGAAG ATAAAGCCGA GCTGGTAACC CCTACCGGCG CGGCCTTGAT TACCACCCTG GCCGACAGCT TTGGCCCCTT TCCAGCCATG AACCTGACCA GGGTCGGTTT CGGTGCCGGA AAAACCGAAC TTCCCCATCC CAACCTCCTG CGCCTGGCCC TGGGTGAGAT CAACAGCGGG CAGCTGGAAG GAGAGGAAAG CAGCCTGGTT ATCGAAACAA CCATCGACGA TATGAACCCC GAATTCTTTC CCGCCCTCCT TGAGGAGACC ATGGCCGCCG GCGCTGTTGA TGCCTTCTTC ACCCCGGTAC AAATGAAAAA AGGCCGACCC GGGATCCTCT TTACGGCCCT CTGTCCGGAG AATAAACTGG CCGCTGTTGC GGCTGCCATC TTTACCCATT CCAGCACCCT GGGGTTACGT TTTCGCCGGG ACCAACGCCT GGTATGCCAG CGACGGATGG CTGAGGTAGT CACCCCTTAT GGCACTGTCC CCGTTAAACT GGGCCTCTAC CGTGATCCCA CAGGACAGGT TATTACCAAC ATCGCACCCG AATATGAATC CTGCCGTCAG ATTGCCAAGT CTGCCGGCGC CCCCCTGAAG GAAGTCTATG CTGCTGCCCT GGCCGCCGCC AGGGCGCTAA AGGCTTTTTA A
|
Protein sequence | MKIAYFDCFS GISGDMCLGA LIACGLSQDE LTSGLKGLGL EGWELRVREV KQHSIAATDV AVQVTGSQPH RHLADILGLI NNSSLPAPVK EKSAAVFKNL ARAEGQVHGI DASQVHFHEV GAVDAIIDIV GSILGLHLLG IEKVISSPLP AGSGWVDCRH GKLPVPAPAT LYLLQGYPVY GTEDKAELVT PTGAALITTL ADSFGPFPAM NLTRVGFGAG KTELPHPNLL RLALGEINSG QLEGEESSLV IETTIDDMNP EFFPALLEET MAAGAVDAFF TPVQMKKGRP GILFTALCPE NKLAAVAAAI FTHSSTLGLR FRRDQRLVCQ RRMAEVVTPY GTVPVKLGLY RDPTGQVITN IAPEYESCRQ IAKSAGAPLK EVYAAALAAA RALKAF
|
| |