Gene Information |
Locus tag | Moth_0188 |
Symbol | |
ID | 3832261 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 182302 |
End bp | 184572 |
Gene Length | 2271 bp |
Protein Length | 756 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637828124 |
Product | hypothetical protein |
Protein accession | YP_429066 |
Protein GI | 83589057 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
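The coordinate and length fields above are mutually consistent, which the short sketch below checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both are assumptions about how this record counts, not statements taken from the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 182302
end_bp = 184572

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 2271, matches "Gene Length"
print(protein_length_aa)  # 756, matches "Protein Length"
```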
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCATCA AGGGTTTATG GTCGTATCTG GCGGTTGTTG TTTGCCTGAG CCTGGTGCTG GCCGGCATCC CGGCGATGCC CACCGCTGCC GCGGCGTCGA GCGACCCGGT AGCCGATCTG GTGAACAGGC TCCAGCCGGT GTATAACTGC CTGGACGCCG GGGATAAGCA AGTTATCCAG GCGGCGAAGG ATGAGATTGC GGGGCTTAGT GATGATGAGA TAGCAAAGAT ATTAAAGGAC AAAAGGTTGA TAACGGAACA AGTTAAAAAT AATCTTAGAG TTGATGAAGA TAACGCCGCC TCTACCCTGG CGGGTATGAT AAAATACGCG GCGGGGATCT ATTACTCTCC AGATGCGAGC ACGTTGGAAA ATGAATTGCA AGGTTTCCGC AGCCAGTATA GCTCTACCTT CAGTCAATTG CTTGGAAGCG GTGTAACCGT TGACGACGCC TGGCAGTTCG CCCTGGCTAC TGAAGCAAAT TTACCTGCAG CCTTGCAGAG CAATCTTGGC AATATATTGG AGCAATTTAT AACAGGTTCG AGCTATGATG CAGTCTTGAA TAGTTTCAGC GATATTGTTT CTGACGCCGC GTCGGGGGTC ACTTCTGATT TTAAGGACGC CCTGACGGGT TTGGGTTGGA ATATCGGGCT GTTGATGCAG GTTAAAGACG CATTGCGCTC TAAAGTGCCT GACGGCAAAG CCGCGGAGCA GGCCCTGCTG AAGGGCTACA TACGCTCCCA GACGCAGCCG TTAGGCAGCA CCACTTTGAC TGTCGGCAAC ACCCAGGAAT ACGGCTTAAA GGTTTTAAAC CAGTTTGAAG TAAGTTCTTC CATAGCCAGC GTCTTGGGGT GGCAGTCATC CGATCCGAGT ATCGCCAGTT TTAACGGGAA CAAGCTTACA GCCAATAAAC CCGGGACTAT CAGGGTCCGC GCTTATCATC CCTCTGCCGG AACGAATCCG GATTTTAACA ACTCCCTCTG GCTGGCTGAA TTTCAAGTGA CAGTAAACGC TGCTTCCGGC GGTGGCGGAG GCGGCGGTGG TGGCGGAGGA GGCGGAGGCG GTGCCTCCCA GCCCGGGCAG TCTACCTCCA CTACTACTGA CTTCGGCCAG GTAACGGTCG ATAAATCCAC TGGCAGTGTG ACGACCACCA TCGATGCGGC CAAAGCAGCC GATCTGATCG CCCAGGCTTC CGGCCCGGTG GTCTTTAAAG CCGACATCCC GACCGATGTG ACCGTGAAAA CGGCTACAAT GGAATTGCCG GCCGCCGTCT TTACTAAGGC CGCGGAAGCC GGTAAGGCCC TTTCCCTCGA AGTGGCGGGT GTAAAGGCGC TTCTCCCGGC AGGGGTGATA CCGCCGGAGA TCCTGGCCGA TCCGGCAGCA ACGATTAATT TTGCCTTCCA GGTTTTGGAT ACTACAGAAG CCCAGGCGGT TACCGGCAAT CTCCCGGCCA GTATGCGCCA GGCGGCAGAC GTTATCGAAA TCGACCTGTA TACTGTTAAG GGCGATAACC AGCAGATGGT GACCCCGGCC AAGCCGGTAA CCCTCACTTT GACCTATCGC CCGGAGGGTG TGGATGCCGA TAAGCTGGGT GTCTATCGTT ACAATGTGGC TGCCGGCACG TGGGAATACA AGGGCGGCCG GGTCGATAAG GCCACCAATT CGATCAGCGC TGTCCTCAAC TCCTTCTCAA AGTACACGGT ACTGGCTTAT GATAAGACCT TCAGCGACAT CCAGGGACAC TGGGCCCAGC GGGATATCGA GATCATGGCT GCCCGCCATG TTGCGGCAGG CATCTCGGCC 
AGCGAATTCA AACCCGAGGG CCAGGTAACG CGGGCGGAAT TTACGGCCTT CCTGCTCCGC ACCCTGGGTA TCAGTGAGGA TAGGTCCGCT GCCAATCGCT TTGCGGATAT CCAGCCCGGA GACTGGTACT ACGGCGCCGT AGTAACTGCC TCCAGGACCG GCCTGGTGGC GGGCTATGAA GACGGCAGCT TCCGTCCCGA CAAGGCTATA AGCCGCCAGG AAATGGCCGC TATGCTTGCG CGAGCCCTGG CTTACGCCGG GCAGAAGGTG GACGTCGCGG GACGGGTGGA CGATATCTTG AGTAAGTTCA GTGACAACGG CAGCCTCGCG AGCTGGGCCA GGGAGAGCGC GGCTGTGGCG GTAGAATCCG GGCTTATTGT CGGCCGGACG GCTACCACCT TCGTGCCCCT GGGCAACGCC ACCCGGGCGG AAACGGTGGT CATGCTCAAG CGGCTGCAGG ATCGGATCTA A
|
Protein sequence | MRIKGLWSYL AVVVCLSLVL AGIPAMPTAA AASSDPVADL VNRLQPVYNC LDAGDKQVIQ AAKDEIAGLS DDEIAKILKD KRLITEQVKN NLRVDEDNAA STLAGMIKYA AGIYYSPDAS TLENELQGFR SQYSSTFSQL LGSGVTVDDA WQFALATEAN LPAALQSNLG NILEQFITGS SYDAVLNSFS DIVSDAASGV TSDFKDALTG LGWNIGLLMQ VKDALRSKVP DGKAAEQALL KGYIRSQTQP LGSTTLTVGN TQEYGLKVLN QFEVSSSIAS VLGWQSSDPS IASFNGNKLT ANKPGTIRVR AYHPSAGTNP DFNNSLWLAE FQVTVNAASG GGGGGGGGGG GGGGASQPGQ STSTTTDFGQ VTVDKSTGSV TTTIDAAKAA DLIAQASGPV VFKADIPTDV TVKTATMELP AAVFTKAAEA GKALSLEVAG VKALLPAGVI PPEILADPAA TINFAFQVLD TTEAQAVTGN LPASMRQAAD VIEIDLYTVK GDNQQMVTPA KPVTLTLTYR PEGVDADKLG VYRYNVAAGT WEYKGGRVDK ATNSISAVLN SFSKYTVLAY DKTFSDIQGH WAQRDIEIMA ARHVAAGISA SEFKPEGQVT RAEFTAFLLR TLGISEDRSA ANRFADIQPG DWYYGAVVTA SRTGLVAGYE DGSFRPDKAI SRQEMAAMLA RALAYAGQKV DVAGRVDDIL SKFSDNGSLA SWARESAAVA VESGLIVGRT ATTFVPLGNA TRAETVVMLK RLQDRI
|
| |