Gene Moth_0188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0188 
Symbol 
ID3832261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp182302 
End bp184572 
Gene Length2271 bp 
Protein Length756 aa 
Translation table11 
GC content57% 
IMG OID637828124 
Producthypothetical protein 
Protein accessionYP_429066 
Protein GI83589057 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCATCA AGGGTTTATG GTCGTATCTG GCGGTTGTTG TTTGCCTGAG CCTGGTGCTG 
GCCGGCATCC CGGCGATGCC CACCGCTGCC GCGGCGTCGA GCGACCCGGT AGCCGATCTG
GTGAACAGGC TCCAGCCGGT GTATAACTGC CTGGACGCCG GGGATAAGCA AGTTATCCAG
GCGGCGAAGG ATGAGATTGC GGGGCTTAGT GATGATGAGA TAGCAAAGAT ATTAAAGGAC
AAAAGGTTGA TAACGGAACA AGTTAAAAAT AATCTTAGAG TTGATGAAGA TAACGCCGCC
TCTACCCTGG CGGGTATGAT AAAATACGCG GCGGGGATCT ATTACTCTCC AGATGCGAGC
ACGTTGGAAA ATGAATTGCA AGGTTTCCGC AGCCAGTATA GCTCTACCTT CAGTCAATTG
CTTGGAAGCG GTGTAACCGT TGACGACGCC TGGCAGTTCG CCCTGGCTAC TGAAGCAAAT
TTACCTGCAG CCTTGCAGAG CAATCTTGGC AATATATTGG AGCAATTTAT AACAGGTTCG
AGCTATGATG CAGTCTTGAA TAGTTTCAGC GATATTGTTT CTGACGCCGC GTCGGGGGTC
ACTTCTGATT TTAAGGACGC CCTGACGGGT TTGGGTTGGA ATATCGGGCT GTTGATGCAG
GTTAAAGACG CATTGCGCTC TAAAGTGCCT GACGGCAAAG CCGCGGAGCA GGCCCTGCTG
AAGGGCTACA TACGCTCCCA GACGCAGCCG TTAGGCAGCA CCACTTTGAC TGTCGGCAAC
ACCCAGGAAT ACGGCTTAAA GGTTTTAAAC CAGTTTGAAG TAAGTTCTTC CATAGCCAGC
GTCTTGGGGT GGCAGTCATC CGATCCGAGT ATCGCCAGTT TTAACGGGAA CAAGCTTACA
GCCAATAAAC CCGGGACTAT CAGGGTCCGC GCTTATCATC CCTCTGCCGG AACGAATCCG
GATTTTAACA ACTCCCTCTG GCTGGCTGAA TTTCAAGTGA CAGTAAACGC TGCTTCCGGC
GGTGGCGGAG GCGGCGGTGG TGGCGGAGGA GGCGGAGGCG GTGCCTCCCA GCCCGGGCAG
TCTACCTCCA CTACTACTGA CTTCGGCCAG GTAACGGTCG ATAAATCCAC TGGCAGTGTG
ACGACCACCA TCGATGCGGC CAAAGCAGCC GATCTGATCG CCCAGGCTTC CGGCCCGGTG
GTCTTTAAAG CCGACATCCC GACCGATGTG ACCGTGAAAA CGGCTACAAT GGAATTGCCG
GCCGCCGTCT TTACTAAGGC CGCGGAAGCC GGTAAGGCCC TTTCCCTCGA AGTGGCGGGT
GTAAAGGCGC TTCTCCCGGC AGGGGTGATA CCGCCGGAGA TCCTGGCCGA TCCGGCAGCA
ACGATTAATT TTGCCTTCCA GGTTTTGGAT ACTACAGAAG CCCAGGCGGT TACCGGCAAT
CTCCCGGCCA GTATGCGCCA GGCGGCAGAC GTTATCGAAA TCGACCTGTA TACTGTTAAG
GGCGATAACC AGCAGATGGT GACCCCGGCC AAGCCGGTAA CCCTCACTTT GACCTATCGC
CCGGAGGGTG TGGATGCCGA TAAGCTGGGT GTCTATCGTT ACAATGTGGC TGCCGGCACG
TGGGAATACA AGGGCGGCCG GGTCGATAAG GCCACCAATT CGATCAGCGC TGTCCTCAAC
TCCTTCTCAA AGTACACGGT ACTGGCTTAT GATAAGACCT TCAGCGACAT CCAGGGACAC
TGGGCCCAGC GGGATATCGA GATCATGGCT GCCCGCCATG TTGCGGCAGG CATCTCGGCC
AGCGAATTCA AACCCGAGGG CCAGGTAACG CGGGCGGAAT TTACGGCCTT CCTGCTCCGC
ACCCTGGGTA TCAGTGAGGA TAGGTCCGCT GCCAATCGCT TTGCGGATAT CCAGCCCGGA
GACTGGTACT ACGGCGCCGT AGTAACTGCC TCCAGGACCG GCCTGGTGGC GGGCTATGAA
GACGGCAGCT TCCGTCCCGA CAAGGCTATA AGCCGCCAGG AAATGGCCGC TATGCTTGCG
CGAGCCCTGG CTTACGCCGG GCAGAAGGTG GACGTCGCGG GACGGGTGGA CGATATCTTG
AGTAAGTTCA GTGACAACGG CAGCCTCGCG AGCTGGGCCA GGGAGAGCGC GGCTGTGGCG
GTAGAATCCG GGCTTATTGT CGGCCGGACG GCTACCACCT TCGTGCCCCT GGGCAACGCC
ACCCGGGCGG AAACGGTGGT CATGCTCAAG CGGCTGCAGG ATCGGATCTA A
 
Protein sequence
MRIKGLWSYL AVVVCLSLVL AGIPAMPTAA AASSDPVADL VNRLQPVYNC LDAGDKQVIQ 
AAKDEIAGLS DDEIAKILKD KRLITEQVKN NLRVDEDNAA STLAGMIKYA AGIYYSPDAS
TLENELQGFR SQYSSTFSQL LGSGVTVDDA WQFALATEAN LPAALQSNLG NILEQFITGS
SYDAVLNSFS DIVSDAASGV TSDFKDALTG LGWNIGLLMQ VKDALRSKVP DGKAAEQALL
KGYIRSQTQP LGSTTLTVGN TQEYGLKVLN QFEVSSSIAS VLGWQSSDPS IASFNGNKLT
ANKPGTIRVR AYHPSAGTNP DFNNSLWLAE FQVTVNAASG GGGGGGGGGG GGGGASQPGQ
STSTTTDFGQ VTVDKSTGSV TTTIDAAKAA DLIAQASGPV VFKADIPTDV TVKTATMELP
AAVFTKAAEA GKALSLEVAG VKALLPAGVI PPEILADPAA TINFAFQVLD TTEAQAVTGN
LPASMRQAAD VIEIDLYTVK GDNQQMVTPA KPVTLTLTYR PEGVDADKLG VYRYNVAAGT
WEYKGGRVDK ATNSISAVLN SFSKYTVLAY DKTFSDIQGH WAQRDIEIMA ARHVAAGISA
SEFKPEGQVT RAEFTAFLLR TLGISEDRSA ANRFADIQPG DWYYGAVVTA SRTGLVAGYE
DGSFRPDKAI SRQEMAAMLA RALAYAGQKV DVAGRVDDIL SKFSDNGSLA SWARESAAVA
VESGLIVGRT ATTFVPLGNA TRAETVVMLK RLQDRI