Gene Moth_0308 details


Gene Information

Locus tag: Moth_0308
Symbol:
ID: 3831775
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 310864
End bp: 313287
Gene Length: 2424 bp
Protein Length: 807 aa
Translation table: 11
GC content: 62%
IMG OID: 637828243
Product: hypothetical protein
Protein accession: YP_429185
Protein GI: 83589176
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3451] Type IV secretory pathway, VirB4 components
TIGRFAM ID:
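
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(310864, 313287)
print(length)                  # 2424
print(protein_length(length))  # 807
```

Both results agree with the Gene Length (2424 bp) and Protein Length (807 aa) fields in the record.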


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.287416
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAATTCC CCCTCATCGC CTTTAAGAAC AACATCGTCT TCAATAACCA GGGCGAAGCC 
TACGCCGTCT ACCGCCTCAA AGGCGAGGCC TACAACCACC TGCCCCTGGC GGAAAGGCAG
ATGGTCATCA AACGCCTGGA AGAAAGCTTT TACGGTTATG AAGGCAAAGG GCAGATCCTG
CTCCTGTGTG AAGAACTGCG CCTGGACGAA GGCGGCTACC TAGCGGCGGC CGGCGTCTCC
TACGACCTCC CCCGGGAGGT AGCCCTGGAG GCCTCCCGCC ACGCCCGGTC CGTGAGGGAA
GCCCTCAGCT ACGGCGCCCG CCGCCGGCGC CGTTACCTGG TCCTGCAGCT GCGCCTGGAT
CCCCAGCTGG ATGATATCAA AACCCTGCTG GCCGAAGCCC GCGACCTGGC CCTGGGCACC
TTCCTCCGCT CCGAAAAGTG GCTGCTCTCA CCCCGGCGCC TGCAGGAGGC CCTGGAAGCG
GAAGAAGAAT TATACCACCG CGTCCGAAAT ATCACCGCCG GCCGCGCCGA TTTCGGCGAT
CTGGACTTCA TCATCCGGCG CAACACCCGC CGGGTGGGAG TCCTGCCACC GCCGTTGCCG
TCCCGCGATG CCGGCCGCTT CACCCCAGCC ATCATCAGCG CCTTCAGCGA CGGCTGCCTC
CTGGAAGAGC ACCCGGGCTA CATCGCCATC ACCCAAGGCA ATGACGAAAC CCATTACCAG
GTTTTCATCA CCTTTCCCGA TCTGCCTAAA AGCATCCCGG AGACAGGGGC CGAATGGCTG
GCCAGCCTGG ATGTCAACGA AGCGGCCATT GATGCCGTGG TCCATTTCCA GGTTACCCGG
CCCTTCAAGG CCAAGAAAGC GGCCGAGAGC CGCCGGCGTT TCCTAAAAGG CCAGATCGAG
GAAGCGATAA AGGGCCGGGA TGAACCCAGC GTAGACGAAG AATACGGCCT GACCGAAGGC
CGCTACCTGG AAAGCAAGAT CCAGGCTGGC CAGCCCTTAG CGGCCATGGC CGTCACCCTG
GCCGTGGCCG GCAAAGACCT GAAAGCGGTC CGGGCCACGG CAACCAGGAC AATAGAGAGA
TACACCTCTT CCGGCTACCG GGCCGTCCGG CCGGTAGGCG ACCAGATCAA GTGCCTGTAC
TCCTTCCTGC CCGGCGCCCC GACGGCCGCG CCCCTGATCG AGTGCGATCC TGGCTTCATC
GCCGCCGCCG GTCCCCATAT CTCCCTGGAA ACGGGCGACG GCAAGGGCTT TTTCCTGGGC
TGGTCCGGGG CGGCCCCGGT CTGGTGGCGG CCGGGCTACG CCGCCCGGGA ACTGGGCCGT
TCCAACGCCG TCTTTATCAC CGGCGGCCTG GGCGGGGGCA AGAGCATGAC CATCAAGACC
ATGGGCCATT TCATCCGCCT GGCCGGCGGC GTGTTGTTCG TCATCGACCC CAAAAAGAAC
GAATACAGGG CATACGAGCG CCTGTATCCT ATCAAACGTA TCGACCTCTC TCCCGGCGGC
GACCAGGAAT TGAACCCCTT CATGCTGGCC GCCGACGAGC GCCGTTCCAA AGGGATCGCC
CTGGATTTTT TAAGCATCGC CTTGAACCTC CGCGACGACA ACGATGTGCG ACGGGTGGCC
GTATCCCAGG CGGTAGAAAG GGTGGCCGGC CGGCCGCCGG CGGAACGAAA CCTTGAGGCC
TGTCTGGAAG AATTGAACAT GATGGCCAAG GAGAAGGCCC ATCCCCAGGT CGCCCGGGAG
GCCGGCCAGT GCGCCCTGCT GTTAGAATCC CTGCGGGACG GCTCTCTAGG CCATCTTGTA
TTCGGCAAGG GAAGGGGAAA CGAGATCTCC CCGGTCACGG TGGTCAGCCT CCAGGGCCTG
CCCCTGCCGC GCACCGCCCA GAACCTCCTG GCCGGCCGGA TTACCGAAAG TGAGCGCCAG
GGCTTAGGGA TGCTCTACCT GGCGGCGGCT ATGGCCAGAG AGGTGGCCTT CTCCCTGCCG
GCACATATAA TCAAAGGCCA GATTTTTGAT GAGGTTTGGA TGCTGGCCGG CATCTCCGAA
GGTGCCCGGC TGCTGGACGA ACTGATCCGC ATGGGGGCCA GGAGCTATAA CGCCATCCCC
ATCCTGGCCA CCCAGAACGC CAGCGACGTG GCCAGCATGC AGACGATTAA AAACAACGTC
AGCTATGTCC TGTGCTTCCG GGCGCAGGAT AAAAGCGAGA TACGTAGTAA TATAGAGCTG
TTGGGAGCCG ATGTAGAAGA AGAGGAAGAG AAAAAGGGGG CCGGCCTGGC CAACCTCTTC
CGTTCCCTGG AGAGCGGCTG GTGCCTCATG AAGGACGCCC TGGGCCGCAT CGGCCAGGTA
TACATCGACC CACGGCCTGA ATACCTCCTG CAGGTGTTCG ATACCACGCC GGGCAAGGAA
GGAGAGGAAA CAAAAATTGG TTGA
 
Protein sequence
MKFPLIAFKN NIVFNNQGEA YAVYRLKGEA YNHLPLAERQ MVIKRLEESF YGYEGKGQIL 
LLCEELRLDE GGYLAAAGVS YDLPREVALE ASRHARSVRE ALSYGARRRR RYLVLQLRLD
PQLDDIKTLL AEARDLALGT FLRSEKWLLS PRRLQEALEA EEELYHRVRN ITAGRADFGD
LDFIIRRNTR RVGVLPPPLP SRDAGRFTPA IISAFSDGCL LEEHPGYIAI TQGNDETHYQ
VFITFPDLPK SIPETGAEWL ASLDVNEAAI DAVVHFQVTR PFKAKKAAES RRRFLKGQIE
EAIKGRDEPS VDEEYGLTEG RYLESKIQAG QPLAAMAVTL AVAGKDLKAV RATATRTIER
YTSSGYRAVR PVGDQIKCLY SFLPGAPTAA PLIECDPGFI AAAGPHISLE TGDGKGFFLG
WSGAAPVWWR PGYAARELGR SNAVFITGGL GGGKSMTIKT MGHFIRLAGG VLFVIDPKKN
EYRAYERLYP IKRIDLSPGG DQELNPFMLA ADERRSKGIA LDFLSIALNL RDDNDVRRVA
VSQAVERVAG RPPAERNLEA CLEELNMMAK EKAHPQVARE AGQCALLLES LRDGSLGHLV
FGKGRGNEIS PVTVVSLQGL PLPRTAQNLL AGRITESERQ GLGMLYLAAA MAREVAFSLP
AHIIKGQIFD EVWMLAGISE GARLLDELIR MGARSYNAIP ILATQNASDV ASMQTIKNNV
SYVLCFRAQD KSEIRSNIEL LGADVEEEEE KKGAGLANLF RSLESGWCLM KDALGRIGQV
YIDPRPEYLL QVFDTTPGKE GEETKIG
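
The sequences above are printed in blocks of 10 characters, 60 per row, so the whitespace has to be stripped before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction helper; the function names are hypothetical and only the first row of the gene sequence is used as sample input. The gene begins with GTG rather than ATG; under translation table 11 GTG is an accepted start codon and is read as methionine when it initiates translation, which is consistent with the protein sequence beginning with M.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied from the listing above.
first_row = "GTGAAATTCC CCCTCATCGC CTTTAAGAAC AACATCGTCT TCAATAACCA GGGCGAAGCC"
seq = clean_sequence(first_row)

print(len(seq))   # 60
print(seq[:3])    # GTG
print(f"{gc_fraction(seq):.2f}")
```

Running the same helpers over the full gene sequence gives a value that can be compared against the GC content field in the Gene Information section.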