Gene Moth_0316 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0316 
Symbol 
ID3831783 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp320128 
End bp321831 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content46% 
IMG OID637828251 
Producthypothetical protein 
Protein accessionYP_429193 
Protein GI83589184 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4963] Flp pilus assembly protein, ATPase CpaE 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00338768 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAAA TAGTAACATC CCTTGCCCTG GTACCAGGAG CCGGAGCCAC GTTTATAGCC 
ACAAATATTG GCGCCTGGAT GGCCAATAAA GGCATAAAAA CTTTGCTCAT AGATCTAAGC
GCTAGGGGCG TTTTAGGCCC ATTGTTCCTT GTGGAAAAAG CAAAAGAACG GGTTTACCCA
ACAACGGCAA CCTGGCAGGA ATTCAGCAAC CCCGCTTCCT CGCTCATCAA AACCCAATAC
GGCTTGGCCG TTCTCCCGGC TCCGGAGCGC GACAAAAGTA CGAATTACGA TTTAAACGTT
GAAGCCTTCT TCGACTATTT TGAGCCGTTA TTTGAAGTAA TAATTATCGA CCTGGGCGGG
GATATATATT TGCCCCACGT ATTCCCCATT ATGGAGAAGG CCTCCAAAAA CATATTGGTG
GCTGAACCGA GCAAAAGATG CGTCGAAGCT TTGCCGGGGC ACATCAAAGA AGCCTTGATG
CAATATGAGC CAGAACTAAT AATCAACCGT GTAACTTCCC GGGCTTATTA CCATCCCAGG
GATATAGCCC GGCAGTTTAA CGTAGGACAG TATATAACCA TCATTGATGA TCCCAAGTCT
AACAACGAAG CTATAAAGCA ACGCCTTCCC CTTTCGCTTT ATGGGAAAGG GAAAGCGGCG
CAGATGTTGC TGGAAATAGG GGAGCATCTT TTTCCAGAAA CCCTTAGAGA AAGTAAAGAT
ACCCAAGAGG AACAGGATAT TAGCGTCAAA AACGTTAGAA ACCCGAACGA AAGCGTGGTT
ACGGGAAGAG GAATAAAAAT CGAAGTTGAT CAGGAAAGGG AAAAAGAAGC TGTAAGGGAA
GGCCGGGTCA AGCCTATAAA AATCCCAAAG TTACTTCAAA TACAAGGCCG ACTAAAGAAA
ATATCAGTCG CCAAAATAAG GAAGAAAGCA GAGGAACAAT GCCGGTTTCA AGATGATATT
TTAATAACAG TCTGGAACCC TTCAGGGTTT TTTGTAAGTA TGACTGCTTT GAATTTAGCT
GTTGCTGCGG CAGCTGAAGG ATATGATACA GCACTTATCA ACTATAATTT TACCAACCCC
GAAACGGATA TATGGTTTGG GATAAAACAA ACCAGCGCTA AAGATGCAGA TTATAATGAT
GCGGGAATTA TGACCTTCGG TGAAGCCATC AGCCCGAAAC TGGCCGTAAA GATGCTCAAG
GAACGAGCCT GGGGCGTCAA ATACTTGCCT GCAGGAAATA AACTTGGTAA TATCGGGACC
CCGGATTATG GTGAAAAGGG GGCGCAGCTA CTGGAAAATA TAATTGTTGC CATAAAGGCG
CGGGAGGCAA GAAAACCCAA AGTGACAATC ATCGAGACCG GGACATGGTA CGAGCAGCCG
CCGGTATATG CAGCGCTAAA AACTTGCCGC GTCTTGTTCA TACCCATGTC CGGCAGCCGC
CAGGAAGGAG AAATAGTAAA GCAGCAGCTG GCGGAACTCA AAAGGGTTGA AGTAGAACCG
GTTACAGTGG AATTAATATT TGCCCCGGAA GAGATAAAGC TGCCTGCCCA AGTTTGTCAA
GAACGTTTAA TTGTCCCGGA TGATCGGGAG AGGTACCTTA AAGCGTCGGC TACCAAGAAG
CCTTACTCGC TATTAACAGA AGGAGGCCAG GAAATCTGGA GGCAGGCTCT AAAAATGGCA
AGCAATCTCA TCGAGTTCGC TTAG
 
Protein sequence
MAKIVTSLAL VPGAGATFIA TNIGAWMANK GIKTLLIDLS ARGVLGPLFL VEKAKERVYP 
TTATWQEFSN PASSLIKTQY GLAVLPAPER DKSTNYDLNV EAFFDYFEPL FEVIIIDLGG
DIYLPHVFPI MEKASKNILV AEPSKRCVEA LPGHIKEALM QYEPELIINR VTSRAYYHPR
DIARQFNVGQ YITIIDDPKS NNEAIKQRLP LSLYGKGKAA QMLLEIGEHL FPETLRESKD
TQEEQDISVK NVRNPNESVV TGRGIKIEVD QEREKEAVRE GRVKPIKIPK LLQIQGRLKK
ISVAKIRKKA EEQCRFQDDI LITVWNPSGF FVSMTALNLA VAAAAEGYDT ALINYNFTNP
ETDIWFGIKQ TSAKDADYND AGIMTFGEAI SPKLAVKMLK ERAWGVKYLP AGNKLGNIGT
PDYGEKGAQL LENIIVAIKA REARKPKVTI IETGTWYEQP PVYAALKTCR VLFIPMSGSR
QEGEIVKQQL AELKRVEVEP VTVELIFAPE EIKLPAQVCQ ERLIVPDDRE RYLKASATKK
PYSLLTEGGQ EIWRQALKMA SNLIEFA