Gene Moth_0365 details


Gene Information

Locus tag: Moth_0365
Symbol:
ID: 3832721
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 367931
End bp: 369916
Gene Length: 1986 bp
Protein Length: 661 aa
Translation table: 11
GC content: 52%
IMG OID: 637828300
Product: TRAG protein
Protein accession: YP_429242
Protein GI: 83589233
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3505] Type IV secretory pathway, VirD4 components
TIGRFAM ID:
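
The length fields in this record follow from the coordinates. The sketch below reproduces that arithmetic, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 367931
end_bp = 369916

# Inclusive interval length: both end positions are counted.
gene_length_bp = end_bp - start_bp + 1

# One codon is three bases; the final codon is the stop codon,
# which does not contribute an amino acid.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, "bp")
print(protein_length_aa, "aa")
```

Both values agree with the Gene Length and Protein Length fields of the record.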


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.00756366
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCAACA AGATATTTGC CGGTTTGACG GAGAAAACCG GCAACAACGG TATAGGGCTT 
TTTTTGTTGG CAGTCGGCTT TATAACGAGC CTGGCGGTAT TGTACATCAT CGACGTGTGG
CTTCTCGGCC CTGTAGCTGC CATCTTTGCG GCAGCTTATA AATGGCTGGC CGGGGGGCTG
CACGGGCATC CCAATTATAC TGCCGCCTGG TGGTACTTCC AGCACCCGGT GGCAACTGCC
AGGGCCTGGC TGGGCGGCCA CCTCTCCCAG CCGGAAGTAC GTTCCTGGTG GTTCGGCCTT
AATGTATTAA TTGCGGTAAT GTGGGCCCTC CGCCGGATAG CCTGGCAATT TGACTGGACG
ATTAGTAAAA ACCCCGGCAT AAAGATAAAA AAAGACGACG CCACCTACGG AAGCGCCAGG
TGGGCCGTGA AAAGCGACCT GGCGCGAGTT TGCGACTTCG GCTTCGGCCC GGGAATAGTT
CTGGGGGCTT TAGGAGCAGC ACCAGTGCGT ATTCCCCCTA AGCCCAAAAC CTGGATGAAC
CGCCATGTAC TCGTGGTCGG CGCACCAGGT TCCGGCAAAA GCCGTGGCTA TGTCCGGCCC
AATATCTTCG CTGCGGTCCG GTCAGGGGAG AGCGTGCTGG TGACGGATCC CAAGGGTGAG
CTTTACCGCA GTATGGCCTG CTGGCTAAAG TCAAAAGGGT ACACGGTTAA GGGCTTCAAC
CTTGTCCAAA TGGGACAATC GGATCACTGG AACCCCCTGG CGGAGATCCG GACCCCCCTG
GATGCCGACG TTTTTGCCCA GGTGGTTATC AATACCACTG AAACTGGACC GAAGAAAGGT
GGCGATGCGT TTTGGGATCG AGCCGAGCAA AATTTATTAA AGGCCCTGGC CCTCTATGTC
ACCACAGAAC TTCCTGCGGA TAAGCGCAAT TTTGGCTCTC TTTATGATAT ATTAGCTGCC
GGTGATTTTG AACAAGTAGA TGCCTTATTC GCCAAACTTC CACCGGGCCA TCCAGCAAAA
GGACCATATA ATGTTTATGC CATGGCCGGC GATAACGTCA AAGGTGGAGT GGTAATCGGG
CTGGGTACAC GGTTGCAAGT ATTCCAGCAA GAAATGGTGC GACGCATAAC TGGTGATAGC
GATATAGACT TAACGTTACC CGGCAAGGGA AAGTGCGCTT ACTTCATCAT TACCCCGGAT
ACTCACGGAG CCTTTGATTT TCTGGCTTCA TTGCTGTTCA CCTTTTTATT CGTCCGGCTG
GTAGAGGTTG CTGATACCTC TCCTAACGGC CGTTTACCGG TGCAGGTCAG ATTTCTCCTG
GATGAGTTTG CCAATATCGT AAGTATTCCG GAGTTTGAAA AGAAAATCGC CACTGTCCGC
AGCCGCGGCC TCGACTGCCA CGTTATAGTC CAGAGCATTC CCCAGCTGGA AAGGAAATAT
GGACGGACCT GGGAGGAAAT AATGGCCTGC TGCGACACGA AGTTAATTAT AGGCGTGAAA
GATGATACTA CCGCTCGCTA CGTAAGCCGT ATGCTAGGGG AAAGTACCGT GGAAACAAGG
AGCTCCACCA GGGAAGTTAA CCCAATATGG GGACAGGGGT TGTTTGACGA CAAGCGTAGC
CTCGGTATTA CCGGCCGGGA ACTGATGACC CCGGACGAGA TCCAGAAAAT GCGCTCAAAG
TTTTGCCTGG TATTCCTCCC CGACGGCACA CCGCCGGCCA AGTTAAAAGT GTTGGATTAT
GAGCAGTTCC CGGAAGCAAA GGAGTTGAAG AAGGTTATAG TTACGGAAAA GAAGGAAGAA
GAAAAAGAGC TTGAGACTGA AGACGAGCTT AACGATGGTG GAAATGAGGA TTACCACGAT
ACCATAGACC GGCAACTTGT AGAAGATGGG GAAGAAAAGT TGATGGAAGA AAACATAAAA
GAGGGAGATA GGGTTATAGT GACAGAAAAT AACACGAGTA ACGGAAAAAT CATTGTCCCG
TGGTGA
 
Protein sequence
MFNKIFAGLT EKTGNNGIGL FLLAVGFITS LAVLYIIDVW LLGPVAAIFA AAYKWLAGGL 
HGHPNYTAAW WYFQHPVATA RAWLGGHLSQ PEVRSWWFGL NVLIAVMWAL RRIAWQFDWT
ISKNPGIKIK KDDATYGSAR WAVKSDLARV CDFGFGPGIV LGALGAAPVR IPPKPKTWMN
RHVLVVGAPG SGKSRGYVRP NIFAAVRSGE SVLVTDPKGE LYRSMACWLK SKGYTVKGFN
LVQMGQSDHW NPLAEIRTPL DADVFAQVVI NTTETGPKKG GDAFWDRAEQ NLLKALALYV
TTELPADKRN FGSLYDILAA GDFEQVDALF AKLPPGHPAK GPYNVYAMAG DNVKGGVVIG
LGTRLQVFQQ EMVRRITGDS DIDLTLPGKG KCAYFIITPD THGAFDFLAS LLFTFLFVRL
VEVADTSPNG RLPVQVRFLL DEFANIVSIP EFEKKIATVR SRGLDCHVIV QSIPQLERKY
GRTWEEIMAC CDTKLIIGVK DDTTARYVSR MLGESTVETR SSTREVNPIW GQGLFDDKRS
LGITGRELMT PDEIQKMRSK FCLVFLPDGT PPAKLKVLDY EQFPEAKELK KVIVTEKKEE
EKELETEDEL NDGGNEDYHD TIDRQLVEDG EEKLMEENIK EGDRVIVTEN NTSNGKIIVP
W
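
The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not the method used to generate this record: it builds the codon table for translation table 11 (whose amino acid assignments match the standard code), strips the spacing used in the 10-base display blocks, and translates up to the first stop codon. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Alternative start codons such as GTG or TTG are not special-cased, which does not matter here because the gene starts with ATG.

```python
# Codon table for NCBI translation table 11, built in the usual TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three display blocks of the gene sequence in this record.
first_blocks = "ATGTTCAACA AGATATTTGC CGGTTTGACG"
print(translate(first_blocks))  # first ten residues of the protein
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the Protein sequence and GC content fields of the record.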