Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0365 |
Symbol | |
ID | 3832721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 367931 |
End bp | 369916 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637828300 |
Product | TRAG protein |
Protein accession | YP_429242 |
Protein GI | 83589233 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3505] Type IV secretory pathway, VirD4 components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00756366 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCAACA AGATATTTGC CGGTTTGACG GAGAAAACCG GCAACAACGG TATAGGGCTT TTTTTGTTGG CAGTCGGCTT TATAACGAGC CTGGCGGTAT TGTACATCAT CGACGTGTGG CTTCTCGGCC CTGTAGCTGC CATCTTTGCG GCAGCTTATA AATGGCTGGC CGGGGGGCTG CACGGGCATC CCAATTATAC TGCCGCCTGG TGGTACTTCC AGCACCCGGT GGCAACTGCC AGGGCCTGGC TGGGCGGCCA CCTCTCCCAG CCGGAAGTAC GTTCCTGGTG GTTCGGCCTT AATGTATTAA TTGCGGTAAT GTGGGCCCTC CGCCGGATAG CCTGGCAATT TGACTGGACG ATTAGTAAAA ACCCCGGCAT AAAGATAAAA AAAGACGACG CCACCTACGG AAGCGCCAGG TGGGCCGTGA AAAGCGACCT GGCGCGAGTT TGCGACTTCG GCTTCGGCCC GGGAATAGTT CTGGGGGCTT TAGGAGCAGC ACCAGTGCGT ATTCCCCCTA AGCCCAAAAC CTGGATGAAC CGCCATGTAC TCGTGGTCGG CGCACCAGGT TCCGGCAAAA GCCGTGGCTA TGTCCGGCCC AATATCTTCG CTGCGGTCCG GTCAGGGGAG AGCGTGCTGG TGACGGATCC CAAGGGTGAG CTTTACCGCA GTATGGCCTG CTGGCTAAAG TCAAAAGGGT ACACGGTTAA GGGCTTCAAC CTTGTCCAAA TGGGACAATC GGATCACTGG AACCCCCTGG CGGAGATCCG GACCCCCCTG GATGCCGACG TTTTTGCCCA GGTGGTTATC AATACCACTG AAACTGGACC GAAGAAAGGT GGCGATGCGT TTTGGGATCG AGCCGAGCAA AATTTATTAA AGGCCCTGGC CCTCTATGTC ACCACAGAAC TTCCTGCGGA TAAGCGCAAT TTTGGCTCTC TTTATGATAT ATTAGCTGCC GGTGATTTTG AACAAGTAGA TGCCTTATTC GCCAAACTTC CACCGGGCCA TCCAGCAAAA GGACCATATA ATGTTTATGC CATGGCCGGC GATAACGTCA AAGGTGGAGT GGTAATCGGG CTGGGTACAC GGTTGCAAGT ATTCCAGCAA GAAATGGTGC GACGCATAAC TGGTGATAGC GATATAGACT TAACGTTACC CGGCAAGGGA AAGTGCGCTT ACTTCATCAT TACCCCGGAT ACTCACGGAG CCTTTGATTT TCTGGCTTCA TTGCTGTTCA CCTTTTTATT CGTCCGGCTG GTAGAGGTTG CTGATACCTC TCCTAACGGC CGTTTACCGG TGCAGGTCAG ATTTCTCCTG GATGAGTTTG CCAATATCGT AAGTATTCCG GAGTTTGAAA AGAAAATCGC CACTGTCCGC AGCCGCGGCC TCGACTGCCA CGTTATAGTC CAGAGCATTC CCCAGCTGGA AAGGAAATAT GGACGGACCT GGGAGGAAAT AATGGCCTGC TGCGACACGA AGTTAATTAT AGGCGTGAAA GATGATACTA CCGCTCGCTA CGTAAGCCGT ATGCTAGGGG AAAGTACCGT GGAAACAAGG AGCTCCACCA GGGAAGTTAA CCCAATATGG GGACAGGGGT TGTTTGACGA CAAGCGTAGC CTCGGTATTA CCGGCCGGGA ACTGATGACC CCGGACGAGA TCCAGAAAAT GCGCTCAAAG TTTTGCCTGG TATTCCTCCC CGACGGCACA CCGCCGGCCA AGTTAAAAGT GTTGGATTAT GAGCAGTTCC CGGAAGCAAA GGAGTTGAAG AAGGTTATAG TTACGGAAAA GAAGGAAGAA GAAAAAGAGC TTGAGACTGA AGACGAGCTT AACGATGGTG GAAATGAGGA TTACCACGAT ACCATAGACC GGCAACTTGT AGAAGATGGG GAAGAAAAGT TGATGGAAGA AAACATAAAA GAGGGAGATA GGGTTATAGT GACAGAAAAT AACACGAGTA ACGGAAAAAT CATTGTCCCG TGGTGA
|
Protein sequence | MFNKIFAGLT EKTGNNGIGL FLLAVGFITS LAVLYIIDVW LLGPVAAIFA AAYKWLAGGL HGHPNYTAAW WYFQHPVATA RAWLGGHLSQ PEVRSWWFGL NVLIAVMWAL RRIAWQFDWT ISKNPGIKIK KDDATYGSAR WAVKSDLARV CDFGFGPGIV LGALGAAPVR IPPKPKTWMN RHVLVVGAPG SGKSRGYVRP NIFAAVRSGE SVLVTDPKGE LYRSMACWLK SKGYTVKGFN LVQMGQSDHW NPLAEIRTPL DADVFAQVVI NTTETGPKKG GDAFWDRAEQ NLLKALALYV TTELPADKRN FGSLYDILAA GDFEQVDALF AKLPPGHPAK GPYNVYAMAG DNVKGGVVIG LGTRLQVFQQ EMVRRITGDS DIDLTLPGKG KCAYFIITPD THGAFDFLAS LLFTFLFVRL VEVADTSPNG RLPVQVRFLL DEFANIVSIP EFEKKIATVR SRGLDCHVIV QSIPQLERKY GRTWEEIMAC CDTKLIIGVK DDTTARYVSR MLGESTVETR SSTREVNPIW GQGLFDDKRS LGITGRELMT PDEIQKMRSK FCLVFLPDGT PPAKLKVLDY EQFPEAKELK KVIVTEKKEE EKELETEDEL NDGGNEDYHD TIDRQLVEDG EEKLMEENIK EGDRVIVTEN NTSNGKIIVP W
|
| |