Gene Information |
Locus tag | Moth_2488 |
Symbol | |
ID | 3831591 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2593409 |
End bp | 2594560 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637830410 |
Product | PilT protein-like |
Protein accession | YP_431313 |
Protein GI | 83591304 |
COG category | [R] General function prediction only |
COG ID | [COG4956] Integral membrane protein (PIN domain superfamily) |
TIGRFAM ID | |
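The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS whose final codon is a stop codon; the record states neither convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Start bp and End bp as listed in the record above.
length_bp = gene_length(2593409, 2594560)
print(length_bp)                  # 1152
print(protein_length(length_bp))  # 383
```

Under these assumptions, both values match the listed Gene Length (1152 bp) and Protein Length (383 aa).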
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000025338 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0444425 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTATTGC GCATTTTACG TGGGGCCTTT GGCCTGCTGG GAGCGGCAGC CGGCTTCTAT GTAGGTCGGG CAGGCTTAAA CCTCTGGCAG GTGGGCGGGG GAGTGAACCC GCCCCCGGGG TTGAGTTGGA CGCTACTGGC CCTGGTGACT TTCGTGGCCG GCCTCGTGGG ATACGGTGTG GCCCAACATA TTATTAACCT GGTGACCCAA TCCATGCGCT GGCTGGAAGG GAGGTTGCAA CGCACCCCGG CCCAGGAGAT AATAAGCGGT GCCCTGGGAC TAATCTGTGG ACTTATAATT GCTAATTTAT TGGGGGCCTC CTTCTTTCAT TTACCCCTGG TGGGACCCTA TATTCCCATG GTTGGGAGTA TTCTCTTTGG CTACCTGGGC TGGAGCCTGG GTACCAAGCG CAGGGACGAA GTCTGGTCCC TCTTTAATAT TTTCCCCCGC TGGGGGGGAA AAGAACGGGA TAAAGGCAAG GGTGAAAGCG TCCGATCCGG GGCCAAGATC CTTGATACCA GCGTCATTAT CGACGGCCGG ATCGCTGATA TTATTAAGAG TGGCTTTATA GAGGGAACGA TAGTTATTCC GGCCTTTGTC CTGGAGGAGT TACGCCACAT AGCCGATTCT TCCGATCTGC TAAAACGCAA CCGCGGCCGG CGCGGGCTTG ATATCCTGAA CAAAATCCGT AAGGAAACCG GCATTACCGT AAAGGTTTCC GAGGTTGATT TTGACGATCT GACGGAGGTA GATAGCAAAC TTGTCCGCCT GGCCCAGAAG ATGGGCGCTC CGGTCCTGAC CAATGATTAT AACCTGAACA AGGTGGCCGA GCTCCAGGGT GTCCGGGTGT TGAACATCAA CGAACTGGCC AATGCCGTTA AGCCGGTAGT TTTACCGGGA GAAGAAATGA CTGTTCAGGT GATTAAAGAC GGTAAGGAGA TGGGCCAGGG GGTAGCTTAC TTAGATGACG GCACCATGAT TGTCGTTGAG AACGGCCGCC GGTTTATCGG CCAGACAATT GCCGTCCTGG TAACCAGCGT TTTACAGACT GCCGCCGGGC GGATGATCTT TGCCCGGCCC AAGGCTGCTG ACCGCAAACT TGGTGCCCAT CACCAGGCCC TGGAACGGAG CGAGTACCAG TGCCTTTCCT GA
|
Protein sequence | MLLRILRGAF GLLGAAAGFY VGRAGLNLWQ VGGGVNPPPG LSWTLLALVT FVAGLVGYGV AQHIINLVTQ SMRWLEGRLQ RTPAQEIISG ALGLICGLII ANLLGASFFH LPLVGPYIPM VGSILFGYLG WSLGTKRRDE VWSLFNIFPR WGGKERDKGK GESVRSGAKI LDTSVIIDGR IADIIKSGFI EGTIVIPAFV LEELRHIADS SDLLKRNRGR RGLDILNKIR KETGITVKVS EVDFDDLTEV DSKLVRLAQK MGAPVLTNDY NLNKVAELQG VRVLNINELA NAVKPVVLPG EEMTVQVIKD GKEMGQGVAY LDDGTMIVVE NGRRFIGQTI AVLVTSVLQT AAGRMIFARP KAADRKLGAH HQALERSEYQ CLS
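The sequences above are printed in blocks of 10 characters, so the spaces have to be removed before any analysis. The following is a minimal sketch of a few sanity checks on the gene sequence; the helper names are hypothetical. The start check accepts only ATG, which is a simplification because translation table 11 also permits alternative start codons.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace that splits the sequence into blocks and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def looks_like_complete_cds(dna: str) -> bool:
    """True if the length is a multiple of 3, it starts with ATG, and it ends with a stop codon."""
    return (
        len(dna) % 3 == 0
        and dna.startswith("ATG")
        and dna[-3:] in {"TAA", "TAG", "TGA"}
    )


# Toy input for illustration only; paste the full gene sequence for a real check.
toy = clean_sequence("ATGAAA GGCTGA")
print(len(toy))                      # 12
print(looks_like_complete_cds(toy))  # True
```

The gene sequence shown here begins with ATG and ends with TGA, which suggests it is given in coding orientation even though the gene lies on the minus strand. Running `gc_fraction` on the cleaned full sequence gives a value to compare against the listed GC content.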