Gene Information |
Locus tag | Moth_1000 |
Symbol | |
ID | 3833303 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1028701 |
End bp | 1029957 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637828929 |
Product | hypothetical protein |
Protein accession | YP_429858 |
Protein GI | 83589849 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
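The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon. The function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1028701
END_BP = 1029957


def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1257
print(protein_length(length_bp))  # 418
```

Both results agree with the Gene Length (1257 bp) and Protein Length (418 aa) rows.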
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00447157 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGATAG AAGCAGCGAC AGGGAAAAAA GCCGGTCGCT TTCAGGCCCT GGCGGCCTTG AAACATCGTA ATTTTCGCCT TTTCTGGTCC GGACAGTTGA TCTCATTGAT AGGCACCTGG ATGCAGAATA TGGCCCAGGG ATGGCTGGTG TTGCAGTTGA CCAACTCGCC TTTCCTGCTG GGTTTGGTTA GTGCCATCCA GTTTACGCCC CTGCTGGTCC TGGCCCTGGT TGCCGGAGTG GTGGCCGACA GGGTACCCAA AAGGCGCCTG CTGATTTTTA CCCAGTCCAG CCTCATGCTT CTGGCCTTTA CCCTGGGGAT TTTGACCCTT ACCGGGGCGA TCCGGTACTG GCAGGTACTT ATCCTGGCCG GGTTGCTGGG CATGGTCAAT ACCTTTGATA TGCCGGCCCG TCAGGCTTTT GTCGTGGAAA TGGTAGGTAA GGGCGATCTG ATGAATGCCA TTGCCCTCAA TTCCTCGATT TTTAATGCCG CCCGGATTGT AGGTCCGGCC TTGGCCGGGC TGGTCATTGG CCGTCTGGGC ATGGCGGCCA GTTTTCTCCT CAATGGAGCC AGTTTCCTGG CAGTTATCGC CGGCCTGCTG TTAATCAGGA TACCCGAGAA AATCGACTGG CACCACCGGG TGGCAGAGGG GATGGGGGAA AGGATTGCCG AGGGCCTGCA ATATATTCGC CGGACACCAG TCGTCCTGCG CACCGTTGTC CTGATGGCCC TCCTCAGTAT CTTCGCCATG AATTTTAGCG TCCTGATCCC GGTCCTGGCC CGGGATACCC TGGGACAGCA GGCAGAGGGT TACGGACTGC TCATGTCGGC CTCCGGGGTG GGGGCTTTAT GCGGGGCCAT CTTTCTGGCG GTGTTCAGCA GCCGCGGTCC CAGCCCGTGG TTACTTCTGG GGGGTGCGGC CGGTCTCTGC CTCTTCCAAC TCTTTCTAGC GGGTACCCAT TCCTATACCC TGGCCCTTTT ATTCCTGGGG CTTACCGGCT GGTCAATGAT AACCTTTACA GCCTCGGTCA ATACCACCCT GCAGTTGAAT GTGCCGGATA ACTTGCGGGG GCGGGTCATG AGTGTCTACT CCCTCGTCTT CGGCGGGGTG ACGCCCATCG GGAGCTTGTT TAGCGGGAGC ATCGCCCACC TGTGGGGGGC ACCTGCCGGC CTTGCAGCAG GTGCAACCAT AGGCCTTATA AGCTTGCTGG CCGTTGCCGG CCAGACCTGG CGGTGTAAAC CAAATGCAGG CGTCTAA
|
Protein sequence | MEIEAATGKK AGRFQALAAL KHRNFRLFWS GQLISLIGTW MQNMAQGWLV LQLTNSPFLL GLVSAIQFTP LLVLALVAGV VADRVPKRRL LIFTQSSLML LAFTLGILTL TGAIRYWQVL ILAGLLGMVN TFDMPARQAF VVEMVGKGDL MNAIALNSSI FNAARIVGPA LAGLVIGRLG MAASFLLNGA SFLAVIAGLL LIRIPEKIDW HHRVAEGMGE RIAEGLQYIR RTPVVLRTVV LMALLSIFAM NFSVLIPVLA RDTLGQQAEG YGLLMSASGV GALCGAIFLA VFSSRGPSPW LLLGGAAGLC LFQLFLAGTH SYTLALLFLG LTGWSMITFT ASVNTTLQLN VPDNLRGRVM SVYSLVFGGV TPIGSLFSGS IAHLWGAPAG LAAGATIGLI SLLAVAGQTW RCKPNAGV
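The sequences above are printed in space-separated blocks of ten, so they need cleaning before any analysis. The following is a minimal sketch, not the database's own method, of how the gene sequence could be cleaned, its GC content computed, and its translation compared with the protein sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean_sequence`, `gc_percent`, and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
fragment = clean_sequence("ATGGAGATAG AAGCAGCGAC AGGGAAAAAA")
print(translate(fragment))  # MEIEAATGKK
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content row.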
|
| |