Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1519 |
Symbol | |
ID | 3831984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1565157 |
End bp | 1566371 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637829451 |
Product | major facilitator transporter |
Protein accession | YP_430371 |
Protein GI | 83590362 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.120358 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.496216 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGGAAAC GAATTCTCTA TCTTTTAAGC CTGGGGCATA TGGTTACCGA TATCAACCAG GGCCTCCTGC CCATCTTCCT TTCCGTCTAT AAAGATCAGT ATGGCCTGAG CTATGCAGCT GCAGGCCTGG TAGTCTTCCT TTCTAATATC AGTTCTTCGG TAATCCAACC ACTCTTCGGC TACTGGTCGG ACCGGCACCA GCTGCGCTGG CTGCTCCCTG CAGGTTGCCT GGTGGCTGGG CTGGGGATGG TGGCCGCGGG CTATGCCGCC AATTACTACC TCTTGGTAGC AGCCGTCTTT ATCAGCGGCC TGGGAGTTGC CGCCTATCAC CCGGAGGCCT CTAAATCGGC CCACTACATC AGCGGGCCTA TGCAGGCCAG TTCCATGTCT ATTTTTTCCG TCGGCGGCAA CGTGGGCTTT GGCCTGGGAC CCCTCATTGC CACCTTTATC CTCAGCCATG GCGGGCTCCG GGCCTCCTGG ATAATCCTTA TTCCTACATC TGTTACTGTA GCCCTGCTCA GCCATACCCT GCCTGCCCTG GGCCGGGCCA TGGCCGGGGG CCATCAAGCA GCCCCCGCCG GTAACCCCCG GGCAAAAACA AGCCAGGCCC CCCTGGTAGC CATTACCCTG CTGGTGCTGG TGGTCATCAT GCGTTCCTGG CTCCAGAGCG GCCTTACCTA TTACATCCCC TTCTATTACC TGAATTTCCT CCACGGGAAT GAACACTTCG CCAGCAGCAT GCTGTCTATT TTCCTCATTG CCGGGGCGGT GGGCACCCTG GTGGGTGGGC GGCTGGCCGA CTACTGGGGC GCAAAGAGGA TGATAATTGG CTCGATGCTG GTCCTCATCC CCATGGTGGT GACTTTCCCC TATGTCCAGG GAGCCTGGGT GGCCATCCTG CTCGCCCTGA GCGGTTTTGC CCTGGTTTCA ACCTTTGCCC CGGCCATTGT CCTGGCCCAG AATATTCTCC CCGAACACGT CGGTATGGCC TCGGGCCTGA TGATGGGATT TGCCATCGGC ACCGGCGGCC TGGGGACCTT CCTCCTGGGG GGTATTGCCG ACGCCCACGG GGTACCCTTC ACCATCCAGA TCATGGCCGT TATCCCGGCC ATTGGCGCCG CTCTGGCTTT CTTCATCCCG GACCTGCGCC AGCAACCGTC CGCCCGGCAA GTCAGAGTTG AACCAGTGCC GGAAGAAAAA AATAGCCGGG CTTGA
|
Protein sequence | MRKRILYLLS LGHMVTDINQ GLLPIFLSVY KDQYGLSYAA AGLVVFLSNI SSSVIQPLFG YWSDRHQLRW LLPAGCLVAG LGMVAAGYAA NYYLLVAAVF ISGLGVAAYH PEASKSAHYI SGPMQASSMS IFSVGGNVGF GLGPLIATFI LSHGGLRASW IILIPTSVTV ALLSHTLPAL GRAMAGGHQA APAGNPRAKT SQAPLVAITL LVLVVIMRSW LQSGLTYYIP FYYLNFLHGN EHFASSMLSI FLIAGAVGTL VGGRLADYWG AKRMIIGSML VLIPMVVTFP YVQGAWVAIL LALSGFALVS TFAPAIVLAQ NILPEHVGMA SGLMMGFAIG TGGLGTFLLG GIADAHGVPF TIQIMAVIPA IGAALAFFIP DLRQQPSARQ VRVEPVPEEK NSRA
|
| |