Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0574 |
Symbol | |
ID | 3832487 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 597213 |
End bp | 598178 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637828515 |
Product | bile acid:sodium symporter |
Protein accession | YP_429447 |
Protein GI | 83589438 |
COG category | [R] General function prediction only |
COG ID | [COG0385] Predicted Na+-dependent transporter |
TIGRFAM ID | [TIGR00841] bile acid transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACAA AGAAAAACCG TTCCTTGCTG GAGATCATTC CTAAGTACTT TACATTATGG GTGATAGTTT TTGCCGCCCT GGCCCTGCTC AGCCCCAACT CCTTTCAATT CCTTGGCAAA TATATTTCTT ATCTCCTGGG CGTGGTCATG CTGGGCATGG GCATGACCCT GACTATGGGG GATTTTGCCG GGGTTCTGCA GCAGCCATTA AATGTGGTAG TTGGTGTGGC CCTCCAGTTT ATCATTATGC CCTTGCTGGG CTTTGCCATT GCTACCATAT TACGATTGCC ACCGGAGCTG GCCGCCGGGG TGGTACTGGT GGGGTGCGTC CCTTCCGGGA CGGCCTCCAA CGTGATGACC TTTATTGCTC AAGGAGACGT AGCCCTGTCG GTAACCATAT CTTCGATCAC GACCCTGATA GCACCTTTTA TTACTCCGTA CCTTTACTTG CTCCTGGGCG GGAAGTTTAT TCCCGTAGAA CCCCTGGCCC TGCTTATTGA CATCGCCAAG ATTGTCCTGC TGCCGATTAT TATCGGCCTG GTCATCAGGC AGGTGCTGGG CAATGAACGG GCCAGGGTGG TTAACCAGGT AATGCCCTCA GTTTCCGTCA TCGCCATCGT GATAATTATC GCCGCTGTGG TGGCCGGTAG CGCCGCCAAA CTCGTCAACG TCGCCGGCGC TGTGATCCTC GCCGTAATCC TCCATAATGG ATTGGGTTTC CTCATGGCCT ATTTTGTCGC TAGATACCTC TGCCGCATGA CCGAGGCCCA GGCCCGGGCC GTTTCCTTCG AAGTGGGTAT GCAGAACTCT GGCCTGGGGG CGGCCCTGGC CATGAAGTTC CTTACCCCGG TGGCGGCTTT GCCCAGCGCC ATCTTCAGTG TCTGGCACAA CTTGAGCGGT TCCTTCCTGG CTAATTTCTG GGCCCGGCGC GCGCCGGCAC CGGCCGCCCG GCTGGCCAGG AGGTAG
|
Protein sequence | MATKKNRSLL EIIPKYFTLW VIVFAALALL SPNSFQFLGK YISYLLGVVM LGMGMTLTMG DFAGVLQQPL NVVVGVALQF IIMPLLGFAI ATILRLPPEL AAGVVLVGCV PSGTASNVMT FIAQGDVALS VTISSITTLI APFITPYLYL LLGGKFIPVE PLALLIDIAK IVLLPIIIGL VIRQVLGNER ARVVNQVMPS VSVIAIVIII AAVVAGSAAK LVNVAGAVIL AVILHNGLGF LMAYFVARYL CRMTEAQARA VSFEVGMQNS GLGAALAMKF LTPVAALPSA IFSVWHNLSG SFLANFWARR APAPAARLAR R
|
| |