Gene Moth_0574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0574 
Symbol 
ID3832487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp597213 
End bp598178 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content56% 
IMG OID637828515 
Productbile acid:sodium symporter 
Protein accessionYP_429447 
Protein GI83589438 
COG category[R] General function prediction only 
COG ID[COG0385] Predicted Na+-dependent transporter 
TIGRFAM ID[TIGR00841] bile acid transporter 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACAA AGAAAAACCG TTCCTTGCTG GAGATCATTC CTAAGTACTT TACATTATGG 
GTGATAGTTT TTGCCGCCCT GGCCCTGCTC AGCCCCAACT CCTTTCAATT CCTTGGCAAA
TATATTTCTT ATCTCCTGGG CGTGGTCATG CTGGGCATGG GCATGACCCT GACTATGGGG
GATTTTGCCG GGGTTCTGCA GCAGCCATTA AATGTGGTAG TTGGTGTGGC CCTCCAGTTT
ATCATTATGC CCTTGCTGGG CTTTGCCATT GCTACCATAT TACGATTGCC ACCGGAGCTG
GCCGCCGGGG TGGTACTGGT GGGGTGCGTC CCTTCCGGGA CGGCCTCCAA CGTGATGACC
TTTATTGCTC AAGGAGACGT AGCCCTGTCG GTAACCATAT CTTCGATCAC GACCCTGATA
GCACCTTTTA TTACTCCGTA CCTTTACTTG CTCCTGGGCG GGAAGTTTAT TCCCGTAGAA
CCCCTGGCCC TGCTTATTGA CATCGCCAAG ATTGTCCTGC TGCCGATTAT TATCGGCCTG
GTCATCAGGC AGGTGCTGGG CAATGAACGG GCCAGGGTGG TTAACCAGGT AATGCCCTCA
GTTTCCGTCA TCGCCATCGT GATAATTATC GCCGCTGTGG TGGCCGGTAG CGCCGCCAAA
CTCGTCAACG TCGCCGGCGC TGTGATCCTC GCCGTAATCC TCCATAATGG ATTGGGTTTC
CTCATGGCCT ATTTTGTCGC TAGATACCTC TGCCGCATGA CCGAGGCCCA GGCCCGGGCC
GTTTCCTTCG AAGTGGGTAT GCAGAACTCT GGCCTGGGGG CGGCCCTGGC CATGAAGTTC
CTTACCCCGG TGGCGGCTTT GCCCAGCGCC ATCTTCAGTG TCTGGCACAA CTTGAGCGGT
TCCTTCCTGG CTAATTTCTG GGCCCGGCGC GCGCCGGCAC CGGCCGCCCG GCTGGCCAGG
AGGTAG
 
Protein sequence
MATKKNRSLL EIIPKYFTLW VIVFAALALL SPNSFQFLGK YISYLLGVVM LGMGMTLTMG 
DFAGVLQQPL NVVVGVALQF IIMPLLGFAI ATILRLPPEL AAGVVLVGCV PSGTASNVMT
FIAQGDVALS VTISSITTLI APFITPYLYL LLGGKFIPVE PLALLIDIAK IVLLPIIIGL
VIRQVLGNER ARVVNQVMPS VSVIAIVIII AAVVAGSAAK LVNVAGAVIL AVILHNGLGF
LMAYFVARYL CRMTEAQARA VSFEVGMQNS GLGAALAMKF LTPVAALPSA IFSVWHNLSG
SFLANFWARR APAPAARLAR R