Gene Moth_0465 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0465 
Symbol 
ID3832403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp469181 
End bp470224 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content58% 
IMG OID637828400 
ProductABC transporter, substrate-binding protein, aliphatic sulphonates 
Protein accessionYP_429339 
Protein GI83589330 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.176308 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGAA AACTAAAGCG AATCGCCATC TTGTTCCTGG CGCTGGCCCT GGCCGGCCTG 
GCCCTGGCCG GTTGCGGGGG CGGTAGCAAA CAGCAGGCTG CAGACCAGCA GACCGGGAGT
AGCGGCCAGG GGCAACAACT CACCCCCGTC AAGCTCACCA TGACCACCTG GTCGGGCTAC
GGTCCCCTCT TCCTGGCCCG GGATAAAGGC TTCTTCAAGA AACACGGCCT GGATGTACAG
TTAATCGTCA TCCAGGGCCT GGGCGAGCGC AAGCAGGCCC TGGCCGGTAA CCAGGTGGAC
GGCATCGCTA CCACCCTGGA TATTGAAACC CAGATTGTAG CTGCGGGCAT ACCCCTGAAA
CAGATCTGGG CCCTGGACGA TTCCTATGGC GGCGACGGCA TCCTGGCCAA ACCGGAGATC
AAGACCATCA AGGACCTCAA AGGTAAAAAC GTAGCCTACG ACTTCGGCAC CGCCAGCCAC
ATCCTGCTCC TCTCCATCCT GGCCAAAAAC GGCATGACTG AAAACGACAT CCACCACGTC
CAGATGTCAG CCAGCGACGC CGGGTCGACC TTCGTGGCCG GCAAAGTAGA TGCCGCCGTC
ACCTGGGAAC CCTGGCTGAG TAAGGCCGTT AAAGAGAACA AGGGTAACCT CCTGGCAACC
TCCAAGGAGA CCCCTGGGCT GATTATGGAT ACAGTCGCCC TCCGGAGCGA CTGGGCCGAC
AAACACCCCC AGGCTCTCCA GGCCATGGTC GACGCCCTGG CGGAAGCCAT GCAGTACTGG
GAAAGCAATA AGGCCGAAGC CAATGCCATT ATGGCCAAGG GACTGGGCAT CAAACAGGAA
GAGTTCGAGA GCAACCTGCA GACCCTGCGC CTCTTCAACC TGGCCCAGAA CAAGGAGATG
TTCGGCACGG CCGACAAGCC AGGAACCCTC TACACCTCCT TGCAGCAGGC AATCGACTTC
GGCTTTAACA ACAAAGTAAT TAAATCCAAA CCCGATGCTA AAGCCATGAT CGACCCGACC
TTTGTCAACA GGGCGAAAAT ATAA
 
Protein sequence
MKRKLKRIAI LFLALALAGL ALAGCGGGSK QQAADQQTGS SGQGQQLTPV KLTMTTWSGY 
GPLFLARDKG FFKKHGLDVQ LIVIQGLGER KQALAGNQVD GIATTLDIET QIVAAGIPLK
QIWALDDSYG GDGILAKPEI KTIKDLKGKN VAYDFGTASH ILLLSILAKN GMTENDIHHV
QMSASDAGST FVAGKVDAAV TWEPWLSKAV KENKGNLLAT SKETPGLIMD TVALRSDWAD
KHPQALQAMV DALAEAMQYW ESNKAEANAI MAKGLGIKQE EFESNLQTLR LFNLAQNKEM
FGTADKPGTL YTSLQQAIDF GFNNKVIKSK PDAKAMIDPT FVNRAKI