Gene Moth_0665 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0665 
Symbol 
ID3832152 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp696946 
End bp698412 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content35% 
IMG OID637828604 
Producthypothetical protein 
Protein accessionYP_429534 
Protein GI83589525 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000378366 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000415551 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTGATTA ACTCTTTTAA TAATTATAAT AAATTAAGCC GATCTTTAAA AAATAGTTTA 
TCTTTGTGTT TACATTTTTT AACATTATTT TCAATGATCT TGGCAGCTAT ATTAGCAATT
ATTCATCTTG CGCAACTTTC AAGATGTTGG TTAATTGTTT CCGTATTTAT TGCTATGATA
GTAGCTTTTA TTCTTGGAAA CCCTCATGGG ATTATAACAC GTTCCTCTAC CAAATCATTT
CTTTATATTA TGTTAGTAGT AACTCTATTA ATTCGCATGA TATGGATAAT TATGGTACCA
TCTTTACCGG TATCAGATTT TGCGGGATAT CATTTCATGG CTCTTGACAT GTCACAGCTC
GATTTTAAAC ATGACACAGT ACAGGAAGTA GGATATCCTG TCCTGCTCGG AATACTTTAT
GCAGTATTCG GAGGCCATGT GCTTGTAGGA AAAATACTTA ATGTAACTTT AAGTTTATTA
ACGGCTTTGG TGTTATTTCT TTTAGCCCGC GAAGCCTTTA GTGAATTCTC GGCGCGAAAC
GCTTTGATCT TGTTTTCGCT ATGGCCTGCT CAAATAATGA TGAATAGCGT CTTGGCTTCA
GAAGGACCAT TTTTACTTAT GTTTTTAGTG GTCTTGCTAT TGCTTGTTAA GGCAAAATCT
GATGACTCCA AACGTCGTCA AGCATTATGT TTTATGGCGG GTCTTATACT CGGGCTATCT
TGTACTATTA GAGCGGTAGG GTTTCTTCTA CTTTTTGTTG TTTTAGCCTA CATATGCTTT
ATAGATGGTA ACAAGACCAA CAAATGGCAA ATACTAAAAT TATTGCTGGC AGGCTTTTTT
TTGGTGGTTA TACCATATTA CCTATTTCGA TGGTTAACCT TCAAGATCCC ACCTACTGTA
TCCTCTTTAC CTTTTAATTT ACTTTATGGA ACTAACATTG GATATATAGG TATGTGGAAT
CCCGAGGACG CTGCTTTAGC TCATAAACTG ATCGAACAGT ACGGTTCAAG GGCTTCGAGA
TATATACTTG GAGTTGCATT TTCTAGGATG ATCTCTAATC CCATGGGCCT TTTGAAGCTG
ATGTGTAATA AATTTGAGGT AATGTGGAGT GACGATGCGT ACGGTGCATA TTGGAGTACT
ATTAACATTG CTCCTAACTT AATTGAGATC CCTATTATAC GTGTAGACCT ACTTTATATA
TTGTCCCAAC TTTATTATGT TTTTATGTTA TGTTTAGCAA TCATCGGCCT GATTAAGATA
CAAAAAACAA TAAAGATAAA GATGCAAAGA GATAAGATAT ATAACGTTTC GTTGCTTTTC
GTTTTAGTTA TTATTGGATT TGTAATCCTA CATTTATTTA TTGAGGTTCA ATCTAGATAC
CACTATCCAG TAGTTGCCAT AATAATTTTA TTAGCCGGTT ACGGTATTAC AGAAAATAGT
GTCAACTTTA AGAACGATAG CATCTAG
 
Protein sequence
MLINSFNNYN KLSRSLKNSL SLCLHFLTLF SMILAAILAI IHLAQLSRCW LIVSVFIAMI 
VAFILGNPHG IITRSSTKSF LYIMLVVTLL IRMIWIIMVP SLPVSDFAGY HFMALDMSQL
DFKHDTVQEV GYPVLLGILY AVFGGHVLVG KILNVTLSLL TALVLFLLAR EAFSEFSARN
ALILFSLWPA QIMMNSVLAS EGPFLLMFLV VLLLLVKAKS DDSKRRQALC FMAGLILGLS
CTIRAVGFLL LFVVLAYICF IDGNKTNKWQ ILKLLLAGFF LVVIPYYLFR WLTFKIPPTV
SSLPFNLLYG TNIGYIGMWN PEDAALAHKL IEQYGSRASR YILGVAFSRM ISNPMGLLKL
MCNKFEVMWS DDAYGAYWST INIAPNLIEI PIIRVDLLYI LSQLYYVFML CLAIIGLIKI
QKTIKIKMQR DKIYNVSLLF VLVIIGFVIL HLFIEVQSRY HYPVVAIIIL LAGYGITENS
VNFKNDSI