Gene Moth_0641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0641 
Symbol 
ID3832037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp670767 
End bp672305 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content56% 
IMG OID637828582 
Producthypothetical protein 
Protein accessionYP_429512 
Protein GI83589503 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0502748 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAGCG AGAAATGGTA CGCAAAGCTG GTCAATAAGA TAAAACGATC CTACCACGAC 
GTGGTCGTGG TTGTTGACCA TGACCGCCTC GGCCGGTTGC CGGAGTTGCG GGCGGCGCTC
GCTGATACTT TTGCCCTTCA CGACTACAAG GGGGAGCTGC CATTGCGGCG TTTCCTCAGG
GAGCATACCG GCCAGCGCAT GCTTATTTTT AAGCACCCTG AGCACGGCCA CCTGCCGTAC
GATGTGGAGA CCAGGTCGGA CGTGATTTCT TGGCATCTCC AGGAAATTTT CCCTAAATTG
CACCCGGCCG CGCTTGAAGG TTTGTCGGAA GAGGACTACC AGCGGGTCTT TGAAGCTTAC
GTGCAGAAGG AGAATACATT CCAGGTGCTG GGCCTGGAGG AAACCAAGGA AGTGCTGGCC
GGGTGGCTGG GTACGTCGAT GGTCGGCGGG TTCGTCCCGG ATATCGTTGC CAGACCTAGT
CAGGAAGGGG AAGGCGGGAT AAAATGCCGC TGCCAGGCCC TGGTCCGGGA GATTGAGGGA
CTCCTGTCGC GCCGCCCGGT TGACTGGCGG GCTGTAGCCC CCCTGTGGGG TGAGTTATCC
TATTGTTACT GCCAGGCGAG GAGCAAACCG CCGGAGATAG ATGCTCTGGA CCAGAAGATT
AGTGGAGCAT TTACCGAATA TATCCTGACT AGTTATCATG AGCTTTTCTA CGAAAGCTAT
CTCACCAGGC CGGCCACCGT TGATAAGGTG CTACCCTTCC TGGCCTACCA GCCGGCTGCC
AAAAAGGTAT TGATATGTAT GGATGGCATG GGGTTCCAGG AATGGTGCTG CCTGAAACAA
TATCTGGCTG GTCGCGGGAT AGATAGCTTT AGCGTTACGG CTGTCTTCAC CCTGCTGCCT
ACCTTGACCC GGGTTTCACG GCGCGCCCTG TTTTGTGGCC GGCCTGCTCT TGGGGACCGC
GTGGAGGAAG AAAGGGGCTT TTTGCAGTTT GTCCGGGAGA AATGGCTAGA AGGAGAGCGA
CGGCAGGCGG GGGTGTTCAT GAATATAGAT GGCAGGTGGC GGCATGAGTA TTTGGATTTT
GACTACCTCG CCCTGGCCTG TAACCTGGTG GACGATCTGG CTCACGCCTC TGTGAGTGTG
CAGGATAGCA AGGAATTGAT GCAGAAAAGC CTTATTATGC ATCTAGATGG CTCCGGATTT
GCCGAGACCA TGCAGCGGCT GCTGGAGGAG GGTTACCGGG TTTACCTCGT CGCCGATCAC
GGTTCGGTAT GGTGCCGGGG CAACGGCCAC CAGGCCAGCA AGTGGCTGTT GGAAGAGAAG
GCCAGGCGGG CGCTGTTATT CCCCAACAAG CTTCTGGCAG AGGATTTTGC GGCGGGGAAA
AACCTAATTG TATACGAAAA CAGCAGTCTT TTTGGAGATG CAGTGGCCGT CTTTCCGCCC
GGTCGGGAGA TGTTTGGGCC AAAGGGGGAA ACGGTGATTA GCCACGGCGG CATCCACATA
GAAGAAGTTA TTGTTCCCTT CATCGAAGTG CAGGCATAA
 
Protein sequence
MMSEKWYAKL VNKIKRSYHD VVVVVDHDRL GRLPELRAAL ADTFALHDYK GELPLRRFLR 
EHTGQRMLIF KHPEHGHLPY DVETRSDVIS WHLQEIFPKL HPAALEGLSE EDYQRVFEAY
VQKENTFQVL GLEETKEVLA GWLGTSMVGG FVPDIVARPS QEGEGGIKCR CQALVREIEG
LLSRRPVDWR AVAPLWGELS YCYCQARSKP PEIDALDQKI SGAFTEYILT SYHELFYESY
LTRPATVDKV LPFLAYQPAA KKVLICMDGM GFQEWCCLKQ YLAGRGIDSF SVTAVFTLLP
TLTRVSRRAL FCGRPALGDR VEEERGFLQF VREKWLEGER RQAGVFMNID GRWRHEYLDF
DYLALACNLV DDLAHASVSV QDSKELMQKS LIMHLDGSGF AETMQRLLEE GYRVYLVADH
GSVWCRGNGH QASKWLLEEK ARRALLFPNK LLAEDFAAGK NLIVYENSSL FGDAVAVFPP
GREMFGPKGE TVISHGGIHI EEVIVPFIEV QA