Gene Moth_1077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1077 
Symbol 
ID3833190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1107769 
End bp1109253 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content56% 
IMG OID637829005 
Productvesicle-fusing ATPase 
Protein accessionYP_429934 
Protein GI83589925 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTTAAGG AGACTAGTGC CGGTACCGTT ATTGGAGTAC TGGTATTTCT GACCCTTAAA 
GGCTTTGATA TCACCCCTCT TCTTCTCCTG GCCGGGGTAG GATTCCTCGC TTATTGGTTG
ATTGAAAAAA AGGGTATGCT TTCTTCAACC TTTGGTGGTA AATCCTACAT CCCGGCAGTC
CCTATCTCCT TTGCCGACAT TGGCGGCCAG GAAACAGCCA TCCGGGAACT GAAAGAAGCC
CTGGAGTTTT TATTGCGGAG CAAGCAGGTA AGGGCGATGG GTATCCGCCC CTTGAAGGGC
ATCCTTTTAA GCGGGCCGCC GGGAACCGGT AAGACCCTGA TGGCCAGGGC GGCAGCTACC
TACACCGATG CCGTCTTCCT GGCTGCCAGC GGTAGTGAGT TCATCGAGAT GTATGCCGGG
GTTGGTGCCC AGCGGGTGCG GGAACTTTTT AACCGGGCCC GGGAACTGGC CCGGCGCCAG
CAAAAAAAAC GGGCCATTAT CTTTATTGAC GAATTGGAAG TCCTGGGGGG TAAACGGGGA
AGCCATACCT CCCACCTGGA ATACGACCAG ACCCTTAATC AGCTCCTGGT CGAGATGGAC
GGTTTAAAAG AAGACCAGGA TGTCCAGATT CTAGTTATTG GAGCCACCAA CCGGGCCGAT
CTCCTTGACC CGGCCCTCCT GCGGCCGGGA CGTTTTGACC GGCAGGTCAG GGTTGATTTA
CCAGACCGGG AGGGGCGCCT GGCCATTTTA AAACTGCATA CAGCCAATAA ACCCTTAGGA
CCGAAAACCG ACCTGGAGGC TATAGCCAGG GAGACCTTCG GCTTTTCCGG AGCCCACCTG
GAAAGCGTGG CCAACGAGGC CGCTATCCTC GCCCTGCGGG AAAAAGCACC GGTCATTGAG
CAGCGGCACC TGCTGGAGGC GGTTGATAAA GTAATGCTGG GCGAGAAACT GGGCCGTAAG
CCGACGCCGG AAGAGTTATA CCGGCTGGCC ATCCATGAAG CCGGCCACGC CGTAGCCGCT
GAGCTGTTAC GACCAGGGTC GGTTTCCCAT GTCACCATAA CCTCCAGGGG CCAGGCCCTG
GGCTATACCA GGCAAAAGCC CGAGCACGAC ATCTACCTTT ATACCCGGGA CTATCTCGAA
GTTCAGATAG CCATCTGCCT GGCAGGGGCG GTGGCCGAGA CCCTGGTCCT GGGCAACCGC
AGTACAGGTT CCCTCAACGA TTTTAAAGAA TCCATCCGCC TGGCCCGAGT AATTATAACT
TCGGGTTTAT CAGACCTGGG GGTAGTTGGG GAAGAGAACC TATCCAAGGA GCAAATGCAC
ACCGCCTCGA CAACAATTAT TAACCAGGAA GAAGAAAAGG TCGTTTCCCT CCTGCAACCC
CACCTGCCGG TTTTAAAGGA ACTGGCCCGC AATCTGGTAG AAAAGGAAAC TATCACTGGT
CGGGAATTAC GGAACCTCCT GCAGGAGCAG GCCAAAGCTT CTTGA
 
Protein sequence
MLKETSAGTV IGVLVFLTLK GFDITPLLLL AGVGFLAYWL IEKKGMLSST FGGKSYIPAV 
PISFADIGGQ ETAIRELKEA LEFLLRSKQV RAMGIRPLKG ILLSGPPGTG KTLMARAAAT
YTDAVFLAAS GSEFIEMYAG VGAQRVRELF NRARELARRQ QKKRAIIFID ELEVLGGKRG
SHTSHLEYDQ TLNQLLVEMD GLKEDQDVQI LVIGATNRAD LLDPALLRPG RFDRQVRVDL
PDREGRLAIL KLHTANKPLG PKTDLEAIAR ETFGFSGAHL ESVANEAAIL ALREKAPVIE
QRHLLEAVDK VMLGEKLGRK PTPEELYRLA IHEAGHAVAA ELLRPGSVSH VTITSRGQAL
GYTRQKPEHD IYLYTRDYLE VQIAICLAGA VAETLVLGNR STGSLNDFKE SIRLARVIIT
SGLSDLGVVG EENLSKEQMH TASTTIINQE EEKVVSLLQP HLPVLKELAR NLVEKETITG
RELRNLLQEQ AKAS