Gene Moth_2371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2371 
Symbol 
ID3832551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2496891 
End bp2497928 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content63% 
IMG OID637830290 
Productrod shape-determining protein Mbl 
Protein accessionYP_431196 
Protein GI83591187 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG1077] Actin-like ATPase involved in cell morphogenesis 
TIGRFAM ID[TIGR00904] cell shape determining protein, MreB/Mrl family 


Plasmid Coverage information

Num covering plasmid clones56 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000614585 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTTGGCT TCGGTCAGGA TATAGGTATT GATTTAGGGA CGGCCAGCGT CCTGGTCTAC 
CTCCAGGGTA AAGGGATTGT CCTCCGGGAA CCTTCGGTGG TGGCCCTGGA CCGGGACAGC
GGCCAGATAT TTGCCGTGGG GGAAGAAGCC CGGCGCATGC TGGGGAGGAC GCCGGGAAAT
ATTATCGCCC TGCGCCCTTT ACGGGACGGG GTTATAGCCG ACTACGACAG CACCGAAAAG
ATGCTACGCT ACTTTATTGA TAAAGCCTGC GGCCGCCAGG GCTTCCTCCG GCCAAGGGTC
ATGGTCTGCA TACCCTCCGG GGTCACCGGG GTGGAGGAGC GGGCCGTGCG CCAGGCGGCC
CTGCAGGCCG GGGCCAAGCA GGCCTTTGTC ATTGAAGAGC CCCTGGCGGC GGCCCTGGGC
GCCGGCCTGG ATATCGCCGA GCCCAGCGGT TCCATGGTGG TGGACATCGG CGGCGGCACC
ACCGACATTG CCGTCCTTTC CCTGGGGGGC ATCGTCTGTA GCAATTCTCT GCGGGTCGCC
GGGGACAAAA TGGATGAAGC CATCGTCCGC TATATCCGGC GCGAGCACAA CCTGATGATC
GGCGAGCGCA GCGCCGAAGA ATTAAAAATG AAAATCGGCA CGGTCCACCG CTCCGTCGGC
GAAGGTGAGA GTATGGACAT CCGCGGGCGC GACCTGGTGA CCGGCCTGCC GAAGACGGTG
AATATCACCT CCCTGGAGAT CTTTACCGCC CTCCAGGAAC CAGTCCAGCA GATTGTCGGG
GCGGTGAAGG AGGTCCTGGA GCAGACGCCA CCGGAGCTGG CCGCCGATCT GGTCAACAAG
GGGATCGTCA TGACCGGGGG CGGCAGCCTG ATCCGTGGCA TTGACGTCCT CCTGAGCGAG
GAGACTGGCC TGCCGGTCTA TATCGCCGAC GACCCCATCT CCTGCGTCGC CCTGGGTACC
GGCAAAGCCC TGACCATGCT GGGGGTGTTA AAGCAGAGCA ATCCTTCGGA GGGACGGCGC
CCGGTCCTGA AACGTTAA
 
Protein sequence
MFGFGQDIGI DLGTASVLVY LQGKGIVLRE PSVVALDRDS GQIFAVGEEA RRMLGRTPGN 
IIALRPLRDG VIADYDSTEK MLRYFIDKAC GRQGFLRPRV MVCIPSGVTG VEERAVRQAA
LQAGAKQAFV IEEPLAAALG AGLDIAEPSG SMVVDIGGGT TDIAVLSLGG IVCSNSLRVA
GDKMDEAIVR YIRREHNLMI GERSAEELKM KIGTVHRSVG EGESMDIRGR DLVTGLPKTV
NITSLEIFTA LQEPVQQIVG AVKEVLEQTP PELAADLVNK GIVMTGGGSL IRGIDVLLSE
ETGLPVYIAD DPISCVALGT GKALTMLGVL KQSNPSEGRR PVLKR