Gene Moth_0370 details

Gene Information

Locus tag: Moth_0370
Symbol:
ID: 3832726
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 374694
End bp: 375770
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 58%
IMG OID: 637828305
Product: hypothetical protein
Protein accession: YP_429247
Protein GI: 83589238
COG category: [D] Cell cycle control, cell division, chromosome partitioning
COG ID: [COG4942] Membrane-bound metallopeptidase
TIGRFAM ID:
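
The coordinates and lengths listed above agree with each other, which a little arithmetic confirms. A minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (neither convention is stated in this record):

```python
# Values copied from the Gene Information fields above.
start_bp = 374694
end_bp = 375770

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1077 358
```

Both results match the Gene Length and Protein Length fields.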


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000001896
Plasmid hitchhiking: No
Plasmid clonability: unclonable

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.85834
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCAACGGC TCATAACCTT GCTGGCCATC TTTTTCTTGC TAATTTCCCC GGTTCCCCCG 
GCTTCGAGTA CCAGTATTAC CGACACCCTG AAGCAGCGGC TCCTGGATAA TGAGAATCAA
GAAAACCGCC TCCTGCAAGA GATTATGCTC CTGGATGCCC GCCTGCAGAA AGCGGAGCAG
GAGGGTCAGG AGCTGGCGAA CCGCCTGGCT GCCGTCCGGC AACAACTCCA GGCAGCCCGC
TCCCGGCAAA TCCAGGCCGA GGCCCGCCTG GCGGCAGGAC GCCGGGACCT GAACCGTAGC
CTGCGGTTTT TCCAGGTTTA CGGTACCTCT CCTTTCATTC TGGCGGCTTT TTTCAGCAAT
GATCTGCCAG ATTTCTTTAT TCGCCTGGAA CTTTTAAAAT ACCTGGGCAA TCACTTTGTA
GGTATCGTGC GCTACAACCT GGCCCTATAC CGCCAGGCCC GGGAAGAAGG CTCCCTGGTG
GCAGCCAGGG AACAAGAACT CCGGCAGGCG CAAGCAACCC TCCTTGAAAG CGAGGAGCGC
TTGACAGCCC TAAGATTGAA ACGTGAAACT GACCTGGACA GCTTACGCCG GCAGAGTACT
ACCTGGTCCC AGGACCTGCT GGCCCTGGAA AAGGCCTGGT CCGGGGCTCT GCCGACACTG
TACTACCTGT TGCAGCAACT CCCGGCTTTA CCCTGGAAAA ACCTGAAACC CGATGCGGTG
AGCGTAGACC TTTCGCGGGG TGAGGTCCAG GCTATCTTCA GCCAGCGGAA TCTAAATGCC
ACCCTCCTGA CACCGGCGGA ACTACCGGGA GTAAGCCTGA TCCTTTCCGG GGAAGGTTTA
ACCATCCCCG GCCCGGATTT TCAAATTCGG GGCAGCCTGC AGGTAGCCGG TCCCCACCAG
CTTCTATTCA CCCCCACGGA GGTGACCTTT GCCGGCCTGC CTTTGAGCCC TGCCACCAGG
AACGAGCTCC TGCCCCGGGA GAAGCTGACC ATCGATTTGC CCCCGCCCGA CTACGGCCTG
CAGTTTAAAG AGATCAATTT CGCCCCGGGA CGAATGAGCC TGATCCTTAA AAAATAA
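
The gene sequence above is printed in blocks of ten bases, so whitespace has to be removed before any counting. As a rough sketch, the GC fraction of a blocked sequence could be computed as follows. `gc_fraction` is a hypothetical helper name, and only the first line of the gene sequence is used, so the result describes those 60 bases and not the gene-wide GC content field.

```python
def gc_fraction(blocked_seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    seq = "".join(blocked_seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied with its block spacing.
first_line = "ATGCAACGGC TCATAACCTT GCTGGCCATC TTTTTCTTGC TAATTTCCCC GGTTCCCCCG"
print(round(gc_fraction(first_line), 3))
```

Passing the full 1077 bp sequence to the same function gives the value to compare against the GC content field.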
 
Protein sequence
MQRLITLLAI FFLLISPVPP ASSTSITDTL KQRLLDNENQ ENRLLQEIML LDARLQKAEQ 
EGQELANRLA AVRQQLQAAR SRQIQAEARL AAGRRDLNRS LRFFQVYGTS PFILAAFFSN
DLPDFFIRLE LLKYLGNHFV GIVRYNLALY RQAREEGSLV AAREQELRQA QATLLESEER
LTALRLKRET DLDSLRRQST TWSQDLLALE KAWSGALPTL YYLLQQLPAL PWKNLKPDAV
SVDLSRGEVQ AIFSQRNLNA TLLTPAELPG VSLILSGEGL TIPGPDFQIR GSLQVAGPHQ
LLFTPTEVTF AGLPLSPATR NELLPREKLT IDLPPPDYGL QFKEINFAPG RMSLILKK
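
The protein sequence is the translation of the gene sequence under translation table 11. A minimal sketch of that step, assuming the sense-codon assignments of table 11 are the same as in the standard code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). `translate` is a hypothetical helper, and only the first line of the gene sequence is translated.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI layout); '*' is a stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(blocked_seq: str) -> str:
    """Translate a whitespace-blocked DNA string, stopping at the first stop codon."""
    seq = "".join(blocked_seq.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence, copied with its block spacing.
first_line = "ATGCAACGGC TCATAACCTT GCTGGCCATC TTTTTCTTGC TAATTTCCCC GGTTCCCCCG"
print(translate(first_line))  # MQRLITLLAIFFLLISPVPP
```

The output matches the first twenty residues of the protein sequence above.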