Gene Moth_1069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1069 
Symbol 
ID3833334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1099258 
End bp1100925 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content52% 
IMG OID637828997 
Producthypothetical protein 
Protein accessionYP_429926 
Protein GI83589917 
COG category[R] General function prediction only 
COG ID[COG0595] Predicted hydrolase of the metallo-beta-lactamase superfamily 
TIGRFAM ID[TIGR00649] conserved hypothetical protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000100194 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0563275 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAAA ATGAGCGTAA GGTTTCCTTG ATCCCCCTAG GTGGCCTCGG GGAAATCGGC 
AAGAACATGA TGGCGATCAG GTACGGAAAC AGCATACTGG TCATCGATTG TGGCTTGACC
TTTCCCGAGG ATGAATTGCT GGGTGTCGAT GTGGTCATCC CGGATTACAC CTACCTGCTA
GAAAACCGGC AAATGGTAAA GGGGATTATA GTCACCCACG GCCATGAAGA TCATATCGGG
GCCCTGCCCT ATGTTTTAAA GGATCTAAAT GTACCGGTTT ATGGAACCAA ACTAACCCTG
GCCCTGATTC AGGCTAAACT GAAGGAACAG GGTAACTTTA ACGGTGTCCG GCTGCAGCAG
GTGAAGCCCA GGGATACTTT AAAAATCGGC CCTTTCAGGG TTGAGTTTAT TCACGTCAGC
CATTCCATTG CCGATACTGT CGCCCTGGCT ATTCATACGC CGGTGGGCAC CATCGTCCAC
ACCAGCGATT TTAAAATCGA TTATACACCA ATCGACGGGG AAGTCTTTGA TTTCTATAAG
TTTGCCGAGC TGGGTGAAAA GGGCGTCCTG GTGCTAATGT CGGACAGCAC CAATGTTGAA
CGCCCGGGCT TTACCATGTC CGAACGCGTA GTCGGTGGGA CCTTTGATGA GGTGTTTCGC
CGGGCACGGG AACGGATAAT TATCGCCAGC TTCGCATCCA ATATCCACCG GGTCCAGCAG
ATAATATCTA CGGCTTACAA GTACAATCGC AAAGTAGCTG TGGTAGGCCG CAGCATGGTA
AACGTGGTCA ATATTGCCCA GGAGATTGGG TACCTGAATA TCCCGGAAGG TACTCTGGTG
GAGCTGAGCG AACTGGCGCA CTTGCCTAAA AACCAGACGG TAATCATATC CACCGGCAGC
CAGGGGGAGC CAATGTCGGC CCTGACCCGG ATTGCCCGGA ATGATCACCG CCAGATTGAA
ATTGTTCCAG GGGATACGGT GATTATTTCC GCCTTGCCCA TCCCGGGCAA TGAAAAACTG
GTGGCGCGAA CGGTAGACCA GCTGTTTAAA CAGGGTGCCG ATGTCTATCA TGAAGCCGTT
GAAGGGGTTC ACGTTTCCGG TCACGCCAGT CAGGAGGAAT TGAAACTGGT TCTCAGCCTG
GTCAAGCCCA AGTTTTTCGT CCCCGTTCAC GGCGAGTACC GCATGTTGAT TAAACACGCC
CGCCTGGCCG AAGAGCTGGG AATACCCCCG GAAAACATTT TTGTGGCTGA GAACGGCCAG
GTAATGGAGT TTACCAGGGA GGAAGGTAAC TTTAATGGCC GCGTCACCGC CGGTCGTCTC
CTGATTGACG GCCTGGGAGT GGGTGATGTG GGCAATATCG TCCTGCGGGA TCGCAAACAA
CTGGCCCAGG ACGGCCTGCT CATTGTGGTC CTTACCCTGA GTAAAGAAAC CGGAAGTGTA
GTCGCCGGGC CGGATATTAT CTCGCGGGGA TTTGTCTACG TACGCGAGAG TGAGGAACTG
CTGGATGAGG CCAAAGAAAG GGTCCGCCAG GCCCTTGATA AGTGTAGTGA ACGCAAGGTA
AATGACTGGT CAACCATTAA AGGCAATATT CGCGATAACC TGAGTAAATT CCTCTACGAG
AAAACCAGGC GGCGCCCCAT GATCCTGCCT ATTATTATGG AGGTGTAG
 
Protein sequence
MAENERKVSL IPLGGLGEIG KNMMAIRYGN SILVIDCGLT FPEDELLGVD VVIPDYTYLL 
ENRQMVKGII VTHGHEDHIG ALPYVLKDLN VPVYGTKLTL ALIQAKLKEQ GNFNGVRLQQ
VKPRDTLKIG PFRVEFIHVS HSIADTVALA IHTPVGTIVH TSDFKIDYTP IDGEVFDFYK
FAELGEKGVL VLMSDSTNVE RPGFTMSERV VGGTFDEVFR RARERIIIAS FASNIHRVQQ
IISTAYKYNR KVAVVGRSMV NVVNIAQEIG YLNIPEGTLV ELSELAHLPK NQTVIISTGS
QGEPMSALTR IARNDHRQIE IVPGDTVIIS ALPIPGNEKL VARTVDQLFK QGADVYHEAV
EGVHVSGHAS QEELKLVLSL VKPKFFVPVH GEYRMLIKHA RLAEELGIPP ENIFVAENGQ
VMEFTREEGN FNGRVTAGRL LIDGLGVGDV GNIVLRDRKQ LAQDGLLIVV LTLSKETGSV
VAGPDIISRG FVYVRESEEL LDEAKERVRQ ALDKCSERKV NDWSTIKGNI RDNLSKFLYE
KTRRRPMILP IIMEV