Gene Information |
Locus tag | Moth_1069 |
Symbol | |
ID | 3833334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1099258 |
End bp | 1100925 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637828997 |
Product | hypothetical protein |
Protein accession | YP_429926 |
Protein GI | 83589917 |
COG category | [R] General function prediction only |
COG ID | [COG0595] Predicted hydrolase of the metallo-beta-lactamase superfamily |
TIGRFAM ID | [TIGR00649] conserved hypothetical protein |
| |
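The coordinates and lengths in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1099258
END_BP = 1100925


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1668
print(protein_length(length_bp))  # 555
```

Both results agree with the reported Gene Length (1668 bp) and Protein Length (555 aa).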
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000100194 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0563275 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGAAA ATGAGCGTAA GGTTTCCTTG ATCCCCCTAG GTGGCCTCGG GGAAATCGGC AAGAACATGA TGGCGATCAG GTACGGAAAC AGCATACTGG TCATCGATTG TGGCTTGACC TTTCCCGAGG ATGAATTGCT GGGTGTCGAT GTGGTCATCC CGGATTACAC CTACCTGCTA GAAAACCGGC AAATGGTAAA GGGGATTATA GTCACCCACG GCCATGAAGA TCATATCGGG GCCCTGCCCT ATGTTTTAAA GGATCTAAAT GTACCGGTTT ATGGAACCAA ACTAACCCTG GCCCTGATTC AGGCTAAACT GAAGGAACAG GGTAACTTTA ACGGTGTCCG GCTGCAGCAG GTGAAGCCCA GGGATACTTT AAAAATCGGC CCTTTCAGGG TTGAGTTTAT TCACGTCAGC CATTCCATTG CCGATACTGT CGCCCTGGCT ATTCATACGC CGGTGGGCAC CATCGTCCAC ACCAGCGATT TTAAAATCGA TTATACACCA ATCGACGGGG AAGTCTTTGA TTTCTATAAG TTTGCCGAGC TGGGTGAAAA GGGCGTCCTG GTGCTAATGT CGGACAGCAC CAATGTTGAA CGCCCGGGCT TTACCATGTC CGAACGCGTA GTCGGTGGGA CCTTTGATGA GGTGTTTCGC CGGGCACGGG AACGGATAAT TATCGCCAGC TTCGCATCCA ATATCCACCG GGTCCAGCAG ATAATATCTA CGGCTTACAA GTACAATCGC AAAGTAGCTG TGGTAGGCCG CAGCATGGTA AACGTGGTCA ATATTGCCCA GGAGATTGGG TACCTGAATA TCCCGGAAGG TACTCTGGTG GAGCTGAGCG AACTGGCGCA CTTGCCTAAA AACCAGACGG TAATCATATC CACCGGCAGC CAGGGGGAGC CAATGTCGGC CCTGACCCGG ATTGCCCGGA ATGATCACCG CCAGATTGAA ATTGTTCCAG GGGATACGGT GATTATTTCC GCCTTGCCCA TCCCGGGCAA TGAAAAACTG GTGGCGCGAA CGGTAGACCA GCTGTTTAAA CAGGGTGCCG ATGTCTATCA TGAAGCCGTT GAAGGGGTTC ACGTTTCCGG TCACGCCAGT CAGGAGGAAT TGAAACTGGT TCTCAGCCTG GTCAAGCCCA AGTTTTTCGT CCCCGTTCAC GGCGAGTACC GCATGTTGAT TAAACACGCC CGCCTGGCCG AAGAGCTGGG AATACCCCCG GAAAACATTT TTGTGGCTGA GAACGGCCAG GTAATGGAGT TTACCAGGGA GGAAGGTAAC TTTAATGGCC GCGTCACCGC CGGTCGTCTC CTGATTGACG GCCTGGGAGT GGGTGATGTG GGCAATATCG TCCTGCGGGA TCGCAAACAA CTGGCCCAGG ACGGCCTGCT CATTGTGGTC CTTACCCTGA GTAAAGAAAC CGGAAGTGTA GTCGCCGGGC CGGATATTAT CTCGCGGGGA TTTGTCTACG TACGCGAGAG TGAGGAACTG CTGGATGAGG CCAAAGAAAG GGTCCGCCAG GCCCTTGATA AGTGTAGTGA ACGCAAGGTA AATGACTGGT CAACCATTAA AGGCAATATT CGCGATAACC TGAGTAAATT CCTCTACGAG AAAACCAGGC GGCGCCCCAT GATCCTGCCT ATTATTATGG AGGTGTAG
Protein sequence | MAENERKVSL IPLGGLGEIG KNMMAIRYGN SILVIDCGLT FPEDELLGVD VVIPDYTYLL ENRQMVKGII VTHGHEDHIG ALPYVLKDLN VPVYGTKLTL ALIQAKLKEQ GNFNGVRLQQ VKPRDTLKIG PFRVEFIHVS HSIADTVALA IHTPVGTIVH TSDFKIDYTP IDGEVFDFYK FAELGEKGVL VLMSDSTNVE RPGFTMSERV VGGTFDEVFR RARERIIIAS FASNIHRVQQ IISTAYKYNR KVAVVGRSMV NVVNIAQEIG YLNIPEGTLV ELSELAHLPK NQTVIISTGS QGEPMSALTR IARNDHRQIE IVPGDTVIIS ALPIPGNEKL VARTVDQLFK QGADVYHEAV EGVHVSGHAS QEELKLVLSL VKPKFFVPVH GEYRMLIKHA RLAEELGIPP ENIFVAENGQ VMEFTREEGN FNGRVTAGRL LIDGLGVGDV GNIVLRDRKQ LAQDGLLIVV LTLSKETGSV VAGPDIISRG FVYVRESEEL LDEAKERVRQ ALDKCSERKV NDWSTIKGNI RDNLSKFLYE KTRRRPMILP IIMEV
| |
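The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of standard-library Python. This is a minimal illustration, not the database's own pipeline: `clean`, `gc_content`, and `translate` are hypothetical helper names, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons, which this sketch does not handle).

```python
from itertools import product

BASES = "TCAG"
# Standard amino-acid assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGGCCGAAA ATGAGCGTAA G"))  # MAENERK
print(translate("GAGGTGTAG"))                # EV
```

The two fragments reproduce the first seven residues (MAENERK) and the last two residues (EV) of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` would give values to compare against the 555 aa protein and the 52% GC content reported above.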