Gene Information |
Locus tag | Moth_2067 |
Symbol | |
ID | 3831098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2159458 |
End bp | 2160396 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637829995 |
Product | hypothetical protein |
Protein accession | YP_430905 |
Protein GI | 83590896 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3285] Predicted eukaryotic-type DNA primase |
TIGRFAM ID | [TIGR02776] DNA ligase D; [TIGR02778] DNA polymerase LigD, polymerase domain |
|
|
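The coordinate and length fields above are consistent with one another, which a small sketch can check. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming a complete CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(2159458, 2160396)
print(length, protein_length(length))  # 939 312
```

Both results match the Gene Length (939 bp) and Protein Length (312 aa) rows.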
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0552463 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCGGG AAATGACAAC AGGACAGCAG GTTAACGGGG AGCCCTATAT GCTGCCCCGG TTGCCGGGAC AGCAGCTCCG GCTGACCAAC CTGGACAAGG TCTTCTGGCC GGAGGGGCTG ACCAAGTTTG ATCTCGTCGA ATATTATGTC GACATGGCCC CCGGCATCCT ACCTTACCTG CGGGAACGTC CCCTGGTCCT GAAGCGCTAC CCGGACGGCA TTACAGGGGA GGCCTTTTAC CAGAAAGAGT GCCCTGCCTA TGCCCCAGAG TGGGTGGCGA CCCTGCCTGT CTATCACACC GATAGCGATA AAACCATCAA TTACGTTCTC TGCAATAACG AAGCAACCCT GGCCTGGCTG GCCAACCAGG GGTGCATCGA GGTCCATGCC TGGCTCTCCC GGGCCGGTCG CCTGGAATAC CCGGATATCG TTGTCATGGA CCTCGACCCT GCGGACGGCA CTACCCTTGT CGATGTGCTG GAAATCGCCC TCTTGGTCAA CCGGGCTTTA AAGGAACTCC ACCTCACCGG CTACCCCAAA AATTCAGGCG CCAGGGGCCT GCATATTTTC ATCCCCCTTT ATCCCCGCTG GACCTTCCGG GAAGTTACGG CTGCCATGGG ATACCTGGCG CATCTCATTG TGCAGGTTTA CCCCCGCAAA GCCACCACCG AGCACCTTAT CCACAGGCGC CGGGGCAAAG TCTACCTGGA TTACCTGCAA AATGTACAGG GGCGGTCCAT GACCTTTCCC TACAGCCTAC GGCCCCTGCC CGGGGCCCCG GTTTCCGCCC CCTTGACCTG GGAAGAAGTG GCGGCGAAAA AGATTTATCC CGGAGATTTC AATATCATCA GGCGCCGCCT GGAAGAATGG GGCGACTTGT ACCGGGAACT CCTGGAGCGC CCCAATGATT TAACACCCCT GTTAGAGCTG GCCATATAA
|
Protein sequence | MDREMTTGQQ VNGEPYMLPR LPGQQLRLTN LDKVFWPEGL TKFDLVEYYV DMAPGILPYL RERPLVLKRY PDGITGEAFY QKECPAYAPE WVATLPVYHT DSDKTINYVL CNNEATLAWL ANQGCIEVHA WLSRAGRLEY PDIVVMDLDP ADGTTLVDVL EIALLVNRAL KELHLTGYPK NSGARGLHIF IPLYPRWTFR EVTAAMGYLA HLIVQVYPRK ATTEHLIHRR RGKVYLDYLQ NVQGRSMTFP YSLRPLPGAP VSAPLTWEEV AAKKIYPGDF NIIRRRLEEW GDLYRELLER PNDLTPLLEL AI
|
| |
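The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a string could be cleaned, checked for GC content, and translated. Although the Strand row is "-", the Gene sequence as listed begins with ATG and ends with TAA, so the sketch treats it as already being in coding orientation. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; alternative start codons (which table 11 permits) are not handled here.

```python
BASES = "TCAG"
# Standard-code amino acids in TCAG x TCAG x TCAG codon order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the Gene sequence row.
print(translate("ATGGACCGGG AAATGACAAC AGGACAGCAG"))  # MDREMTTGQQ
```

The output matches the first block of the Protein sequence row. Passing the full Gene sequence to `gc_content` gives a value that can be compared with the GC content row.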