Gene Information |
Locus tag | Moth_2082 |
Symbol | |
ID | 3831832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2173097 |
End bp | 2174017 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637830008 |
Product | hypothetical protein |
Protein accession | YP_430918 |
Protein GI | 83590909 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3285] Predicted eukaryotic-type DNA primase |
TIGRFAM ID | [TIGR02776] DNA ligase D; [TIGR02778] DNA polymerase LigD, polymerase domain |
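The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the stop codon. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2173097
end_bp = 2174017

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 921 306
```

Under those assumptions the result agrees with the Gene Length (921 bp) and Protein Length (306 aa) rows.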
|
|
Plasmid Coverage information |
Num covering plasmid clones | 68 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCCAGG GCCGGCAATA TATACGGTCC CAATTGCTGA AGCGACAGCT CCACTTGACC AACCTGGACA AGGTTTTCTG GCCGGAAGGT CTGACCAAGT TTGATCTTAT CAAGTATTAT GTCGACATGG CTCCCTTTCT CTTGCCCTAC CTCCGGGATC GCCCCCTGGT CCTTAAGCGC TACCCGGATG GCATAACAGG GGAGGCCTTT TACCAGAAAG AGTGCCCTGC CTATGCCCCG GACTGGGTTA CGACCCTGCC TGTCTATCAT GCTGATAGCA GCAAGACTAT TAATTATGTT CTCTGTAATA ATGAAGAAAC CCTGATCTGG CTGGCCAATC AGGGGTGTAT CGAGGTCCAT GCCTGGCTCT CCAGGGCCGG CCGCCTGGAA TACCCCGATA TCGCCGTCAT GGATCTGGAC CCAAGTGCGG GGGCAACTTT TAAAGATGTC TTGGATATCG CCCTCCTGGT CCACCAGGCT TTAAAGGAGT TTAACCTCAG CGGCTATCCT AAAACCTCCG GTGCTACTGG GTTGCATATC TTTATCCCCC TTGAACCTCG CTGGACCTTT CACCAGGTGA CAGCCGCCAT GGGGTACCTG GCGCGGCTGG TCGCCGGGGT TTACCCCCGC AAGGCCACCA CCGAACGGTC GATCCCGAAG CGTAAAGATC GGGTCTACCT GGACTACCTG CAGAACGTCC GCGGACGGTC CATGGCCTTC CCCTACAGCC TGCGACCCTT ACCCGGGGCG CCGGTTTCAA CGCCCCTGAC CTGGGAGGAG GTAAAGAGGG GGATGTTCAG CCCCAAAGAC TTCAACATCC ACACCGCCCG GGAGCGCCTG CAGGCGTATG GCGACCTTTA TCGGGGTTTT CTGGCGCAAC CAAACGATCT GGAACCGCTG CTTAAACTGG CCGGGGTTTA A
|
Protein sequence | MGQGRQYIRS QLLKRQLHLT NLDKVFWPEG LTKFDLIKYY VDMAPFLLPY LRDRPLVLKR YPDGITGEAF YQKECPAYAP DWVTTLPVYH ADSSKTINYV LCNNEETLIW LANQGCIEVH AWLSRAGRLE YPDIAVMDLD PSAGATFKDV LDIALLVHQA LKEFNLSGYP KTSGATGLHI FIPLEPRWTF HQVTAAMGYL ARLVAGVYPR KATTERSIPK RKDRVYLDYL QNVRGRSMAF PYSLRPLPGA PVSTPLTWEE VKRGMFSPKD FNIHTARERL QAYGDLYRGF LAQPNDLEPL LKLAGV
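The gene sequence is listed in blocks of ten bases. It begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. As a minimal sketch of how the two sequence rows relate, the code below strips the block spacing, computes GC content, and translates codons with the standard codon assignments, which translation table 11 shares with table 1 (the tables differ in permitted start codons). The helper names clean_sequence, gc_content and translate are hypothetical, and only the first three blocks of the gene are used as input.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling through TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three ten-base blocks of the gene sequence row above.
excerpt = clean_sequence("ATGGGCCAGG GCCGGCAATA TATACGGTCC")
print(translate(excerpt))  # MGQGRQYIRS
```

The ten residues produced match the first block of the protein sequence row. Running the same functions on the full gene sequence is one way to check it against the listed protein and the GC content row.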