Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1515 |
Symbol | |
ID | 3831980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1560632 |
End bp | 1561840 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637829447 |
Product | exodeoxyribonuclease VII large subunit |
Protein accession | YP_430367 |
Protein GI | 83590358 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1570] Exonuclease VII, large subunit |
TIGRFAM ID | [TIGR00237] exodeoxyribonuclease VII, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 51 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGGAGG TTAAAGTCCT GGCGGTGAGG GAACTCACCG CCTATTTGCA GCGCCTCCTG GCCAACGACG GCCGCCTGGC CAATGTCTGG GTAAAGGGCG AGATCTCCAA CCTGCGCTCT CCAACCTCGG GCCATCTCTA TTTCAGTCTC AAGGACCAGG CTGCGACCCT GCGCTGCGTG ATGTTTCAGG GCCGTAGCCG GGGCCTATCC CTGGGCCTGC GCGACGGCCT GGAGGTAATC GCCAGGGGTC AGGTGGCCAT TTACCCCCGG GACGGCGTCT ACCAGCTCTA TGTAGCCGAA ATCTTCCCGG CAGGGACCGG CCTGGCCAGC CTGGCCCTCC AGGAGTTGAC CGCCCGCCTG GAACGGGAGG GTCTCTTTGC CGCCGACCGC AAGCGGCCCC TGCCCCTGTT GCCACGCCGG GTGGGCCTGG TGACCTCCCC AACCGGGGCC GCCCTACGGG ATATGATAAC CATCAGCCGG CGCCGCTTCC CCGGGATTGA ACTGATCCTG GCGCCGGCCA GGGTCCAGGG GGAGGTGGCG CCCCGCCAGC TGGCCCTGGC CCTGGAACTT CTGGCAAAAA GGGGAGGCGT TGATGTCATT ATTATCGGCC GCGGTGGCGG CTCGGCCGAA GATTTAAGCG CCTTCAACAC CGAACTGGTG GCCAGGGCCA TCTATGCCTG TCCGGTACCC GTCATTGCTG CCGTAGGCCA TGAGACGGAT CTCACCCTGG CCGACAGGGT AGCCGACCGG CGGGCGCCTA CTCCTTCGGC GGCGGCGGAA ATGGCCGTAC CGGTGCGGGC AGAACTGGAA CAGCGCCTGA AGAGCCTGGC GGAGCGGGCC CGCCGCGGTA TGGAACACCG CCTGGAGTTA GCCCGGGCTC GGCTGGAGCG CCTGACCAAA AGCAGCGGCC TGGATCGGCC CCGCCAGGAG CTATATTACC GCCAGCAGTA CGTGGATGGC CTGGAACAGC GCCTGCTGGC CTCCTGGGAG CGTCGTTCCC GGGAGCGGGA GCAGGGTCTT AATCTCCTGG CTGCCCGCCT GGAAGCCGCC AGCCCCCTGG CCATCCTGGC GCGTGGCTAC GCCGTTTGCC GCCGTCCCGG CGACGGAGCG CCCCTGAAAT CCAGCCGGGA AGTGCTCCCC GGGGAGAAGG TGGAGGTCAT CCTGAAGGAA GGCCTTCTCC GGTGCCAGGT CGAAGAAGTC GGCGGATAG
|
Protein sequence | MPEVKVLAVR ELTAYLQRLL ANDGRLANVW VKGEISNLRS PTSGHLYFSL KDQAATLRCV MFQGRSRGLS LGLRDGLEVI ARGQVAIYPR DGVYQLYVAE IFPAGTGLAS LALQELTARL EREGLFAADR KRPLPLLPRR VGLVTSPTGA ALRDMITISR RRFPGIELIL APARVQGEVA PRQLALALEL LAKRGGVDVI IIGRGGGSAE DLSAFNTELV ARAIYACPVP VIAAVGHETD LTLADRVADR RAPTPSAAAE MAVPVRAELE QRLKSLAERA RRGMEHRLEL ARARLERLTK SSGLDRPRQE LYYRQQYVDG LEQRLLASWE RRSREREQGL NLLAARLEAA SPLAILARGY AVCRRPGDGA PLKSSREVLP GEKVEVILKE GLLRCQVEEV GG
|
| |