Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1507 |
Symbol | |
ID | 3831734 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1551935 |
End bp | 1553611 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637829439 |
Product | DNA repair protein RecN |
Protein accession | YP_430359 |
Protein GI | 83590350 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0497] ATPase involved in DNA repair |
TIGRFAM ID | [TIGR00634] DNA repair protein RecN |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTCCAGG AACTGCAAAT CGAAAACCTG GCGCTAATCG AGTCCCTGCA GCTCAACCTG GAGCCGGGGC TGACGGTTCT TACCGGGGAA ACGGGTGCCG GCAAGTCCAT TATCGTCGAC GCCGTGGGAC TGCTGGTAGG GGCCCGAGCC TCGGGCGAGT ATATCCGCGC CGGGGCGGAT AAGGCGGTAG TCAGGGGCCT CTTCCAGGTA GCCGGCCTGC CGGGCCTTAA GGAAACCCTG TCTGCCATGG GGGTGGCTGC GGAGGACGAC GGGACCCTGC TCCTTTGCCG GGAGATCAGC CGCAGCGGTC GCCACAGCTG CCGGGTAAAC GGGCGCAGCC TGACTCTGGG TATGTACCAG AGAATTGGCC AGATGCTGGT AGATATCCAC GGCCAGCACG CCTACCAGTC CCTCCTGCGG CCGGCTTATC AAATGGATAT GCTGGATAGC CTGGCCGGCC TGATGGAGCT CAGGCAGGAG GTCGGAGAGT TATATACCAG GTGGCAGGAT TTAAAAAAGG AACTGGAGGA ACTCTGCGGC GACAGGGGGG AGCGGGAACG GCAACGTGAT CTCTGGCAAT ACCAGCTCCA GGAAATAGGC GCCGCCAACC TTACCCCCGG GGAAGAGGAA GAATTGAGCC GCCAGCGGGA AATCCTCAAT AACGGGGAAA GGCTGGCCAG GGGTGCGGCG GTCGTTTATG CTGCCCTCTT TGAGGAAGAA GGCAGATCGG CCTACGATCA GCTCAGCCGG GCCTTGGCGG AACTTGAGGC CCTGGCAGCC ATTGATCCCG GCTTGCAGAC CTGGCAGGGC ACCCTGGAGG GGATAACGGC CCAGGTCGAA GAGATAGCCC GGAGCGTTCG CCGCTATGGG GAAGGGCTGG AGTACGATCC TGCCCGCCTG CAGGAGATTG AAAACCGTCT GGAACTGATA AAAGATTTAA AACGCAAGTA CGGCGGCAGT ATCGAGGCCA TTTTACAGTA CCAGGCGGAA ACGGCGGCCG CCCTGGAAAG GCTTGAGCAA ATGGCGGGAC AGGCGGCCGC CCTGGAAAAG GAGATCGAAC TGGCCGCGGA AAAATACAGG GAAAAAGCCT TGCTTTTACG CCGGCGGCGT ATGGAAGCAG CGCAAAAAAT CGAAAAAGAG CTGCTGAAAG TCCTTAAGGA TCTGGCCATG CCGGCAGCCA GGATCAGGGT GGACTGCAGC GAAGCACCAC AGCCAGGCCC AGCGGGTATG GACAACATTA CATTTCTTTT TCAACCCAAT CCCGGAGAGG GCAGCCGGCC CCTGGCCCAG ATAGCTTCCG GAGGGGAGAT GGCCCGGGTA ATGCTAGCCT TAAAGAGTAT TCTGGCCGAT GTTGACGCCG TGCCAACATT GATATTTGAT GAGATTGACG CCGGCGTTGG CGGTGCTGCG GCCCGGGCGG TGGCCAGGAC CCTGGCGGCC ATAGGCCGCC GGCGACAGGT CCTTTGCATT ACCCACTCGG CCCAGCTGGC CAGTTTTGCC GGGCAGCACT TCCGGGTTAG CAAGGAGGTG CGGGAAGGCC GCACCTATAC CCGGGTCGAT GTTCTCCGGG GTGAAGAGCG GGTTGATGAG CTGGCCCGGT TATTATCCGG TTCCGCCAGC AGTGTGGCGC GAGAACATGC CGCTGCCCTT TTACAACAAT CAGGGTCCAT AAAATGA
|
Protein sequence | MLQELQIENL ALIESLQLNL EPGLTVLTGE TGAGKSIIVD AVGLLVGARA SGEYIRAGAD KAVVRGLFQV AGLPGLKETL SAMGVAAEDD GTLLLCREIS RSGRHSCRVN GRSLTLGMYQ RIGQMLVDIH GQHAYQSLLR PAYQMDMLDS LAGLMELRQE VGELYTRWQD LKKELEELCG DRGERERQRD LWQYQLQEIG AANLTPGEEE ELSRQREILN NGERLARGAA VVYAALFEEE GRSAYDQLSR ALAELEALAA IDPGLQTWQG TLEGITAQVE EIARSVRRYG EGLEYDPARL QEIENRLELI KDLKRKYGGS IEAILQYQAE TAAALERLEQ MAGQAAALEK EIELAAEKYR EKALLLRRRR MEAAQKIEKE LLKVLKDLAM PAARIRVDCS EAPQPGPAGM DNITFLFQPN PGEGSRPLAQ IASGGEMARV MLALKSILAD VDAVPTLIFD EIDAGVGGAA ARAVARTLAA IGRRRQVLCI THSAQLASFA GQHFRVSKEV REGRTYTRVD VLRGEERVDE LARLLSGSAS SVAREHAAAL LQQSGSIK
|
| |