Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_2171 |
Symbol | |
ID | 8603508 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 2545806 |
End bp | 2547533 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | DNA repair protein RecN |
Protein accession | YP_003299775 |
Protein GI | 269126405 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00879264 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACGCTCG TGCGGCCGAT GGTCGAGGAG GTTCGGATCC AGGGCCTCGG CGTGATCGAC GAGGCGGTCC TGGAACTGTC CCCGGGGTTC AACGTGGTCA CCGGAGAGAC CGGGGCGGGC AAGACCATGG TGGTCACCAG CCTGGGCCTG CTGTTCGGCG GGCGCGCCGA CCCCCAGCGG GTGCGGCCCG GCGCCGGACG CGCCACCGTC GAGGGCCGGA TCGTGGTCGA CCCCGCCGGG CGGGTGGCCT CCCGGGTCGC CGAGGCCGGC GCCGAGCTGG ACGAAGACAC CCTGATCATC ACCCGGTCGG TGTCGGCCGA GGGCCGCTCG CGGGCCCATC TGGGCGGCCG GTCGGTGCCG GTCGGCCTGC TGATCGCGCT GGCCGACGAC CTGGTGGCCG TGCACGGCCA GTCCGACCAG CAGCGGCTGC TGCAGCCCGG CCGGCAGCGC GCCGCGCTGG ACCGCTACGC CGGGCGGGAG CTCGCCGCTC CGCTGCGCGC CTACACCGCC GCCTACCAGC GGCACCGCAA GGTCACCGCG CAGCTGGAGG AGCTGACCGC CCAGGCCCGC GAACGCGCCG CCGAGGCCGA GATGCTGCGC TTTGGCCTGG AGGAGATCGA AAAGGTCGAC CCCAAGGCCG GTGAGGACGC CGAGCTGGCC GCCGAGGCCG AGCGGCTGGG CAACGCCGAC GCCCTGCGCA CCGCCGCCAC CACCGCCCAC CAGGCCCTGC TGGGCGACCC GGACGCCGCC GCGCTGACCC CCGCCGACGT GATGACCCTG CTGGGGACGG CGCGCAACGC CCTGGAGGGC GTCCGCGACC ACGACCCGGC GCTGGGGCAG CTGGCCGACC GGCTGGCCGA GGCCGGCTAC CTGATCTCCG ATGCCGCCAC CGAGCTGGCC GCCTACGCCG AGTCGGTGGA GGCCGACCCG ATCCGCCTGG CGGCCGTGCA GGAACGGCGC GCCGAGCTGA ACGCGCTGAC CCGCAAATAC GGCCCCACCC TGGAGGACGT GCTGGCCTGG GCGCAGCGGT CGGCGGCGCG GCTGGCGGAG CTGGAGGGCG ACGATGAGCG CATCGAGCGG CTGCGCGCCG AGCAGGCCGA GCTGGAGCAG CGGCTGGGGG AGCTGGCCGC CGAGCTGACG GCGGTGCGCT CCCGCGCCGC CGAGCGTTTT TCGGCCGCGG TCACCGAGGA GCTGACCGCG CTGGCCATGC CGCACGCCCG GGTGGTGGTG AACGTCACCG CCACCGAGGA GTTCGGCCCG CACGGCGCCG ACGAGGTGGA GATCCTGCTG GCCCCCCACC CGGGGGCGCC GCCGCTGCCG CTGCACAAGG GCGCCTCCGG CGGCGAGCTG TCGCGGGTGA TGCTGGCGAT CGAGGTGGTC TTCGCCGGCG CCGACCCGGT GCCCACGTTC GTCTTCGACG AGGTCGACGC CGGGGTGGGC GGCAAGGCCG CGGTCGAGAT CGGCCGCCGC CTGGCCCGCC TGGCCCGCAC CTCCCAGGTG ATCGTGGTCA CCCACCTGCC GCAGGTCGCC GCGTTCGCCG ACCAGCACCT GCTGGTCGCC AAGTCCGACG ACGGCTCGGT CACCCGCAGC GGCGTGACCG TGCTGGACCA CGAGGGCCGC GTCAGGGAGC TGTCGCGGAT GCTGGCCGGG CTGGAGGACT CCGAGCTCGG CCGCGCCCAC GCCGAGGAGC TGCTGGAGCT GGCCGCCCGG GAGCGGCAGG CCACCTGA
|
Protein sequence | MTLVRPMVEE VRIQGLGVID EAVLELSPGF NVVTGETGAG KTMVVTSLGL LFGGRADPQR VRPGAGRATV EGRIVVDPAG RVASRVAEAG AELDEDTLII TRSVSAEGRS RAHLGGRSVP VGLLIALADD LVAVHGQSDQ QRLLQPGRQR AALDRYAGRE LAAPLRAYTA AYQRHRKVTA QLEELTAQAR ERAAEAEMLR FGLEEIEKVD PKAGEDAELA AEAERLGNAD ALRTAATTAH QALLGDPDAA ALTPADVMTL LGTARNALEG VRDHDPALGQ LADRLAEAGY LISDAATELA AYAESVEADP IRLAAVQERR AELNALTRKY GPTLEDVLAW AQRSAARLAE LEGDDERIER LRAEQAELEQ RLGELAAELT AVRSRAAERF SAAVTEELTA LAMPHARVVV NVTATEEFGP HGADEVEILL APHPGAPPLP LHKGASGGEL SRVMLAIEVV FAGADPVPTF VFDEVDAGVG GKAAVEIGRR LARLARTSQV IVVTHLPQVA AFADQHLLVA KSDDGSVTRS GVTVLDHEGR VRELSRMLAG LEDSELGRAH AEELLELAAR ERQAT
|
| |