Gene Information |
Locus tag | Tcur_3663 |
Symbol | |
ID | 8605014 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 4211260 |
End bp | 4213020 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | endonuclease/exonuclease/phosphatase family protein |
Protein accession | YP_003301234 |
Protein GI | 269127864 |
COG category | |
COG ID | |
TIGRFAM ID | |
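The coordinate and length fields above are consistent with one another. The sketch below checks that arithmetic, assuming the start and end coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon occupies one codon but encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(4211260, 4213020)
print(length)                     # 1761
print(protein_length_aa(length))  # 586
```

Both values match the Gene Length and Protein Length rows of the record.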
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00731274 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCGA CGACGGCGCA GGCCGCACCG CGACCGGCCC GGCAGATCCA TGACATCCAG GGGCGGGGGC ACATCTCCCC CTTCCTGGGC AAGAAGGTCA CCCGGATTCC GGGAGTGGTG ACCGCCACCG CCGAGAACGG CTTTTGGATG CAAGGGACCT CACCCGATGA GGACCCCGGC ACCTCCGAGG GCATCTTCGT GTTCACCCGC ACCCGTCCCC GGGTCACCGT GGGGGACGAG GTCCAGGTGA GCGGCACGGT CAACGAGTTC CGCCCCGGCG GGCCGAAGTC GGCCAACCTC TCCCGGACCG AGATCGACTC CTCTTCGATC ACCGTGCGGC GGCACGGGGC CCCGCTGCCG GCCCCGACGG TCCTGGGGCC GGGCGGGCGC CGGCCCCCCG GCGAGGTCAT CGACGACGAC GTGCGGGGAG ACGTCGAAAA GTCCGGCAAG TACCAGCCCG GCCGGGACGG GCTGGACTTC TATGAATCGC TGGAGGGCAT GCTCGTCGCC GTCGACGACG CGGTGGCGGT CGGCCCCCGC TCGGAGTTCG GCGAGATCCC CGTCCTGCCC GCCGGCGGCC AGGGCGCCGG GATGCGCAGC GCACGCGGGG GCATCGTGGC CCGCGCCAAG GACGCCAACC CCGAACGGAT CATCCTCGAC GACGCCCTGC AGCCGCTGCC GCCGATGAAC GTGGCCGACC GGCTGCCGGG CCGCACGCTC GGCGTGCTCG ACTACAGCTA CGGCGACTAC AAGCTGCTGG TGCTGTCGGT CCCCGCCGTC CGCGGCGGCG GCCTGACGCG CACCCGCACC CGCCCGCAGC GGCCCGACGA GCTGGCGGTG GCCACGGTCG CGCTGGACGG TCTGGACCCC GGCGACCCGC CGCAGCGCTT CGCGGCGATC GCCGACCAGA TCGTCACCGC CCTGGCCGCC CCCGACCTGG TGACGGTGAC CGGCCTGCAG GACAACAGCG GCTCCGACGA CGACGGGACG GTGGCCGCCG ACCAGACCGT CGCGCAACTG CTGGCGGCGA TCTCGGCGGC GGGCGGCCCC GGCTACGACT GGCGTTCGAT CGACCCGCTC GACGGCGCGG ACGGCGGCGA GAGCGGCGGC AACATCCGGA CCGGCTTCCT GTTCCGCACC GACCGGGGGC TGCGCTTTGT GGACCGTCCC GGCGGCACCG CCACCGCCCC GGTCGAGGCG GTCCCCGACG GCGCCCGCGA GGCGGCGCTG TCGATCAGCC CCGGCCGCGT CGCGCCGCAG CACCCGGCCT GGCGGCAGGC GCGCAAGCCG CTGGCCGGCG AGTTCCGGTG GCGGGGACGG CGGCTGATCG CCATCGCCAA CCACTGGACG GCCCGCGGCG CCGACGACTC CCTGTATGCG CGCTTCCAGC CGCCGCGCAG GCCCAGCGAG GCCTCCCACG CCGAGCAGGC GAAGGTGCTG GCCGGTTTCG TCCGCAAACT GCGCCGGGCG GGCCCCGGCA CCGGCGTCAT CGTGACCGGC GGGCTGAACT CCTACGGCCA CTCCCGCGCC CTGCGCACGC TGACCGACCG CACCGGGCTG CGCGACCTGG TGGCGGAGCT GCCCCGCAAA GAGCGCTACA CCGTGATCTC CGCCGGCAAC GGCCATGACC CCGACCACAT CCTGGTGAAC CGGGCGCTGG CCGAGCTGCC CCATGAGATC GAGGCCGTCC GCCTCAACGC CGAGTTCGCC GACCGGGTCA GCGAGCACGA CCCGCTGGTG CTGCGGCTCC GCACCGGCTG A
Protein sequence | MSATTAQAAP RPARQIHDIQ GRGHISPFLG KKVTRIPGVV TATAENGFWM QGTSPDEDPG TSEGIFVFTR TRPRVTVGDE VQVSGTVNEF RPGGPKSANL SRTEIDSSSI TVRRHGAPLP APTVLGPGGR RPPGEVIDDD VRGDVEKSGK YQPGRDGLDF YESLEGMLVA VDDAVAVGPR SEFGEIPVLP AGGQGAGMRS ARGGIVARAK DANPERIILD DALQPLPPMN VADRLPGRTL GVLDYSYGDY KLLVLSVPAV RGGGLTRTRT RPQRPDELAV ATVALDGLDP GDPPQRFAAI ADQIVTALAA PDLVTVTGLQ DNSGSDDDGT VAADQTVAQL LAAISAAGGP GYDWRSIDPL DGADGGESGG NIRTGFLFRT DRGLRFVDRP GGTATAPVEA VPDGAREAAL SISPGRVAPQ HPAWRQARKP LAGEFRWRGR RLIAIANHWT ARGADDSLYA RFQPPRRPSE ASHAEQAKVL AGFVRKLRRA GPGTGVIVTG GLNSYGHSRA LRTLTDRTGL RDLVAELPRK ERYTVISAGN GHDPDHILVN RALAELPHEI EAVRLNAEFA DRVSEHDPLV LRLRTG
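The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any calculation. The following is a minimal sketch of how the GC content reported in the record could be recomputed and how the reading frame could be sanity-checked. The helper names are hypothetical, and the start-codon check accepts only ATG, which is a simplification: translation table 11 also permits alternative start codons.

```python
def clean_sequence(blocked: str) -> str:
    """Join the space-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def has_start_and_stop(blocked: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """True if the sequence begins with ATG and ends with a stop codon.

    Assumption: only ATG is treated as a start codon here.
    """
    seq = clean_sequence(blocked)
    return seq.startswith("ATG") and seq[-3:] in stops


# Toy input in the same blocked layout as the record (not the real gene).
toy = "ATGGCGCCCT GA"
print(clean_sequence(toy))
print(round(gc_fraction(toy), 3))
print(has_start_and_stop(toy))
```

Passing the full Gene sequence field to `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the GC content row.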