Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_1894 |
Symbol | |
ID | 8603221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 2236828 |
End bp | 2238465 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | X-Pro dipeptidyl-peptidase domain protein |
Protein accession | YP_003299502 |
Protein GI | 269126132 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00645212 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGGGGT TGGGGCTGAT GCGCCGGCTG GGGGCCGGCG CGCTGCGGCT GCCGCACGGC CCGTACCGGG TGAGGGTGCA CCGGGACCTG GAGGTCCCCG CGCCGGACGG GGTGACGCTG CTGGCCGACC GGTACGCACC GCAGGGGCTG GACCGTCCAC CGGTCATCTT GGTGCGCTCC CCGTATGGGC GGCGCGGGCC GTTCGGGCTG ATGTGCGGGT ACGTGTTCGC CACCCACGGG TTCCAGACGG TGGTGCAGAG CGTCCGCGGC GGGTTCGGCT CCGGCGGGGC GTTCGAGCCG CTGGACGAGC GCGAGGACGG GCTGGCGACG GTGGCCTGGC TGAAGGCCCA GCCGTGGTTC GGCGGCGCGT TCGCCATGCA CGGCCCCAGC TATCTGGGCT ATGTGCAGTG GGCGGTGGCC GCGGACGCCG GGCCGGAGCT GAAGGCCTTG TCGATGCAGG TGACGGCCTC GCAGTTCCGG GATGCGATCA ATCTGGGCGG CGGGTTCGCC CTGGAGTCCA CGCTGACCTG GGTGGATCTG ACCACCCGCA TCCAGCGGCC GCTGTCGGGG CTGGTCGCCG GGGTGCTCTC GCCGCGCCGG GCCCGGCGGG CGGCATTGTC GGGCCGTCCG CTGGCCGAGC TGGACGCGCT GGCCACCGGG GCGCCGGTGC GGTTCTTCCA GGACCTGCTG GCCAACGGCC CCGACACCCC GTTCTGGCAC AAGCGGGACT TCAGCCGGGC GGTGGGGCGG GTGCAGGCCC CGGTGAACAT GACCGGCGGC TGGTATGACG TGTTCCTGCC CTGGCAGCTG GAGGACTACG CGGCGCTGCG GGCGGCCGGA CGCAACCCGC ACCTGACCAT CGGCCCGTGG TGGCATTCGG ATCCACGGCT GACGCGCCAC TCCATGCGCG ACTCGCTGCA GTGGTTCCGC GCCCACCTGC TCGGCGACCG CTCCGAGATG CGCCGGGACC CGGTGCGGCT TTACATCACC GGCTCCGGGG AGTGGCGGGA CTTCCCCCAC TGGCCGGTGC CCGGGATCGA CGAGCAGCGC TGGCACCTGC AGCCCGGCGG CGGGCTGTCC CCCGACGGGC CGCCGCAGAG CCCGCCCGAC TCCTACCGGT ACGACCCGAT GCACCCCACC CCGTGCATCA GCGGCCCCTC GCTGCTGGGC GACTGCTCCC CCGCCGACCA GCGCCGCCTG GAGGCCCGCC GCGACGTGCT GGTCTACACC TCCCCGCCGC TTTCCAAGGG GCTGGAGATC ATCGGGCCGA TCCGCGCGGA ACTGCATGTG CGCTCCGACC GGGCGCACGC CGACTTCGTG GTGCGGCTGT GCGACGTGGC CCCCGACGGG GTGTCGCTGA ACCTGTGCGA GGGCGTGCGC CGGGTGCGCC CGGGGGTGGG CGAGACCGAC GGCGAGGGCG TCTGCAAGAT CACGGTGGAT CTGTGGCCGG CCGGGCACCG GTTCCGCCCC GGGCACCGGC TGCGCGTGCA CGTGGCCGGC GGCGCCTACC CCCGGGTGGC CCGCAATTCC GGCACCGGCG AGCCGCTGGG CGCGGAGACG GCCTGGCTGG CCGCCCGCCA CGAGGTCTTC CACGACCCCG GCCGCCCTTC GGCGATCATC CTGCCGGTGC TGCGCTGA
|
Protein sequence | MEGLGLMRRL GAGALRLPHG PYRVRVHRDL EVPAPDGVTL LADRYAPQGL DRPPVILVRS PYGRRGPFGL MCGYVFATHG FQTVVQSVRG GFGSGGAFEP LDEREDGLAT VAWLKAQPWF GGAFAMHGPS YLGYVQWAVA ADAGPELKAL SMQVTASQFR DAINLGGGFA LESTLTWVDL TTRIQRPLSG LVAGVLSPRR ARRAALSGRP LAELDALATG APVRFFQDLL ANGPDTPFWH KRDFSRAVGR VQAPVNMTGG WYDVFLPWQL EDYAALRAAG RNPHLTIGPW WHSDPRLTRH SMRDSLQWFR AHLLGDRSEM RRDPVRLYIT GSGEWRDFPH WPVPGIDEQR WHLQPGGGLS PDGPPQSPPD SYRYDPMHPT PCISGPSLLG DCSPADQRRL EARRDVLVYT SPPLSKGLEI IGPIRAELHV RSDRAHADFV VRLCDVAPDG VSLNLCEGVR RVRPGVGETD GEGVCKITVD LWPAGHRFRP GHRLRVHVAG GAYPRVARNS GTGEPLGAET AWLAARHEVF HDPGRPSAII LPVLR
|
| |