Gene Information |
Locus tag | Tcur_3215 |
Symbol | |
ID | 8604561 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 3714809 |
End bp | 3716071 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | deoxyguanosinetriphosphate triphosphohydrolase |
Protein accession | YP_003300792 |
Protein GI | 269127422 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
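The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3714809
END_BP = 3716071
GENE_LENGTH_BP = 1263
PROTEIN_LENGTH_AA = 420


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in an unspliced CDS, minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the derived values with the ones listed in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3716071 - 3714809 + 1 gives 1263 bp, and 1263 / 3 - 1 gives 420 aa, matching the listed gene and protein lengths.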
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0024754 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGAGCT ATTCCGAAGC AGACCGGGCG CGGTGGGTGC CCGAGCAGCC CAAGCGCCGT GACCGCACCG CCTTCGAACG GGACCGGGCG CGGGTGCTGC ATAGCGCGGC GCTGCGCCGG CTGGCCGCCA AGACCCAGGT GGCCGACCCC GGCTCCGACG ACTTTTTGCG CACCCGGCTG ACGCACTCGC TGGAGTGCGC CCAGGTGGGC CGGGAGCTGG GCAAGTCGCT GGGCTGCGAC CCCGACCTGG TGGAGACCGC CTGCCTGGCG CACGACATCG GCCACCCGCC CTTCGGCCAC AACGGCGAGT CCGCCCTGAA CACGGTGGCC GAACGCTGCG GCGGATTCGA GGGCAACGCC CAGAGCCTGC GCATCCTGAC CCGGCTGGAA CCCAAGTCGT TCGCCCCCGA CGGCCGCAGC GTCGGCCTCA ACCTGACCAG GGCCTCGCTG GACGCGGTGA TGAAATACCC CTGGCGCCGG CAGGACGCGC CCACCGGGGC GCCGTCGGGC GGCTCGGGCG CACCGTATGG GATCTACGAC GACGACATCG AGGTGGCGGC CTGGGTCCGC CAAGGGGCGC CCAAGGACCG GCTGTGCCTG GAGGCCCAGG TCATGGACTG GGCCGACGAT GTGGCCTACT CCGTGCACGA CCTGGAGGAC GCCCTGGTCG TCGGCCATGT GGACTTCGCC CGCCTGGCCG ACCCCACCGA GCGCCGCGCG GTGGCCGAGA CGGCCGCCAA GCTCTACTGC CCCGGCACCG ACCTGACCGA GCTGGAGGAG GTCTTCGCCG AGCTGCTGGC CGAGCCGTAC TGGCCGGACC ACTTCGACGG CACGCTGCGC ACCCTGGCCG CCCTCAAGAA CCTCACCAGC ACGCTGATCG GCCGCTTCTG CCTGGCGGCC GAGGACGCCA CCCGGCGGCG TTATGGCCCC GGCCCGCTGA CCCGCTACGA CGCCGACCTG GTCGTCCCCC GCCGGCAGCG CCTGGAATGC GCGCTGCTGA AGGGGGTGAC CGCGCACTAT GTGTGGATCA GCCACGAGGC CAACCGCGCC CGGCAGCGCG AACTGATCCT GGAGCTGGCC GATTGGATGC TGGCCGGCGC CCCCGGCACC CTGGAACCGC AATTCCGGGT CGCCTGGCAC CGGGCGCCCG ACGACGCCGC CCGGCTGCGG GTGGTGGTCG ACCAGATCGC CTCGCTGACC GACACCTCCG CCGTGGCGTT CCACGCCCGC CTGCGCGCCG CCCGCCGGCC CGCCGGTGCC TGA
|
Protein sequence | MGSYSEADRA RWVPEQPKRR DRTAFERDRA RVLHSAALRR LAAKTQVADP GSDDFLRTRL THSLECAQVG RELGKSLGCD PDLVETACLA HDIGHPPFGH NGESALNTVA ERCGGFEGNA QSLRILTRLE PKSFAPDGRS VGLNLTRASL DAVMKYPWRR QDAPTGAPSG GSGAPYGIYD DDIEVAAWVR QGAPKDRLCL EAQVMDWADD VAYSVHDLED ALVVGHVDFA RLADPTERRA VAETAAKLYC PGTDLTELEE VFAELLAEPY WPDHFDGTLR TLAALKNLTS TLIGRFCLAA EDATRRRYGP GPLTRYDADL VVPRRQRLEC ALLKGVTAHY VWISHEANRA RQRELILELA DWMLAGAPGT LEPQFRVAWH RAPDDAARLR VVVDQIASLT DTSAVAFHAR LRAARRPAGA
|
| |
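The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the GC content field could be recomputed from the gene sequence and how the start and stop codons could be checked. The helper names are hypothetical, and the start codon set (ATG, GTG, TTG) is an assumption covering only the most common bacterial initiators rather than the full list permitted by translation table 11.

```python
# Stop codons under translation table 11; the start codon set is a
# simplifying assumption (most common bacterial initiators only).
STOP_CODONS = {"TAA", "TAG", "TGA"}
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(text):
    """Join the whitespace-separated 10-character blocks into one string."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def has_start_and_stop(seq):
    """True if the sequence begins with a start codon and ends with a stop codon."""
    return seq[:3] in START_CODONS and seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # Only the first three blocks of the gene sequence are used here;
    # paste the full "Gene sequence" field to check the whole CDS.
    fragment = clean_sequence("ATGGGGAGCT ATTCCGAAGC AGACCGGGCG")
    print(len(fragment), round(gc_fraction(fragment), 3))
```

Because the listed gene sequence begins with ATG and ends with TGA, it is already given in coding orientation, so no reverse complement is needed even though the gene lies on the minus strand.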