Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_4041 |
Symbol | |
ID | 8605397 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 4616383 |
End bp | 4618173 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003301608 |
Protein GI | 269128238 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.140814 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGGGCCC GGCTTCGCCT GACCGCCGCC GTCGCGGCGC TGGCCTGCCT GGCCGGGTGC GCGAGCGTGC CCAGCGGCGG CCGGGTCACC TCCGGCCGGC CCGCCGAGCG CGCCGAGCCG ATCGCCGAGC CCTATGTGCG GGTGGTGCCG GTCTCGCCGC GTCCCGGCTG GGCGCCGGAC AAGATCGTCA AGGGGTTCCT CATCGCCTCC TCGGCGTTCA ACGACGACTA CAAGGTGGCC CGCCAGTACC TGACCTCCTC GGCCGACGCC GCCTGGCGGC CCGGCCCGCG GCCCCAGGTG ATCGTCCTGG ACGGCGAGCC GATCATCTCG GCCCCCCGGG AGGCCGGCGG GGAGATGCTC GTGGAGGTGA CCGCCACCCG CCTGGGCCAC ATCGGCTCAG ACGGCCAGTA CACGGCGCAG CCCGGCGGCG CCTACACCGC CACCTTCGCG TTGCAGAGCA ACTCCGCCGG CCAGTGGCGC ATCAGCCGCA TGCCGGCCGA GCTGCGCGAC GGCCTGCTGC TGAGCCAAAG CGATGTGGAC CGCGTCTTCC GGGTGCTCAA CCTGTACTTC TTCGACCCCG ACGGCAAGGT GCTGGTGCCC AACGCCGTCT TCCTGCCGCT GGTCAACCGC CAGCGGCTGG CCCGGCAGCT GGTCGCCGCG CTGCTGGCCG GGCCCACCAC CTGGCTCAAG CCCGCGGTGC GCAGCTACTT TCCCTCCGGC ACCCGGCTGC AGGATGTGGA CATCTCCGAC GGCGTGGCCA CCGTGAGCCT GAGCGCGGAG GCCTACCGGG GCGACCATGA GCGCATGTCG GCCCAGCTGA CCTGGACCCT GCAGCGGCTG CCGGAGGTGC GGCGGGTCCG GCTGCAGATC GACGGGGACG CCATCGAGCC GGGCGGGGTG GGGCCGATCC AGGCCCCCGA GGACTGGGAG GAGTACAACT CCGACACCTC CGAGGGCGGC ACCGAGGAGA CCGTCTACCT GCGCGGCCCC GACGGACGCC CGCACCAGCT GCACGACAAC GACGTCACCA CCCCGGCGAA CACCTCCGGC CGGGACCGCC TGCACCGGCC CGCGCTGTCC CCGCACGATG GGCGCAGCCG CATCGCCGGC CTGAACGCCG ACGGCGACAC GGTGCTGAGC GGGGACCTGC TGGGCGGCAC TCCGATCCGC CCGGTGATGT GGGCCGAGCA CTCCGACGGG CGCTTCACCC CGCCCTCGTG GGATCTGCGC GGCTGGCTGT GGAGCGTGGA GTCCTTCTCC GGCGGCTCGT CCCTGTGGCT TCGGCAGCGG GACCGGCAGC CGGTCCGGAT CCGCCAGTGG GAGCTGTCCG GGCACCGGGT GGTGGCGTTC CGGGTGGCCC GGGACGGGGT GCGCGTGGCG GCCATCGTGA AGATCGGCGA CAGCAGGCAG CTGCGGCTGG GCCGGATCGT GCGCGACGCC AAGGGAGTGA TCAGCGTCGG GGGTTTCCTG CCGATCAGTC CCGAACTGGT GGACGTCACC GATCTGGCCT GGGGCGACTC CACCTCGCTG GCCGTGCTGG GCCGCACCAA GCCCACCGAG ACCCAGGTGA CGCCCTACTG GGTGCCGGTC AACGGGGGCG ACATCCTGCC GGTGGGCACC GCCAGCCAGG GCGAGGCCGA GTCGATCACC GCGGCGCCGG GCTCGCGCAT CGTGATCGGT GCGCGGATCA ACGGTGAGGA CAACATCTGC CGCCAAAGCA GCCTGCGCGA CTGGCGGTTC GACCAGTGGA AGTGCGCGCC GCTGGGTTCT GACCCCACCT ATTGGAGCTG A
|
Protein sequence | MRARLRLTAA VAALACLAGC ASVPSGGRVT SGRPAERAEP IAEPYVRVVP VSPRPGWAPD KIVKGFLIAS SAFNDDYKVA RQYLTSSADA AWRPGPRPQV IVLDGEPIIS APREAGGEML VEVTATRLGH IGSDGQYTAQ PGGAYTATFA LQSNSAGQWR ISRMPAELRD GLLLSQSDVD RVFRVLNLYF FDPDGKVLVP NAVFLPLVNR QRLARQLVAA LLAGPTTWLK PAVRSYFPSG TRLQDVDISD GVATVSLSAE AYRGDHERMS AQLTWTLQRL PEVRRVRLQI DGDAIEPGGV GPIQAPEDWE EYNSDTSEGG TEETVYLRGP DGRPHQLHDN DVTTPANTSG RDRLHRPALS PHDGRSRIAG LNADGDTVLS GDLLGGTPIR PVMWAEHSDG RFTPPSWDLR GWLWSVESFS GGSSLWLRQR DRQPVRIRQW ELSGHRVVAF RVARDGVRVA AIVKIGDSRQ LRLGRIVRDA KGVISVGGFL PISPELVDVT DLAWGDSTSL AVLGRTKPTE TQVTPYWVPV NGGDILPVGT ASQGEAESIT AAPGSRIVIG ARINGEDNIC RQSSLRDWRF DQWKCAPLGS DPTYWS
|
| |