Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_2714 |
Symbol | |
ID | 8604057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 3166283 |
End bp | 3167680 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | protein of unknown function DUF35 |
Protein accession | YP_003300303 |
Protein GI | 269126933 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00764236 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCACCT ACAGCGGCAT CGGCGCCTAC GCAGCGTATC TGCCCGCGTA CGCGCTGGCC GGCAACAGTC TTGAAGGCGC CGCGCCACGC CGTGGCGGCC CCATCCGCAG CGTCGCCGCG TTCGACGAGG ACGCCGTGAC GATGGCGGTC GAGGCGCTGC GGGATCTGGC GGCCCGCGCC CCCGGCGGCG ACCGGCTCGT CCTCGCCACC ACCAGTGCGC CCTACGACGG CAAGACCTCG GCCGGGATCG TGCACGCGGC CCTCGGCCTC GCCCCCGGCG TCGCCGCGGT GGACCTGCAC GGCGACCGGG CGGGCGCGAC CGCACTGGAC CTGGTGATGC GCACCGGCGC CCTGGCCGCC GTCAGCGACA TGCGCACGCC CCGCTCGGGC GCCCCCGACG AGCTCGCCCA CGGCGACGCC GCCGCCGCGT TCGCCCCCGG CGACGGTGCG GCCGTTGTGC TCGCCCACGC CGGGCAGACC GTGGAGGTGC TGGACCGGTG GCGGCTGCCC GGAGAGCGGC ACGAGCACGT GTGGGACGAG CGGTTCACCG CGAGCGTGCT GGTCGCCGCG GCCAAGGACG CCGCGCGGCG GGCCCTGGCC GACGCCGGCC TGGAGTCCGT CGATCACGTG GTCGTGGCGT CCCCCAACGC CCGCGCCGCC GCGACGCTGC GCCGCGCCTT CGGCGGCTCC GGCGCCGACG CCGAGGTCGA GCGGCTCACC GGCCACACCG GCGCCGCGCA CCTGGGGCTG CTGCTCGCCG CCACGTTGGA TGCGGCCGAG CCCGGCCAGA CGATCCTGGC GCTGTCGGCG GCCGAGGGCG CCGACGCGTT CGTGCTGCGC GTCGGCGACG GCGTGCGCGC GGCCCGGGCC GGCCGCCCGG TGCGCGAGCA GCTCGCCGCT CGCGAGCACC TCGCCTACGG CCGCTACCTG CGCTGGCGGG GCCTGCTGGA AGTGCAGGGC CCGGCGCGCC CGGCCGCCCC GGCCCCGGCC GCCCCGCCGA TGTACCGGCG GGCCGCCTGG AAGTACCGCC TGGAGGGCGC CCACTGCGGC TCCTGCGGCG GCATCACCAC CCCGCCGGGC AAGGCGTGCG CCGCGTGCGG CGACGTCGCG GAGCAGCCCC GCACCGTCTC GCTGCGCGAC CAGGTCGCCA CCGTCGTGTC GGTCACCCGG GACCGGCTGA CCACCATGCC CGAGGCCGAG GTCGCGATCG TCGTCGCCGA CGTGGCGGTC GAGGGCAGCC GCGGCGGCCG GCTCACCGCC TACGCCACCG ACGTCGCCCC CGGCGCCATC ACCGTGGGCA TGACGATGCG GCCCACGTTC CGGCGGCTGT GGAGCACCGA CGCGATCCAC AACTACTTCT GGAAGCTTCG CCCGTGGAAG GCCACCAATG ACGACTAG
|
Protein sequence | MPTYSGIGAY AAYLPAYALA GNSLEGAAPR RGGPIRSVAA FDEDAVTMAV EALRDLAARA PGGDRLVLAT TSAPYDGKTS AGIVHAALGL APGVAAVDLH GDRAGATALD LVMRTGALAA VSDMRTPRSG APDELAHGDA AAAFAPGDGA AVVLAHAGQT VEVLDRWRLP GERHEHVWDE RFTASVLVAA AKDAARRALA DAGLESVDHV VVASPNARAA ATLRRAFGGS GADAEVERLT GHTGAAHLGL LLAATLDAAE PGQTILALSA AEGADAFVLR VGDGVRAARA GRPVREQLAA REHLAYGRYL RWRGLLEVQG PARPAAPAPA APPMYRRAAW KYRLEGAHCG SCGGITTPPG KACAACGDVA EQPRTVSLRD QVATVVSVTR DRLTTMPEAE VAIVVADVAV EGSRGGRLTA YATDVAPGAI TVGMTMRPTF RRLWSTDAIH NYFWKLRPWK ATNDD
|
| |