Gene Information |
Locus tag | Tcur_1687 |
Symbol | |
ID | 8603010 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 1978075 |
End bp | 1979343 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003299300 |
Protein GI | 269125930 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
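The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon; the record does not state either, but both are consistent with the listed values. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 1978075
END_BP = 1979343
GENE_LENGTH_BP = 1269
PROTEIN_LENGTH_AA = 422


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```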
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00182912 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGAAC TGCGCCTCGT CGCGGTCAGC GAGGACGGTA CCTACCTCGT GCTGGCCACC GCGGGCCGAG GCACCCGGTT CACCCTGCCC GTCGACGACC GGCTGCGCGC CGCGGTCCGT GGGCACTTCT CCCGACTCGG CCAGTTCGAG ATCGAAGTGG AGAGTCCCTT GCGCCCCAAG GAGATCCAGG CACGCATCCG GGCCGGTGAG ACGGCGGAGG AGATAGCCGA GTCCGCGGGC ATCCCGGTCG AGCGCGTCCG CTGGTTCGAG GGCCCCGTCC TGCAAGAGCG CGAGTACATG GCCCAGCAGG CCCAGCGCGT GCCGGTGCGC CGGCCGGGCG AGTCGGCCCC CGGCCCGCCG CTGGGCGAGC TGGTCGAAGA GCGGCTCAAC CGCGGCGGCG TCGACATCGA GGACGTGGAG TGGGACTCCT GGAAGTGCGA AAGCAGCAGC TGGCGGGTGC GGCTGTCGTT CTTTGAGGAG GGCCGCCCGC GGGCCGCCGA GTGGCTGTTC GACCCCCGTC GCCGCCACCT GTCCCCGATG GACGAGCTGG CCGCCCGGCT CAGCGACATG GAGTGGGAGG GCGACGGCGA GGACTCCTCC GACACGGTGA CGCCGCTGGT GCCGCGCCGC CCCACCATGA AGGTCGTCTC CGACCCGCGG GAGGTCTCCC CCGCCCAGCC GCTGCGGCCC GGGCCCGCCC CCGTGCCGGA TCCCGCCCGG CGGCCCGCCC CGCTGGCCCC GGCGCCGCCC GCGCCGCCGG CTCTCGGCCG GCCTCCGGCG CCGCCGCCGC ACGTCGCCGA GCCGCCGTCC AGGCCGCCCG CGCCCGCGCC CGAGGGTGAG GAGCCGCGGG AGACGGCTTC GGCGTCCCAG ACGGCGCAGA GCGAGCCTGA GGCCCCCAAG GCGGCGCCCG CCGGCTCGGC GGAGACCGCT CCGGAGGCGA CGCCCGCCGC CGAACCGGCC TCCGCCAAGC CCGATCCCGC GCCCAAGGCG GCGCCCGCCG CCGCGGCGAC CGAGGCCGAG CGGCCCGTCG CAGCGGCGGC GGTCAAGCCC GCCCCGCCCC GGGAGCGGTC CCGGCAGAAG CCGCCCGTGC CGCCCGCCGC GGCGGCCGAG ACGCCGCCCG CGCAGCCGGC CGTGGCCGCC CGCGACGTTC CCGCGGTGCC CGCCGCCCAG CAGCGGCAGT CCGCCCCGCG GCGCCGCAGG CCCAAGGGCA AGCGCGCCTC GGTGCCGTCG TGGGACGAGA TCATGTTCGG CGCTCGCCGT CCCGACTGA
|
Protein sequence | MQELRLVAVS EDGTYLVLAT AGRGTRFTLP VDDRLRAAVR GHFSRLGQFE IEVESPLRPK EIQARIRAGE TAEEIAESAG IPVERVRWFE GPVLQEREYM AQQAQRVPVR RPGESAPGPP LGELVEERLN RGGVDIEDVE WDSWKCESSS WRVRLSFFEE GRPRAAEWLF DPRRRHLSPM DELAARLSDM EWEGDGEDSS DTVTPLVPRR PTMKVVSDPR EVSPAQPLRP GPAPVPDPAR RPAPLAPAPP APPALGRPPA PPPHVAEPPS RPPAPAPEGE EPRETASASQ TAQSEPEAPK AAPAGSAETA PEATPAAEPA SAKPDPAPKA APAAAATEAE RPVAAAAVKP APPRERSRQK PPVPPAAAAE TPPAQPAVAA RDVPAVPAAQ QRQSAPRRRR PKGKRASVPS WDEIMFGARR PD
|
| |
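As a rough consistency check, the gene sequence can be translated and compared with the protein sequence. The sketch below is a minimal translator, not the pipeline that produced this record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which start codons are permitted). It also assumes the gene sequence is already shown in coding orientation, which seems likely since it begins with ATG and ends with TGA even though the gene lies on the minus strand. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, copied from the record.
    print(translate("ATGCAGGAAC TGCGCCTCGT CGCGGTCAGC"))  # MQELRLVAVS
```

The printed residues match the first block of the protein sequence (MQELRLVAVS). Passing the full gene sequence to the same function allows the whole 422-residue product to be compared.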