Gene Information |
Locus tag | Tcur_1870 |
Symbol | |
ID | 8603197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 2191994 |
End bp | 2194348 |
Gene Length | 2355 bp |
Protein Length | 784 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003299481 |
Protein GI | 269126111 |
COG category | |
COG ID | |
TIGRFAM ID | |
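The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the listed Gene Length) and that the final codon is a stop codon that is not counted in Protein Length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2191994
end_bp = 2194348

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Each codon spans 3 bp; the last codon is assumed to be a stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints "2355 784"
```

Both results match the Gene Length (2355 bp) and Protein Length (784 aa) reported in the record.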
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000117839 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCCTGGC AACTGCACTA CACCTCGGCG CGCAAGGGCC CGACGGGCCG TTCGGGGTTT CAGTTCGTGG CCGAAACGCC CGGCCTGCCG CCGGGCGTGC GGGCGGCGGT GACCCCGTTC ATGGCCTACC GGCCGCCGCC GGATGCGCCG CTGTCGCCGA CCCCCAAGGA GCTGGAGCGT TTCCCGGTGG CGCTGTCCTA CGACCGGGTG GGCGGGCGGG CGCTGCTGGT GCGCTCGCGC TACCTGGGCC AGGACTATTC GGGCCGGCAG GGCAACTTCT TCGCGCACGC GGTGGTGGCC GAGCCGGAGG AGCTGGAGGG GCTGCGGCCG ATCGAGCTGT GGCAGGCGGA GATCTGGGCC GAGCAGCCGG GCGACGGCGA GCTGGCCCCG CTGGAGGACC TGCCGCCCGG CTCGGGCGTC TCCCCCGAGA CGCTCGCCGA ACGGCTGGCC GCGGAGAACT CCTACCCCCT GCTGGTGCGG CTGGTGGACA CGGCGGTGGC GGCGCTGGAC CGCGGGCACG GCAGGCTGGT GCTGGTGTGC CGGGACGCCG AGCCGATCGT GCACTGGATC GCGGTGCTGT CGTATTCGCT GCCGGCGGCG GTGGCGGCCC GGCTGTCGTT CACCACCTTC ACCGCCGACC CCGCCGGCAC GCCGCACCGG GTGGTGGGCA CCACGCCCGA TGTGTGGTCG GCCGTCCGCC CCGACGACCC GGCGTTCTTC CTGGACGAAA AGGAGCCCCG GGAGGAGAAG GCCGGGCGGG CCGCCGCCGG AGACGGCGGC AGCCGGTTCG CGCGCACCGT GGCGGACTGC TGGCGGGAGG CCGATTTCGC CGGGCTGGAC GCGCTCGGTG AGCTGGTCAC CGTCCAGCTC GACGGCGAAG ACGGCGGCGC GGAAAAGGCC GCGCTGGCGC CGGCGCTGGA GCGGGCGGCG GCGCTGCTGG CGCTGTGCCG GGGCCGCACC GCCCCCTCCC CCGAGGAGGA GAAGCCGGCC GCCGAGCTGC TGACCCGCCA CGGCGGGCGC GTCCCCGAGT GGGTGTGGCC GGAGCTGGTG CCGGCGCTGC CGGCGGTGGG GTTCGACCTG GCGCTGGCGA TGCACACCTG GGCGCGGCGG GTGCAGGCGG CGCAGGTCGC CGACCGGTGC GCCGAGCGCT GCGCGATGCT GGCGCTGCAG GACCCCGGCC TGGCCGGCCG GCTGCCGCGC TTTGAACTGT CGGCCCAGGC GCTGGAACGG CTGGAGCCGC AGCTGACGCA GGCGCTGCGG CAGGCCGCCG ACCTGACGAG GGTGGGGCAT CTGGTGGCGG CGGCCGTACG GGCCGGCGTC CCGCCGGCCG CCGAGGAGGT GCGCGCCGCC GCCGTCCGGT GCGCCCGCCG CGGCGCCGGC GACCTGGCCG AGGCGCTGAC GGCGGTGCCC GCCCCCTGGC GGGAGCGGCT GACCGACGGC GTGATCGCCG GGCTGGCGCA GGCCGATGCG GCCGGGCGCG CCGCCCTGCT CACCGACCGG GCCTGCGACC TGCTGCAGGC CCGGGACTGG TCGCAGGCGC CCGAGGTGGG GCTGGCGGTG CTGGCGTCGG TGGGGCGCCG GCACCGGGAC CGCCGGGTGG ATGTGACCGG CGCGCTGCTG CGGCTGGACG CATCGCAGTC GGCGGCGGTG GACGCGGTGC TGCGCGAGGT GTGGGCGGTG CCGGCGTCGG CGGCCGAGTG CGGCGCGCTG CTGGACGCCT ACGAGGAGCT GGTGCCCCGC TACCCGGCGC TGGCGGCGCT GCCGTCGCGG ACGTTCCGGC AGGCGGCGGT GGACGGCGAG 
CTGGCGCTGG AGGACCCGGT GCTGCTGCGG GTGGCCGGGC AGGTGCTGGC GGCCTTCGAA CCGCAGGCGC GCGCCGCCCG CGACGCCGCG GTGCTGATCG CCTACGCCAA GACCGTCGGC GCCGCGCAGC CGGACACCGC CGCCGCCGAA CTGGAGGTGG TGCACGCGGC CGCCGGCCAG GCCGACCCGC AGCTGTGCCG GACCGCCTTC ACCTGGGCGG CCCGGCGGCT GGCCCGCCGC GACACCCGCT TCCGGGCGGC GGTGCTGGCC GCCGCCTCCG CCCCGGTCCG CGCCCGGCTG GCCGGGCGGT GGATGGAGCC GCGGGGACGG CGGATCTGGG GGCGGCCGGC TTTCGGGCCC GGGCAGCGCA ACGAGCTGGT CGAGATCGCC CTGCGGCTGC GGCGGGCCGG GGTCAGCGAG CCCCGCCTGG AGGCGTGGGC GCGTTCGGCC GCCGGGGGCT GGATGGCCTC CCGGCAGCTG GAGTCGCACC TGCGCCAGGA GCCGGAGCTG CGCGCCGAGC TGCGCAGACT GCTGGCCGGC GGCGAGGAGG GCTGA
|
Protein sequence | MAWQLHYTSA RKGPTGRSGF QFVAETPGLP PGVRAAVTPF MAYRPPPDAP LSPTPKELER FPVALSYDRV GGRALLVRSR YLGQDYSGRQ GNFFAHAVVA EPEELEGLRP IELWQAEIWA EQPGDGELAP LEDLPPGSGV SPETLAERLA AENSYPLLVR LVDTAVAALD RGHGRLVLVC RDAEPIVHWI AVLSYSLPAA VAARLSFTTF TADPAGTPHR VVGTTPDVWS AVRPDDPAFF LDEKEPREEK AGRAAAGDGG SRFARTVADC WREADFAGLD ALGELVTVQL DGEDGGAEKA ALAPALERAA ALLALCRGRT APSPEEEKPA AELLTRHGGR VPEWVWPELV PALPAVGFDL ALAMHTWARR VQAAQVADRC AERCAMLALQ DPGLAGRLPR FELSAQALER LEPQLTQALR QAADLTRVGH LVAAAVRAGV PPAAEEVRAA AVRCARRGAG DLAEALTAVP APWRERLTDG VIAGLAQADA AGRAALLTDR ACDLLQARDW SQAPEVGLAV LASVGRRHRD RRVDVTGALL RLDASQSAAV DAVLREVWAV PASAAECGAL LDAYEELVPR YPALAALPSR TFRQAAVDGE LALEDPVLLR VAGQVLAAFE PQARAARDAA VLIAYAKTVG AAQPDTAAAE LEVVHAAAGQ ADPQLCRTAF TWAARRLARR DTRFRAAVLA AASAPVRARL AGRWMEPRGR RIWGRPAFGP GQRNELVEIA LRLRRAGVSE PRLEAWARSA AGGWMASRQL ESHLRQEPEL RAELRRLLAG GEEG
|