Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_4647 |
Symbol | |
ID | 8606009 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 5259048 |
End bp | 5262176 |
Gene Length | 3129 bp |
Protein Length | 1042 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | Lantibiotic dehydratase domain protein |
Protein accession | YP_003302206 |
Protein GI | 269128836 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCCGTTT CCCGCCGTTT CCGCCACACC GGACTCGTGC TGGTACGCGC GACCACATGT CCCCCGGACG TGACGCCTCC CGGCGATGTG GACTTGGCCG ACCACGCTGC CGTGGCGCGG GAGGGAGCCG CGTGGGTGAG GCGAGTCTGG CAGCGCCGTG AGGTACGCGA AGCCGTGGAG ATGTCCAGCC CAGACCTGGC CGCCGGCATC GAGCGCCTTC TGACGGATGG CTGTCCCGCC GGGCGGCGGC TTCGCCGGAC GGTGGCCTCG CCGGCGTCGT ACCTGCTGCG GTGGGAACGG CGCGCCACCC CCTTCGGGAT GTTCGCCGGC GTCACCGCCG CTGCGCTCGG CCCGGCGGCG GTGAAGATCG ACACCGGTCA CCGTGCCGTC GTTCGCGCCG ATTCCGAATG GCTCGCCACG CTGATCGACC AGGTGGAGCG GCATCCTGCC CTGCGGATGC GGTTGACGGT GGTCGCGGAC AACACCGGCG TCGTCCGCGA CGGCCGCTTC ATCGTCCTCC GCAGGGCCGC GGTCGGAGCG CGATCACCGG GACCGCTGCA GGAAGCGTCG GTGCGTCTCA CCACACCCAT CGCCTACGCC CTCGAGCAGG CCGTCCGGCC CCAGCGGCTC GCCGCACTGG CCGATCGCAT GTCCCGGCGT TTTCCCAGCG CCGACACCGG CCGGATACAC ACGTTGCTGC ACGGCCTGAT CGACGGGGGA TTCCTCATCA CCGACCTGCG TCCGCCTTCG ACCGCTGAGG ACCCGCTGAA CCATCTCATC CAGGCGCTCC ACCGCGCAGG AGTCCGCGAG GCGGGCGACG CGGAGCTCAC GGAGATCACC AAGCTGCTCG ACCATCTGGA GGTCATCAAC CGCCAACTGA TGGAACACAA CACCGCTGCC GACCCGGCGC GTGCGGCGCC CCTGCGTGCC GCGGCCGCCG GCAGGATGCG GCATCTGGCG CGCGGCACCG GGCCCGTGCT GGCGGCCGAT CTGCGGCTCA ACGCCCACAT CGCGATCCCC GAACCGGTTG TGCACGAGGC CGAGCGAGCA GCCGATGTCT TGCTGCGGCT GTCAACGCAG CCGTTCGGCA CGACGGCGTG GTTGGACTAC CACGCCCGCT TCCGTGCCCG CTATGGCACC GGTGCACTTG TCCCGGTCCG GGAGCTGGTC ACCGACTCAG GACTGGGGTA CCCCCGCGGG TATCTGGGCG CGCCTCGTGC CCGCCCCGTG TGGCGGACAT TGACCCAACG CGACACCACT TTGATGGGGA TGATCCAGCA AGCCCTCGTG GACGGCCGTG AGGAGATCAC GCTGTCCGAG GCCGATATAG CGGCATTGAC GACAGGTGAT CACGGTGACG TCGTGGTGCC GTCGCGTATC GAACTCGGCA TCACGCTCCA CGCCACTTCA ACGGACGCGA TCAACCGCGG CGACTTCACC CTGCAGGTGG TGGCCGCGCC ACGCGCCCAC ACCAGCATGA TCGGCCGCTT CGCCCACCTG CTCGACAAGG CCGACCGGGC CAAGCTCGCC CGCACCTACG CTCCCGATGA CGACGTCGTG GTAGCGCAGT TGTCGTTCAC GCCCCGACGT CCGCACAACG ACAACGTCGT ACGCGTCGCC CCACACGCGG GAACAGTCGT CCTGCCGCTG GCCGAACACC CCGGCACCGC TCCCGGCCGG TCGGGTTCCG GTGGAACCGA GGTGGGCGTG CTCAGCTTGG ACGACCTGGC GGTCACCGCG GACGCCGATC AGATGTACCT GGTGCAGCGG TCCACCGGGC GACGTGTGCG CGCCTACATC CCCCACGCTC TGGACACCAC CGTGCAGACA CCGCCGTTGG CGCGGTTCCT GGCAGAAATC GCCGATGCAC GCAGCGCCGT GTTCGGCCCA TTCGACCTGG GAGCGGCCCG CGCCCTGCCG TACGTCCCCC GCATCCGCTA CCGGCGCACG GTGTTGTCTC CCGCACGCTG GCTCCTCACC CGCGGCGCCC TAGCCGCACC CGGCGAAGAC TGGGACGCGG CCTTGCACAG ATGGCGCCAC CGGTGGAGGG TGCCCGCTCG CGTCGTGCTG TGCCAGGGAG AATTGCGCCT GCCGTTGAAC TTGGACCGGC CTTTGGACCG CCGACTGCTG CAGGCTCGGC TGAATGCGGC TGAACGGCTC GAACTTCGTG AGGATGCCTC GTCCATCGCC TGGGGCTGGG TGGGACGCGC GGCGGAGCTG GTGATCCCGA TGGTCGCGGC CGCCGCGCCG AAGGGGCGGC CGCCGGTGAC CGCACCGCCG GGCATCATCC ATCAGCCGGG CGATGCGTTG TTGCTGCGGG CGCACTTGGT GGGCAACCCC GCGCACTTCG ACACCATCCT CACCCGTCAC CTCCCTGCGT TCGTCCAGCA GCTCGGCGTA CCGATCCGGC GATGGTGGGC GAGCAGGCAT CGCGACCTGA TCCGTCTGGA GGCCGACCAG TACCTCGTGG TGCTGTTCCG GCTCGCCGAA CGGTCCCAGT ACGGAGCAGT GGCCGCACGG CTGGCCGCCT TCGCTCAGGA GTTGCGAACC CGCGGCCTGC TCGAACAGCT CTTCCTGGCT CCCGCCGCAG AGCAGCCTGG CCGCTACGGG CACGGCCCGG CCCTGGAAGC GGCCGAGGAG GTGTTCGCCG CCGACACCGC AGCCGCCATA GCCCAGATCG CAATGGCCCA GGCCGCCGGG GTGGCGGCGC AGGCCGTCGC GGCGGCCTCC ATGACCCATC TCGCCGCTGC GTTCGCCTCC GATACCGCGG CCGGCTACCG CACTCTGGTC CGCCGCGCAA AGCGACGCTC CGGGCCGGTA GATGCAGTCG TACGCGATGC CGCCTTCCGC CTGGCCGATC CCGCAGCCGG CTTCGCCGCC GTCCGCGCCC TTCCGGGTGG CGACGCGGTC GCCGCCGCCT GGCAGCGCAG AGCCGACGCA TTGGCCGCCT ACCATCGCCT TCTCGCCCAA CAGCGCGACC CCGCAGACCT GGTGACGACG CTCCTGCACG GGCATCACGT CCGGGCGTTC GGTCCCGATC CCGAACACGA GGCCACCACC ATCCGGCCGG CCCGCGCCGC CGCCCTCCGG AATCTGGCCA CCGCCTCACC TGCGGCAGGT GCCCCGTGA
|
Protein sequence | MAVSRRFRHT GLVLVRATTC PPDVTPPGDV DLADHAAVAR EGAAWVRRVW QRREVREAVE MSSPDLAAGI ERLLTDGCPA GRRLRRTVAS PASYLLRWER RATPFGMFAG VTAAALGPAA VKIDTGHRAV VRADSEWLAT LIDQVERHPA LRMRLTVVAD NTGVVRDGRF IVLRRAAVGA RSPGPLQEAS VRLTTPIAYA LEQAVRPQRL AALADRMSRR FPSADTGRIH TLLHGLIDGG FLITDLRPPS TAEDPLNHLI QALHRAGVRE AGDAELTEIT KLLDHLEVIN RQLMEHNTAA DPARAAPLRA AAAGRMRHLA RGTGPVLAAD LRLNAHIAIP EPVVHEAERA ADVLLRLSTQ PFGTTAWLDY HARFRARYGT GALVPVRELV TDSGLGYPRG YLGAPRARPV WRTLTQRDTT LMGMIQQALV DGREEITLSE ADIAALTTGD HGDVVVPSRI ELGITLHATS TDAINRGDFT LQVVAAPRAH TSMIGRFAHL LDKADRAKLA RTYAPDDDVV VAQLSFTPRR PHNDNVVRVA PHAGTVVLPL AEHPGTAPGR SGSGGTEVGV LSLDDLAVTA DADQMYLVQR STGRRVRAYI PHALDTTVQT PPLARFLAEI ADARSAVFGP FDLGAARALP YVPRIRYRRT VLSPARWLLT RGALAAPGED WDAALHRWRH RWRVPARVVL CQGELRLPLN LDRPLDRRLL QARLNAAERL ELREDASSIA WGWVGRAAEL VIPMVAAAAP KGRPPVTAPP GIIHQPGDAL LLRAHLVGNP AHFDTILTRH LPAFVQQLGV PIRRWWASRH RDLIRLEADQ YLVVLFRLAE RSQYGAVAAR LAAFAQELRT RGLLEQLFLA PAAEQPGRYG HGPALEAAEE VFAADTAAAI AQIAMAQAAG VAAQAVAAAS MTHLAAAFAS DTAAGYRTLV RRAKRRSGPV DAVVRDAAFR LADPAAGFAA VRALPGGDAV AAAWQRRADA LAAYHRLLAQ QRDPADLVTT LLHGHHVRAF GPDPEHEATT IRPARAAALR NLATASPAAG AP
|
| |