Gene Information |
Locus tag | Tcur_1600 |
Symbol | |
ID | 8602921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 1855024 |
End bp | 1859175 |
Gene Length | 4152 bp |
Protein Length | 1383 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003299214 |
Protein GI | 269125844 |
COG category | |
COG ID | |
TIGRFAM ID | |
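The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp
length = gene_length(1855024, 1859175)
print(length)                  # 4152
print(protein_length(length))  # 1383
```

Under those assumptions, both computed values agree with the Gene Length (4152 bp) and Protein Length (1383 aa) fields.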
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.726146 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCTCGACC GGTTCATCGA GGCCCTGCGC GGGCTCGACC CTCAGCCCAC TGCCGAAGAG CTGGCCGACG CCCTCTGGCT GGCTCGTTAT CTGTCCGCGC AGAGCAGTGC ATCCGCGAGC GCTTCACACG CCAAGCCTTC GCTCCCACCC CCTGAACGGA AGGAAGGATC TCCCCCCACC GAGCCCGCGG CCCTGCGCCC GGACCCGCCG AGCCGCTCTG CGGTGACCAG GTCCGCGTCT CCACGGGCAG GCCTGCACCT GGCCTCACCC CAGGCCTCGG GCCCGCAGGT GATGCGTGTT CCCGCGCCAC CCGCGCTCCC CCGGATATTG CGTTTGGCCC AGGCGCTGCG CCCGCTGCGG GTGGGAGCCG ACTCCAGTAC GCGCCAGTAC CTGGACGAGG AGGCGACCGC CCAGCGAATC GCCGACACCG GCATCTGGCA GCCGGAGCTG CGTCCTGAGC AGGAACGCCG CCTGGACATC GCACTGGTGG TCGACGACAG TGCCTCCATG GCGGTCTGGC AGCGCACCGT CAGTGAGTTC CAGTCCGTCC TGGAACAGCT CGCCGCCTTC CGCGACGTGC GGGTGTGGCG GATCGACACC GACGATGACC GGCTGGCCCT GTATACCGGC GCTCCACGCA CCGGGTCCGG CCGCAGCCCA GAGGAACTGA TCGATCCGAC TCACCGGCGG GTGGTCTTGG TCGTCAGCGA CTGCATCGGC CGTGCCTGGA AGGACGGCCG GTTCGCAGCG CTGCTGGCGC GCTGGGGCCG GATGGGGCCG GTGGCGATCA TCCAGCCGTT GCCGCAACGG CTGTGGTGGC GATGTGCGGC GAACGTGGTG CCGGTGTCGA TCAGGGCGAC GCAGCCGGGC CAGCCGAACG AACAGCTGGA GGTGCGGTCG CAGGAAAGCG GTGAGGCTCT TGCGGGCATC GCGATTCCGG TGCTGGAACT GGAGCCTCGC TGGCTGCGGC CATGGGCCGA GCTGGTCGGC GGCACGGCGA ACCGGCTGAC CGGCATGGCG CTGCCGACCG GCCGTCCGGT CGCCGAGGAT TCTCCCGAGG ACGAGCCGGA CGGGCTCTCC CCCGCCGAGC GGGTGAAGCG GTTCCGCGCC ACGGCCTCGC CCACGGCATT CCGCCTGGCC GCCTATCTGG CCGCCGCTCC GCTGCGACCG CCGGTGATGA GACTGGTCCA GCAGGCCATG CTGCCCGATT CCCGGCCCTC TCATCTGGCC GAGGTCTTCT TGAGCGGCCT GCTGCGCAAG ACGGGGACGA CCGCAGATCC CGAGGAGGCC GATTACGAGT TCCTCGACGG CGTCCGAGAC ATCCTGCTGA GCGCCCTCAA ACGGAGTGAG GCGCTGCGTG TGGTGCAGGA GGTCTGGGAA GTCATGCGGA ACCGGTGGGG AGCGGGGTCG GACTTTTCGG CATTGCTGCG CGCAGTGGAG CAGGGCGCCG AGACCCCACC GCTGGATCCG CCGTTCGCTC GAGTGACGGC ACAGGTGCTG GCCCGTCTGG GCGGGCGCTA TGCGGCGATC GCCGAGCGGC TGAGAGCCGA ATTCGGATCC GCCGGCCCCA CCGGGCAGCC AGATGAGTTC GAAGAGGACG ACGCGCCGTT CAGCGGCCTG CCGCAAATCA CCCGAGCCCG GCCGGCGGCG CCGGTATTGG GTGGCGGGCT GCCGCCGCGC AACCCGAACT TCATTGGACG CGACGAACTG CTGCTGCAGA CCCGCCACCT GCTGAACAGC GGCGTCACAG CGCTGCTGCC ACAGGCCCAC CTCCTGGGAG GAGAAGGCAA GTCCCAGCTG 
GCGATCGAGT TCGCACATCG CCACATCGCC GACTACGACC TCATCTGGTG GATTCCCGCC GAGCAGATCA CTTTGGCCAG GTCCTCTCTG ACGTTGCTGG CGCGACGGCT GGGAACGCCG CTGAGCGACG ACATCAACCG GACCGTCGAA CGAGTACTCC AGGAACTGCG GAGAGGCGGG CGGTACCGGC GATGGCTGCT GATCTATGAC AACGCCATAG ACCCCGACGA GCTCATGCCG CTGATGCCGG CTGACCTGGT CGAAGGGCGG CTTGTCGTCC CACAGCAAAG AGGACTGGGA CACGTCCTGG TCACTTCTGG TGACGAGCGG TGGCGCCAGC GGGCCACCGT GCTCCAGGTG GGCGTGTTCA CCCGTGCGGA AAGCATCGCC TTCCTGCGCC ACCAGGTACC GCGGCTGTCG GCGTCCGAGG CCCACCGGCT GGCCGCCCAG GTGGAAGACC TGCCGTTGGC ACTGGAGCAG ACCGCCGCGC TGCTGGGCGA GACCGGCCTG GAAACCGAGG AATACCTGCG GCAACTTGAG GCGCAGTTCA CCCAGCAGTG GATTCGCACC CTGCCGCCGG AGTACCCAAG ACCTCTGGCA GCGACGCTGG GACTGGCCTT CGAACGGCTG CGCCAGGACG CCCCGGCGAC CGCACGGCTG CTGGAACTGT GGGCGTTCTT CGGGTCCGAA CCAGTCCCCC GGGAGCTGCT GTCGAGCGGC GGCCATGCTC GGCTGCCGAC CGTGCTTCAG GCGATCTTGG AGGATCCGCA CCGGCTGACC CGGGCGATGA ACGACATCAG CCGCTATGCG CTCGGCCGCT TCGACCGGCA GACGGCCAGT CTCCAAGTGC ACCGGCTGGT GCGGGTGATG CTGCAGGCCA GGCTGCCGGG ACGCAGGGGC GAGCAAGTCC GCAACCGCGT CCACCGGATC CTGGCCGCCG CGATCCCGGA GGCTCCCCCG GACAACGAGA CCACCTGGGC GCGCAGGGAA CAGATCGCCC CCCATGTGGT GCCCGCAGGC GCCATCGACG GCACGACCGA GCACGCCAGA GAAGTGGTGC TCGATCAGAT GCGCTACCGG TACCTACTGG GGGACTTCGA AGGCGCCCGG GATCTCGGTG AAAAAGCCTT GGAACGGTGG AGCCCCCTGC TGGGCCCGGA TGACGAGCAA GTGCTGGACG CCGGTCGCCA GATGGGGAAC GTGCTGCGCT CCCTCGGAGA GGTCAGTGAG GCCAGACGGC TCAATGCGGA GGTCCACCGG CGCACCCTGG CCCGGTTCGG GCCGGACAAT CTGAAGACCC TGCAAATCGC CAACAGCGTC GGGGCCGATC TGCGGTTGCA GGGTGACTTC GCCGGAGCGC TACGGCTCGA CCGGCAAACC CTGCAACGCA TGGTTCGCAT ACTCGGCAGG GATAGAGATG AAACGTTCAG GGTCGTCAAC AACGTGGGCA TCGACCTGCG ATTGATGGGG CGTTTCCAGC AAGCTTATGA AATCGACTCA GACGCATTCG ATCGGTTGCT CGGCCAGCAT GGTCCGCGGC ATCGCAGCAC TTTGGTGGCG ATGAACCAGG TCGCCCGTGA TCTGCACGGG CTGGGCCGTT ACCGAGAAGC CGAAACCCTG CACCGGCAGG CGCTGGAAAT GATGCGGGAG ACGCTCGGCC ATGATCATGC CATCGTCCTG CAGGCGGAGA TGAGCCGTGT CGGCACCCTG CGTCGGCTGG GGGCCTACCG GCAGGCCAGG AAGCTGGCCG AGGCGACGTT CGAACTCCAC CGGCAGCGAT TCGGCCGCGA GCACCCCGAC ACGCTGGCCG 
CGCAGCGCAG CCTGGCGGTG GCGTGCGCGG TCACCGGTGA CGCCGAACGG GGACGGGAGC TGAGCGAGGA GGCATCGCGC GGATACCGCC GGCTTCTTGG CGCCGACCAT CCGTTCACGC ACGCCTGCGC CACCGACCTG GCCCTTAACC TGCGGGCACT GGGCGAGCAC GAGGCCGCGC TCCTGGCCGA CGACTCTGCC CTGCGTGCCC TGCAACGCAC GCTCGGAGCC GACCACTACT ACTCGCTCTG CTGCTCTGTG GGCCTGGTGC ACGACCTGTT CCACACCGGC CGGCTGGAAG CCGCGCTGAG CAGATCAGAG GACACGCGCG CCCATTTCCG CGAGCAGTAC GGCCCAGACC ATGTCTACAG CCTCATCTGT GAACACAATC ACCAGGTTCT GTTGAGAAGG CTGGGCCGTG AATCCTCAGG GCCTTCCCTT TCACAGAACC TCGCGAGCGT TCTTGGCGCA GATCATCCCG ATGTACGTAG AGCCAACGCA GACGAGCTGA TCGAATGCGA CATCACCCCC ATCCCCTTGT GA
Protein sequence | MLDRFIEALR GLDPQPTAEE LADALWLARY LSAQSSASAS ASHAKPSLPP PERKEGSPPT EPAALRPDPP SRSAVTRSAS PRAGLHLASP QASGPQVMRV PAPPALPRIL RLAQALRPLR VGADSSTRQY LDEEATAQRI ADTGIWQPEL RPEQERRLDI ALVVDDSASM AVWQRTVSEF QSVLEQLAAF RDVRVWRIDT DDDRLALYTG APRTGSGRSP EELIDPTHRR VVLVVSDCIG RAWKDGRFAA LLARWGRMGP VAIIQPLPQR LWWRCAANVV PVSIRATQPG QPNEQLEVRS QESGEALAGI AIPVLELEPR WLRPWAELVG GTANRLTGMA LPTGRPVAED SPEDEPDGLS PAERVKRFRA TASPTAFRLA AYLAAAPLRP PVMRLVQQAM LPDSRPSHLA EVFLSGLLRK TGTTADPEEA DYEFLDGVRD ILLSALKRSE ALRVVQEVWE VMRNRWGAGS DFSALLRAVE QGAETPPLDP PFARVTAQVL ARLGGRYAAI AERLRAEFGS AGPTGQPDEF EEDDAPFSGL PQITRARPAA PVLGGGLPPR NPNFIGRDEL LLQTRHLLNS GVTALLPQAH LLGGEGKSQL AIEFAHRHIA DYDLIWWIPA EQITLARSSL TLLARRLGTP LSDDINRTVE RVLQELRRGG RYRRWLLIYD NAIDPDELMP LMPADLVEGR LVVPQQRGLG HVLVTSGDER WRQRATVLQV GVFTRAESIA FLRHQVPRLS ASEAHRLAAQ VEDLPLALEQ TAALLGETGL ETEEYLRQLE AQFTQQWIRT LPPEYPRPLA ATLGLAFERL RQDAPATARL LELWAFFGSE PVPRELLSSG GHARLPTVLQ AILEDPHRLT RAMNDISRYA LGRFDRQTAS LQVHRLVRVM LQARLPGRRG EQVRNRVHRI LAAAIPEAPP DNETTWARRE QIAPHVVPAG AIDGTTEHAR EVVLDQMRYR YLLGDFEGAR DLGEKALERW SPLLGPDDEQ VLDAGRQMGN VLRSLGEVSE ARRLNAEVHR RTLARFGPDN LKTLQIANSV GADLRLQGDF AGALRLDRQT LQRMVRILGR DRDETFRVVN NVGIDLRLMG RFQQAYEIDS DAFDRLLGQH GPRHRSTLVA MNQVARDLHG LGRYREAETL HRQALEMMRE TLGHDHAIVL QAEMSRVGTL RRLGAYRQAR KLAEATFELH RQRFGREHPD TLAAQRSLAV ACAVTGDAER GRELSEEASR GYRRLLGADH PFTHACATDL ALNLRALGEH EAALLADDSA LRALQRTLGA DHYYSLCCSV GLVHDLFHTG RLEAALSRSE DTRAHFREQY GPDHVYSLIC EHNHQVLLRR LGRESSGPSL SQNLASVLGA DHPDVRRANA DELIECDITP IPL
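The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons directly. The sketch below is a minimal illustration, not the database's own pipeline. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA, even though the gene is on the minus strand), and it relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code. Table 11 also permits alternative start codons, which are not handled here; that does not matter for this gene because it starts with ATG. The `translate` helper is a hypothetical name.

```python
from itertools import product

# Codon assignments in TCAG order; translation table 11 uses the same
# amino acid assignments as the standard code (it differs in start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a coding-strand DNA string; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk complete codons only; a trailing partial codon is dropped.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record
print(translate("ATGCTCGACC GGTTCATCGA GGCCCTGCGC"))  # MLDRFIEALR
print(translate("ATCCCCTTGT GA"))                     # IPL
```

The two fragments translate to the first ten residues (MLDRFIEALR) and the last three residues (IPL) of the listed protein sequence.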