Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_0444 |
Symbol | |
ID | 8601741 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 503068 |
End bp | 506517 |
Gene Length | 3450 bp |
Protein Length | 1149 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003298080 |
Protein GI | 269124710 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCATCC TGCGCTACGA CCTGACCGCC TTCGGGCCCT TCACCGGGCT GTCGCTGGAC CTGGCGGCGC CCGGGGTGCA CCTGGTGGTC GGCCCCAACG AGGCGGGCAA GTCCACCGCC CGGCACGCGC TCGGCCAGCT GCTGTACGGC ATCGACGAGC GCACCCCGTA TGACTTCGTG CACGCCAAGC GGGACCTGCG GCTGGGCGCG CTGATCGGCC GCCGCGACGG CGGCACGCTG GAGATCGTCC GGGTCAAATC CCGCAAGGCG CCGCTGCGCA CCCCGGACGG CGACCCGATC GACCAGTCCG TGCTCTCGGC CGTCCTCGGC GGGATCGACC GGCAGACCTT CACCGCCGAG TTCGCCCTCA GCAGCACCGA GTTGCGCGAA GGCGGCAAAG CCCTGGTGGC CGGCAAAGGG GAGCTGAGCC AGGCGCTGGC CGCCTCCCGG TCCGGGCTGC GCCTGACCCG CGTCCAGGAG GCCATCAAGG CCCGCATGGA GGAGCTGTAC AAGCCGCGCG GCACCAGACC GGCCATCAAC ACCAAGCTCA GGGAGCTGAA GGAGACCAAC GCCCGCAAAA GAAAGGCGTC GCTGCGCCCC GACGACTACC TCGCCCGGGA ACGGGAGGTC GAACGGGCCC GGCAGGAACT GGAGCGGCTG AAGAAGGAAC TGCTGGATCT TCGTGCCGAC CATCTGCGGC TGGAACGGCT CGACCAGGCC CTGCCCCAGC TGAATCGCCG CCGCCATTTG CTGGAAGAAC TCGACCGGAT CCGCGCCGAG GGCCCACTGG CCCCGCCGGA TGCGGCCGAA CGCTTCCCGC ACCTGATGGA GGAGCTGCGC CAGGCCCGCC GCAGCGAAGA ACAGGCCGGG ACCCGGATCG AGAACATCGA CCGGCAGCTG GCGGAACTGC ACGTCGATGA GACGCTGGCG GCGCACGCCG ACGCCATCGA GGCCCTCTTG CAGGACATGG GCGCCGCCCA GGAGGCCGCC GAACGGCTGG AGCACCTGGC CGGGAGGGCC GCCGAGCGGC GGGAGGAGGC CGCCGCCCTG CTGGCCCGGG TGCACCCGGA CGCCGCCCTC GCCGACGAGC GGCGCTACCG CATCCCCCAG GCCGTCCGGG AGCAGGCCCG GGAACTGGGC GAAAGGCGCA CGGCCCTCGA CGCGGAGCTG CGGGAGCGGC GGCAGGCCCG CGACGACCGG CACCGCAAGC TGCAGCAGGC CCGGCGGCGC CTGGCCGGGC TGCCGCCCGT CCCCGACGAC CGGCCGTTGC GCTCGGCCCT GGACGCCGTC CCCGACGACC TGCTCCACCG CCTGACCACC ACCGCCGCAG AGGTCAACCG GCACCGCAAC CGGGCGGCCG CGATCCGGGA GCGGCTGGGG CTGCCCGAGG ACGGGGCCGT GGCCGTCCCC TCCCGCGAGC AGGTCGATGC CCACCGGCAG GAGGCCGACC GGATCGCAAA CGACCGGCGC AACCTCGCCG ACCGCATCGC CGAGCTGACC GAACGCCTGG AAGAACAGCG GCTGGAACTG GAGGGCCTGC GCGCCCACGA CCCGCCGCCC ACCCCCGAGG ACCTGGCCGC GGCCCGCGCC CGGCGCGATG AGCTTTGGGA CGGCCTCCCC GGCACCGAGT CGGAGTACCT GCGGGCCGTC CGCCACGCCG ACGAGATCGC CGACCGGATG TTCCGGGAGG CCGAACGGGT CAACCGGCTC CGCGAGACGC GCCTGGCGAT CGAACGGGAC GAGCGGAGAC TGGCCCGCCT CCAGGCCGAC CACGACGCCC TGGCCTCCCG GGAAGCGGAG CTGGAGGCCG CCTGGCGGCG CCTGTGGGAG GGCTACGCCG GCCCCGTCCC CCCGCCCGAG GCCGCCGCGG ACGCCTTGGA GGAGGTGCGG CGGCTGCAGG AGACGCAGCG GGAGGGGGAC GACGCCGCAG CCCTCTTCGA GGCGCTCGCC GACCGGGCCG CCCGCCACGC GGCCCGGCTG CGGGACCTGC TCGACCATCC CGCCGGCTCC GACGACCCCT GGACCGAGCT GCCGGAACTG CAACGGCTCG GCCGGCAGCG GCTGGAGGAG TGGCAGGAGA CGGCCGCCGC CTGCGCCGCC GCCGAGGAGA AGGTCGCCGC CGAAGAGCAC GAGCTGGAAG AGGCCGAGGC CGCCTGCGCC CACTGCGAGC GGCGGCTGGA GGAGTGGCGG GAACGCTGGA AGCGTCTGCT CGGCGAGGCC GGGCTGCCCG CCGGCCGCGA GCCCGTCGGC GCGCTGGCCG ACCTGGACCT GCTGGCCAGG GCCGAAGAGG CGCTGGCCGA CGCCGACCGG ATCGACCGGG AGGCGCAGGA GGCCGAACGG AAGGTCGCGC GCTTCCACGA AGAACTGACC CGCCTGGCCG ACCGGTGCGG CAGGGAGGTC CCCGCCGACC CGGCCGAGCG CCGCCTGCTG GTCCGCGCCC TCCACCAGGA CGCCAAGGAC AACCGGGACC GCGCCGCAGA ACGCGACCGG CTGCGGCGGG ACCGGGAGGA GCACCGCGCA GAACGCGACC AGGCCGCCCA GGCCGTGGCG CTGCTCCAGG CCGAGCTGGA AGAACTGATG CGCGCCACCG GGGCCTGTTC GGTGGAGGAA CTGGACGCCG CCGTCCGCCG CCGCGACCGC CACACCGAGC AGGAGCGCAA GCTCAAGGAG CTCACCGAGA CCATCGTCTG CGGCCCCCAC TCCCTGGAGG AGCTCATGGC GGAGGCCGCG GAGACCGACC CCGTCCGGCT CCGGGCCGAC CTGGAGGAGC TGTCGGGACG CATCGAGGAA CTGGAGAGCC TGCGCACCCG GCTCCAGGAC ACCTTCACCC GCAGCAAGGC CGAGCTGGAC CGGCTGGACG GCTCCGCCGA GGCGGCCCAG GCCGCGGCCG AGGCCGAGAC GCTGGGCGCC GCCCTGGTCG AGGAGGCCGA AGAGTACCTG CGGCTGGAGA TCGCCCACGC GATCCTGCTG GAGTGCGCCG AAACCTACCG CAGCGCCCAG CAGGACCCGG TGCTGGAACG GGCCGGGCAC CTGTTCGGCG AGCTGACCCG CGGCCGTTTC AGCGGCGTCG AACTCGACCC GGACGAAGAC CCTCCGGTCA TCGTGGCCCG CCGCAGCGGC GGTGAGCTGC TGCGCGTCCA CCAGCTCAGC GAGGCCACCG CCGACCAGCT CTACCTGGCG CTGCGCCTGG CCTCCCTGGA ACGCTACGCG CAAGAGGACC GGGCCCTGCC GTTCACGGTC GACGACATCT TCATGACCTT CGACGACGCC CGCACCCGTG CCGCCCTGCG GGTGCTGGAC GGCATGGCCG ACCGCTTCCA GGTGATCGTG TTCACCCACC ACGAGCACCT CGCCGCCCTC GCCCGCCAGG CCCTGCCGCC CGGCCGCGTC CACGTCCACA CCCTGCCGGA GTACCGGCCC GGGGAGCTCG CACCGGCGGC GGGGGCCTGA
|
Protein sequence | MRILRYDLTA FGPFTGLSLD LAAPGVHLVV GPNEAGKSTA RHALGQLLYG IDERTPYDFV HAKRDLRLGA LIGRRDGGTL EIVRVKSRKA PLRTPDGDPI DQSVLSAVLG GIDRQTFTAE FALSSTELRE GGKALVAGKG ELSQALAASR SGLRLTRVQE AIKARMEELY KPRGTRPAIN TKLRELKETN ARKRKASLRP DDYLAREREV ERARQELERL KKELLDLRAD HLRLERLDQA LPQLNRRRHL LEELDRIRAE GPLAPPDAAE RFPHLMEELR QARRSEEQAG TRIENIDRQL AELHVDETLA AHADAIEALL QDMGAAQEAA ERLEHLAGRA AERREEAAAL LARVHPDAAL ADERRYRIPQ AVREQARELG ERRTALDAEL RERRQARDDR HRKLQQARRR LAGLPPVPDD RPLRSALDAV PDDLLHRLTT TAAEVNRHRN RAAAIRERLG LPEDGAVAVP SREQVDAHRQ EADRIANDRR NLADRIAELT ERLEEQRLEL EGLRAHDPPP TPEDLAAARA RRDELWDGLP GTESEYLRAV RHADEIADRM FREAERVNRL RETRLAIERD ERRLARLQAD HDALASREAE LEAAWRRLWE GYAGPVPPPE AAADALEEVR RLQETQREGD DAAALFEALA DRAARHAARL RDLLDHPAGS DDPWTELPEL QRLGRQRLEE WQETAAACAA AEEKVAAEEH ELEEAEAACA HCERRLEEWR ERWKRLLGEA GLPAGREPVG ALADLDLLAR AEEALADADR IDREAQEAER KVARFHEELT RLADRCGREV PADPAERRLL VRALHQDAKD NRDRAAERDR LRRDREEHRA ERDQAAQAVA LLQAELEELM RATGACSVEE LDAAVRRRDR HTEQERKLKE LTETIVCGPH SLEELMAEAA ETDPVRLRAD LEELSGRIEE LESLRTRLQD TFTRSKAELD RLDGSAEAAQ AAAEAETLGA ALVEEAEEYL RLEIAHAILL ECAETYRSAQ QDPVLERAGH LFGELTRGRF SGVELDPDED PPVIVARRSG GELLRVHQLS EATADQLYLA LRLASLERYA QEDRALPFTV DDIFMTFDDA RTRAALRVLD GMADRFQVIV FTHHEHLAAL ARQALPPGRV HVHTLPEYRP GELAPAAGA
|
| |