Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_3618 |
Symbol | |
ID | 8604969 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 4165715 |
End bp | 4167370 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | urocanate hydratase |
Protein accession | YP_003301190 |
Protein GI | 269127820 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0116497 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGGTC CTCGTACCGT CCGCGCCCCG CGCGGCACGT CGCTGACCGC CAAGGGGTGG CCGCAGGAGG CCGCGCTGCG GATGCTGATG AACAACCTCG ACCCCGACGT GGCCGAGCAC CCCGAGGAGC TGGTCGTCTA CGGCGGCACC GGCCGCGCCG CCCGCAACTG GGACGCCTTC GACGCGCTCG TGCGCTCCCT GCGGGACCTG GAAGGGGACG AGACGCTGCT GGTGCAGTCC GGCAAACCCG TGGGGATCTT CCGCACCCAC GAGTGGGCGC CGCGCGTGCT GATCGCCAAC TCCAACCTGG TGCCGCAGTG GGCCACCTGG GAGGAGTTCC GCCGGCTGGA GGCGGCCGGG CTGACCATGT ACGGGCAGAT GACCGCCGGG TCGTGGATCT ACATCGGCAC CCAGGGCATC CTGCAGGGCA CCTACGAGAC GTTCGCCGCC GTGGCCGCCA AACGGTTCGG CGGGTCGCTG GCCGGGACCC TCACCCTGAC CGCCGGGCTG GGCGGCATGG GCGGCGCCCA GCCGCTGGCC GTCACCATGA ACGGCGGGGT GGCCCTGTGC GTGGAGTGCG ACCCGTCCCG CATCGACCGC CGCGTCGCGC ACGGCTACTG CGATGTGCGC GTCGACGACC TGGACGAGGC GCTGCGCCTG GCCGAACAGG CCAAGGAGAG GCGGCGGCCG CTGTCGATCG GGGTGCTCGG CAACGCCGCC GACGTGGTGC CGAGGCTGCT GCGCCAAGGC GCCCCCGTCG ACATCGTCAC CGACCAGACC TCCGCCCACG ACCCGCTGAC CTACCTGCCG CTCGGCATCG CCTTCGAAGA CATGGCCGCC GAACGCGACA AGGACCCGGC CGGCTTCATC CGCCGCGCCC GCGAGTCCAT GGCCGTCCAC GTGGACGCCA TGGTCGGCTT CCACGACGCC GGAGCCGAGG TCTTCGACTA CGGCAACTCC CTTCGCGGCG AGGCCAAGCT GGGCGGCTGC GAGCGGGCCT TCGACTTCCC CGGCTTCGTG CCCGCCTACA TCCGCCCGCT GTTCTGCGAG GGAAAAGGCC CGTTCCGCTG GGCGGCGCTG TCGGGCGACC CCGCCGACAT CGCCCGCACC GACCGGGCGA TCTGCGAGCT GTTCCCCGAC AACGAGCCGC TGGTCCGCTG GATCCGCACG GCCGGGGAGA AGGTGCGCTT CCAAGGGCTG CCCGCGCGGA TCTGCTGGCT CGGCTACGGA GAACGCGACA AGGCCGGGGC GGTCTTCAAC GACCTGGTGG CCCGCGGCGA GATCAGCGCG CCCATCGTGC TGGGCCGCGA CCACCTGGAC GCCGGCAGCG TGGCCAGTCC CTACCGGGAG ACCGAAGCGA TGGCCGACGG CTCCGACGCC ATCGCCGACT GGCCGCTGCT CAACGCCCTC CTCAACACCG CCTCCGGCGC GTCCTGGGTG TCGATCCACC ACGGCGGCGG CGTCGGCATC GGCCGCTCGG TGCACGCCGG CCAGGTGTGC GTGGCCGACG GCACCCCGCT GGCCGCCGAG AAACTGGCCC GCGTCCTGAC CAACGACCCC GGCACCGGCG TGATGCGCCA CGCCGACGCC GGATACCCCC GCGCCGCTCA GGTCGCCGCC GAACGCGGCG TCTGCATCCC CATGGCGGAG CGATGA
|
Protein sequence | MSGPRTVRAP RGTSLTAKGW PQEAALRMLM NNLDPDVAEH PEELVVYGGT GRAARNWDAF DALVRSLRDL EGDETLLVQS GKPVGIFRTH EWAPRVLIAN SNLVPQWATW EEFRRLEAAG LTMYGQMTAG SWIYIGTQGI LQGTYETFAA VAAKRFGGSL AGTLTLTAGL GGMGGAQPLA VTMNGGVALC VECDPSRIDR RVAHGYCDVR VDDLDEALRL AEQAKERRRP LSIGVLGNAA DVVPRLLRQG APVDIVTDQT SAHDPLTYLP LGIAFEDMAA ERDKDPAGFI RRARESMAVH VDAMVGFHDA GAEVFDYGNS LRGEAKLGGC ERAFDFPGFV PAYIRPLFCE GKGPFRWAAL SGDPADIART DRAICELFPD NEPLVRWIRT AGEKVRFQGL PARICWLGYG ERDKAGAVFN DLVARGEISA PIVLGRDHLD AGSVASPYRE TEAMADGSDA IADWPLLNAL LNTASGASWV SIHHGGGVGI GRSVHAGQVC VADGTPLAAE KLARVLTNDP GTGVMRHADA GYPRAAQVAA ERGVCIPMAE R
|
| |