Gene Information |
Locus tag | Tcur_0957 |
Symbol | |
ID | 8602265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 1087340 |
End bp | 1088461 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_003298581 |
Protein GI | 269125211 |
COG category | |
COG ID | |
TIGRFAM ID | |
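The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (the listed gene sequence ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1087340
END_BP = 1088461


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1122 373"
```

Under these assumptions the computed values agree with the Gene Length (1122 bp) and Protein Length (373 aa) fields of this record.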
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.490105 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATCGT TCGTCCAGCT CCGCCGGGGG AAGACCCCGC GGCAGATCCA CCGGGACGTG GGGGACCTCA AAGACGACGA GCTCGGCCGC TACGGCTTCA CCGGCCGCAC CGCCCACCTG TACCGCCGCA ACGACCCCAC CCGGTTCCGC ATCGAGGGCG ACCTGGCCGC CGTCAACGTG CAGACCGGCG AGCTGAAGCC CACCGACCTG GAGGCCGACG GCGAGCCGCT GGTGATGTTC CACAACCCCG ACTGCCGGAT CCTGCTGAGC CGCCGCGGGC AGGTCGCGCC GTTCTACACC CGCAACATCG ACGGCGATGA GCTGATCTTC GTCCACGAGG GGACCGGGCA TTTCGAGACC GAGTTCGGGC GGCTGCCGTA CCGGCCCGGC GACTGGGTGT ACCTGCCCAA GGGCACCACC TACCGGCAGA TCCCCCAGGA CCGCTCCCAC CTGCTGATCA TCGAGGCGAC CGAGGAGTTC CGGGTGCCCG AGGCCGGCAC GCTCGGCCGG TTCTTCCCCT TCGACCCCTC CCTGATCACC ATCCCCGAGC CGGAGGTCTT CCCCGACGAC GGCCGGGAGG AGTACGAGAT CCGCCTGCGC CAGCGCGAGG GCCGCTCCTC GCTGTTCTAC CCGTTCAACC CGCTGGACGT GGAAGGGTGG CGGGGCGACA ACTTCCCCTT CACCTTCAAC ATCGCCGACT ACGACGTCAT CACCTCCGAC GACGTCCACC TGCCCCCCAC GGTGCACCTG TTCATGCAGG CCACCGGGGT GTACGTGCTG AACTTCCTGC CCCGCCCGGC CGAGGGCAAG CCGGGGGTGG AACGGGTCCC CTGGTACCAC CGCAACACCG ACTACGACGA GATCGCCTTC TACCACGGCG GCAGCGTCTT CGGCGTGGAC ATGCCCGCCG GGCTGATCTC GCACGCCCCC CAGGGCATCC ACCACGGCGT CCCCGAGCGG GCCCGCCGGC GCGCCCGGCG CCTGTTCGAG CAGGAAAAAC GGGTGGAGTG GAAGGTCATC GCGATCGACA CGCGCCGCCG GCTCATCCCC ACGCCGGTGA TGCTGGGCGC CCAGCCCACC CAGACCAAGG AAGAAGCCGA CAAGAAAGAG GTGCGGGCGT GA
|
Protein sequence | MESFVQLRRG KTPRQIHRDV GDLKDDELGR YGFTGRTAHL YRRNDPTRFR IEGDLAAVNV QTGELKPTDL EADGEPLVMF HNPDCRILLS RRGQVAPFYT RNIDGDELIF VHEGTGHFET EFGRLPYRPG DWVYLPKGTT YRQIPQDRSH LLIIEATEEF RVPEAGTLGR FFPFDPSLIT IPEPEVFPDD GREEYEIRLR QREGRSSLFY PFNPLDVEGW RGDNFPFTFN IADYDVITSD DVHLPPTVHL FMQATGVYVL NFLPRPAEGK PGVERVPWYH RNTDYDEIAF YHGGSVFGVD MPAGLISHAP QGIHHGVPER ARRRARRLFE QEKRVEWKVI AIDTRRRLIP TPVMLGAQPT QTKEEADKKE VRA