Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_3042 |
Symbol | |
ID | 8604386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 3528481 |
End bp | 3529725 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | thiamine biosynthesis/tRNA modification protein ThiI |
Protein accession | YP_003300622 |
Protein GI | 269127252 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0001724 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGTGC TCGACACCGC GGCCGGGCAG ATCACCGGCG GCCCGCCGGC GATCGGTGAG CCGTGCGTGC TGATCAAACT GGGCGAGGTC GTGCTCAAGG GCAAGAACCG CGAGGTGTTC GAGCGGCGGT TGCAGAACAA CCTCCGGTCG GCGGTCCGCG ACATCGCGCC GGTGCGCATC TGGCGCCGCC ACGGCGTGAT GGTGGTGCGC GTGGAACGGG GCGGCGGCGC CGACGTCGCC ACGGTGGACG CGCTGGCCCG GCGGATCACC GATGTGATGG GCATCGTCTG GGTGCACCGG GCCTGGCGGG TCGGCAAGGA CCCCGACAGC GTGGTGGGCG CCGCCCTGGA ACTGATGGCC GGCCGCACCG GCAGCTTCGC GGTGCGCTCC CGCCGCCGCG ACAAGCGCTT CCCGCTGACC TCCACCGAGC TGGACCGCCT GGTCGGCTCC AAGATCGTTG AGGCCTACGG GCTGCCGGTG AAGCTGAAGG AGCCCGAGCA CACCCTGTCG ATCGAGGTCG ACCGCGACGA GGTGTTCGTC TTCACCGACG GGCTGCCCGG CCAGGGCGGG CTGCCGGTGG GCATGAGCGG GAGGGCGCTG GTGCTGCTGT CGGGCGGCAT CGACTCCCCG GTGGCCGCCT ACCGGATGAT GCGCCGCGGG CTGCGCGTGG ACTACCTGCA CTTTTCCGGC ATGCCCTTCA CCGGGCCGGA GTCGATCTAC AAGGCGTACG CGCTGGTGCG CGAGCTGGAC CGCTTCCAGG GCGGCTCGCG GCTGTTCGTG GTGCCCTTCG GCAAGGCCCA GCAGCAGATC AAATCCTCCG GCGCCGACCG GCTGGCGGTG GTCGCCCAGC GCCGCCTGAT GCTGCGCACC GGCGAGATCC TGGCCCGCCG GCTGGGCGGT CTCGCGCTGA TCACCGGCGA CTCCCTGGGC CAGGTCAGCA GCCAGACGCT GGCCAACATG ACCGCCGTGG ACGATGCGGT GGAGCTGCCC ATCCTGCGTC CGCTGGTGGG CATGGACAAG GTCGAGATCA TGGACACCGC GCGCCGCATC GGCACGCTGA CCATCTCCGA GCTGCCCGAC GAGGACTGCT GCACGCTGCT GGCGCCGCGC CGCGCCGAGA CCCGCGCCAA GATCGAGGAC CTGCGGCAGA TCGACCGGCG GCTGGACGCC GAGGAGCTGG CCGAGAAGCT GGCCGACTCC GTCCAGGAGC ACCGTCCCGT CTACGGCGAG GGCAACGCCG CCTGA
|
Protein sequence | MTVLDTAAGQ ITGGPPAIGE PCVLIKLGEV VLKGKNREVF ERRLQNNLRS AVRDIAPVRI WRRHGVMVVR VERGGGADVA TVDALARRIT DVMGIVWVHR AWRVGKDPDS VVGAALELMA GRTGSFAVRS RRRDKRFPLT STELDRLVGS KIVEAYGLPV KLKEPEHTLS IEVDRDEVFV FTDGLPGQGG LPVGMSGRAL VLLSGGIDSP VAAYRMMRRG LRVDYLHFSG MPFTGPESIY KAYALVRELD RFQGGSRLFV VPFGKAQQQI KSSGADRLAV VAQRRLMLRT GEILARRLGG LALITGDSLG QVSSQTLANM TAVDDAVELP ILRPLVGMDK VEIMDTARRI GTLTISELPD EDCCTLLAPR RAETRAKIED LRQIDRRLDA EELAEKLADS VQEHRPVYGE GNAA
|
| |