Gene Tcur_3042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTcur_3042 
Symbol 
ID8604386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermomonospora curvata DSM 43183 
KingdomBacteria 
Replicon accessionNC_013510 
Strand
Start bp3528481 
End bp3529725 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content72% 
IMG OID 
Productthiamine biosynthesis/tRNA modification protein ThiI 
Protein accessionYP_003300622 
Protein GI269127252 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0001724 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGTGC TCGACACCGC GGCCGGGCAG ATCACCGGCG GCCCGCCGGC GATCGGTGAG 
CCGTGCGTGC TGATCAAACT GGGCGAGGTC GTGCTCAAGG GCAAGAACCG CGAGGTGTTC
GAGCGGCGGT TGCAGAACAA CCTCCGGTCG GCGGTCCGCG ACATCGCGCC GGTGCGCATC
TGGCGCCGCC ACGGCGTGAT GGTGGTGCGC GTGGAACGGG GCGGCGGCGC CGACGTCGCC
ACGGTGGACG CGCTGGCCCG GCGGATCACC GATGTGATGG GCATCGTCTG GGTGCACCGG
GCCTGGCGGG TCGGCAAGGA CCCCGACAGC GTGGTGGGCG CCGCCCTGGA ACTGATGGCC
GGCCGCACCG GCAGCTTCGC GGTGCGCTCC CGCCGCCGCG ACAAGCGCTT CCCGCTGACC
TCCACCGAGC TGGACCGCCT GGTCGGCTCC AAGATCGTTG AGGCCTACGG GCTGCCGGTG
AAGCTGAAGG AGCCCGAGCA CACCCTGTCG ATCGAGGTCG ACCGCGACGA GGTGTTCGTC
TTCACCGACG GGCTGCCCGG CCAGGGCGGG CTGCCGGTGG GCATGAGCGG GAGGGCGCTG
GTGCTGCTGT CGGGCGGCAT CGACTCCCCG GTGGCCGCCT ACCGGATGAT GCGCCGCGGG
CTGCGCGTGG ACTACCTGCA CTTTTCCGGC ATGCCCTTCA CCGGGCCGGA GTCGATCTAC
AAGGCGTACG CGCTGGTGCG CGAGCTGGAC CGCTTCCAGG GCGGCTCGCG GCTGTTCGTG
GTGCCCTTCG GCAAGGCCCA GCAGCAGATC AAATCCTCCG GCGCCGACCG GCTGGCGGTG
GTCGCCCAGC GCCGCCTGAT GCTGCGCACC GGCGAGATCC TGGCCCGCCG GCTGGGCGGT
CTCGCGCTGA TCACCGGCGA CTCCCTGGGC CAGGTCAGCA GCCAGACGCT GGCCAACATG
ACCGCCGTGG ACGATGCGGT GGAGCTGCCC ATCCTGCGTC CGCTGGTGGG CATGGACAAG
GTCGAGATCA TGGACACCGC GCGCCGCATC GGCACGCTGA CCATCTCCGA GCTGCCCGAC
GAGGACTGCT GCACGCTGCT GGCGCCGCGC CGCGCCGAGA CCCGCGCCAA GATCGAGGAC
CTGCGGCAGA TCGACCGGCG GCTGGACGCC GAGGAGCTGG CCGAGAAGCT GGCCGACTCC
GTCCAGGAGC ACCGTCCCGT CTACGGCGAG GGCAACGCCG CCTGA
 
Protein sequence
MTVLDTAAGQ ITGGPPAIGE PCVLIKLGEV VLKGKNREVF ERRLQNNLRS AVRDIAPVRI 
WRRHGVMVVR VERGGGADVA TVDALARRIT DVMGIVWVHR AWRVGKDPDS VVGAALELMA
GRTGSFAVRS RRRDKRFPLT STELDRLVGS KIVEAYGLPV KLKEPEHTLS IEVDRDEVFV
FTDGLPGQGG LPVGMSGRAL VLLSGGIDSP VAAYRMMRRG LRVDYLHFSG MPFTGPESIY
KAYALVRELD RFQGGSRLFV VPFGKAQQQI KSSGADRLAV VAQRRLMLRT GEILARRLGG
LALITGDSLG QVSSQTLANM TAVDDAVELP ILRPLVGMDK VEIMDTARRI GTLTISELPD
EDCCTLLAPR RAETRAKIED LRQIDRRLDA EELAEKLADS VQEHRPVYGE GNAA