Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3737 |
Symbol | |
ID | 9157917 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 3854387 |
End bp | 3856087 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_003648654 |
Protein GI | 296141411 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.871211 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCCCT CTGTTTCCGC ACCCGTCGAT ACCGTCACCG TCGGCCCGAT CGAGGGATCC GAGAAGGGTT ATCGCGCTAT CGCGGGAAGC AATGGCACTC TGCGCGTGCC CTTCCGGCGG ATCTCGCTGA CCAACGGCGA CCATCACGAC GTCTACGACA CCTCCGGTCC GTACACCCAG TACGCCTCGG AGGATCAGCT GCACGACCTG CAGGCGGGCC TGCCGAAGAC ACGCGATGAA TGGGCGAAAC CCGAGCCCGT GGCCACCGAG AACGGCGGTG CCGGCGCCCG CACCCAGCTC GCCTGGGCAC GCGCCGGTGT GATCACCGAC GAGATCCGGT TCGTCGCTGC CCGTGAGGGA TTCGACCCTG AGTTCGTCCG TGCCGAGGTG GCCGCCGGTC GCGCGGTGAT CCCGGCGAAC CACAAGCACC CCGAACTGGA ACCCGCGATC ATCGGCAAGG CGTTCGCGGT GAAGATCAAC GCGAACATCG GCAACTCGGC GGTCACCAGC TCGATCGGCG AGGAGGTCGA GAAGATGGTG TGGGCCACCC GCTGGGGCGG CGACACCATC ATGGATCTCT CCACCGGCAA GGACATCCAC GAGACCCGCG AGTGGATCAT GCGGAACTCA CCGGTCCCGG TGGGCACGGT GCCGATCTAC CAGGCCCTGG AGAAGGTCAA GGGCGATCCC ACCAAGCTCA CCTGGGAGAT CTACCGGGAC ACGGTGATCG AGCAGTGCGA GCAGGGCGTC GACTACATGA CCGTGCACGC GGGTGTGCTG CTGCGCTACG TTCCGCTGGC CGCGAACCGG GTGACCGGCA TCGTCAGCCG CGGCGGTTCG ATCATGGCGG CCTGGTGCCT CGCGCACCAC GAGGAGTCGT TCCTGTACAC GCACTTCGGT GAGCTGTGCG AGATCCTCCG CGAGTACGAC GTGACCTTCT CCCTGGGCGA TGGCCTGCGC CCCGGATCCA TCGCGGACGC CAACGACGAG GCGCAGTTCG CCGAACTGCG CACCTTGGGT GAGCTGACGA AGATCGCGAA GTCGTATGGC GTGCAGGTGA TGATCGAGGG CCCCGGACAC GTGCCGATGC ACAAGATCGT GGAGAACGTG CGCCTGGAGG AGGAGCTGTG CGAGGAGGCG CCGTTCTACA CGCTCGGCCC GCTCGCCACC GATATCGCGC CGGCGTACGA CCACATCACC TCGGCCATCG GCGCGGCGAT CATCGCGCAG GCCGGTACGG CGATGCTCTG TTACGTGACG CCGAAGGAGC ACCTGGGCCT GCCGAACCGC GACGATGTGA AGACCGGCGT GATCACGTAC AAGATCGCCG CGCACTCGGC CGATCTCGCG AAGGGCCACC CGGGCGCGCA GTCCCGCGAT GACGAACTCT CCAAGGCGCG CTTCGAGTTC CGCTGGGTGG ACCAGTTCAA CCTATCGCTG GACCCCGACA CCGCCCGTGA GTTCCACGAC GAGACACTGC CCGCCGAACC GGCGAAGACG GCGCACTTCT GCTCGATGTG CGGCCCGAAG TTCTGCTCCA TGCGGATCAG CGCCGATGTG CGCGCCTACG CGCAGGAGCA CAACCTGGTG ACGCAGGAGG ACATCGCCGC CAAGCTCGCC GCCGATATGG CGGAGAAGTC GAAGGAATTC ACCGACGCCG GTGGCCGGGT GTACATCCCG CTCGAGACCG CGCAGCGGTG A
|
Protein sequence | MSPSVSAPVD TVTVGPIEGS EKGYRAIAGS NGTLRVPFRR ISLTNGDHHD VYDTSGPYTQ YASEDQLHDL QAGLPKTRDE WAKPEPVATE NGGAGARTQL AWARAGVITD EIRFVAAREG FDPEFVRAEV AAGRAVIPAN HKHPELEPAI IGKAFAVKIN ANIGNSAVTS SIGEEVEKMV WATRWGGDTI MDLSTGKDIH ETREWIMRNS PVPVGTVPIY QALEKVKGDP TKLTWEIYRD TVIEQCEQGV DYMTVHAGVL LRYVPLAANR VTGIVSRGGS IMAAWCLAHH EESFLYTHFG ELCEILREYD VTFSLGDGLR PGSIADANDE AQFAELRTLG ELTKIAKSYG VQVMIEGPGH VPMHKIVENV RLEEELCEEA PFYTLGPLAT DIAPAYDHIT SAIGAAIIAQ AGTAMLCYVT PKEHLGLPNR DDVKTGVITY KIAAHSADLA KGHPGAQSRD DELSKARFEF RWVDQFNLSL DPDTAREFHD ETLPAEPAKT AHFCSMCGPK FCSMRISADV RAYAQEHNLV TQEDIAAKLA ADMAEKSKEF TDAGGRVYIP LETAQR
|
| |