Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PSPTO_4976 |
Symbol | thiC |
ID | 1186661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas syringae pv. tomato str. DC3000 |
Kingdom | Bacteria |
Replicon accession | NC_004578 |
Strand | - |
Start bp | 5646584 |
End bp | 5648473 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637396297 |
Product | thiamin biosynthesis protein ThiC |
Protein accession | NP_794709 |
Protein GI | 28872090 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTACAA AACCAAAAAA CGCAGCCCAT TTGAGCGAGT CGGCACAAGT CGATTCCGGA TCGGTACAGC CGTTTACCCG TTCGCAGAAA ATCTACGTTC AGGGCTCACG CCCGGACATC CGCGTGCCGA TGCGCGAAGT GACTCTGGAT GTCACGCCGA CGGACTTTGG CGGCGAGATC AACGCGCCGG TTACCGTTTA TGACACGTCA GGCCCGTACA CCGACCCCAA CGTGATCATC GATGTGCGCA AAGGCCTGGC GGATGTTCGC TCGCCGTGGA TCGACTCGCG CAATGACACC GAGCGCCTGC CGGGCTTGAG TTCCCACTTC GGCCAGCAGC GCCTGAGTGA CGCCGAACTG ACCGCGCTGC GCTTTGCCCA TGTGCGCAAT CCGCGTCGCG CCAAGGCGGG CGCCAACGTC AGTCAGATGC ACTATGCGCG TCAGGGGATC ATCACTGCCG AGATGGAATA CGTTGCCATC CGCGAGAACA TGAAACTTCA GGAAGCCCGC GCCGCAGGCC TGCTGACCCA GCAGCACGCC GGGCACAGCT TCGGGGCGAG CATTCCGAAG GAGATCACCG CCGAGTTCGT GCGCGAGGAA ATCGCCCGGG GTCGAGCCAT CATTCCGGCC AACATCAACC ATGTCGAGCT GGAACCGATG ATCATCGGCC GTAACTTCCT GGTGAAGATC AACGGCAACA TCGGCAACAG TGCGCTGGGT TCTTCCATCG AAGAAGAAGT CGCCAAACTG ACCTGGGGCA TTCGCTGGGG TTCGGACACG GTCATGGACC TGTCCACCGG CAAGCACATT CACGAAACCC GCGAATGGAT CATCCGTAAC TCGCCGGTGC CGATCGGTAC GGTGCCGATC TATCAGGCGC TGGAAAAAGT CGGCGGCGCG GCCGAGGATC TGACCTGGGA GCTGTTCCGC GACACGTTGA TCGAACAGGC CGAGCAGGGC GTCGATTACT TCACCATTCA CGCCGGTGTG CTGCTGCGTT ACGTACCGCT GACCGCCAAG CGCGTGACCG GGATTGTCAG CCGTGGTGGT TCGATCATGG CCAAGTGGTG CCTGGCGCAT CACCAGGAAA ACTTCCTGTA CACCCATTTC GAAGACATCT GCGAAATCAT GAAGGCCTAC GACGTCAGCT TCTCGCTGGG CGATGGCTTG CGCCCCGGCT CGATTGCCGA CGCCAACGAT GCGGCGCAGT TCGGTGAGCT GGAAACCCTC GGCGAACTGA CCAAGATTGC CTGGAAACAC GACTTGCAGA CCATGATCGA AGGCCCAGGC CACGTGCCGA TGCAGTTGAT CAAAGAGAAC ATGGACAAGC AGCTGGAGTG CTGCGACGAA GCGCCGTTCT ACACCCTCGG CCCGCTGACC ACTGATATTG CACCGGGTTA TGACCACATC ACCTCGGGCA TCGGCGCGGC GATGATCGGC TGGTTCGGTT GCGCCATGCT GTGTTATGTC ACACCCAAGG AACACCTGGG TTTGCCGAAC AAGGATGACG TGAAGACCGG CATCATCACC TACAAGATCG CGGCCCACGC AGCGGACCTG GCCAAAGGGC ATCCTGGCGC GCAGATTCGC GACAACGCGC TGAGCAAGGC GCGTTTCGAG TTCCGCTGGG AAGACCAGTT CAACCTCGGC CTGGACCCGG ACACTGCGCG TTCATACCAC GATGAAACCC TGCCCAAGGA CTCGGCCAAG GTCGCGCATT TCTGCTCCAT GTGCGGGCCG AAATTCTGCT CGATGAAAAT CACCCAGGAA GTACGTGAAT ACGCCGCCAA TCAGCGCATT GAAGCAGTGG ATGTCGACGT CGCCAGGGGC CTTGCCGAAC AGGCCGAGCG CTTCAAGCAG GAAGGCAGCC AGTTGTACAA GAAGGTGTAG
|
Protein sequence | MSTKPKNAAH LSESAQVDSG SVQPFTRSQK IYVQGSRPDI RVPMREVTLD VTPTDFGGEI NAPVTVYDTS GPYTDPNVII DVRKGLADVR SPWIDSRNDT ERLPGLSSHF GQQRLSDAEL TALRFAHVRN PRRAKAGANV SQMHYARQGI ITAEMEYVAI RENMKLQEAR AAGLLTQQHA GHSFGASIPK EITAEFVREE IARGRAIIPA NINHVELEPM IIGRNFLVKI NGNIGNSALG SSIEEEVAKL TWGIRWGSDT VMDLSTGKHI HETREWIIRN SPVPIGTVPI YQALEKVGGA AEDLTWELFR DTLIEQAEQG VDYFTIHAGV LLRYVPLTAK RVTGIVSRGG SIMAKWCLAH HQENFLYTHF EDICEIMKAY DVSFSLGDGL RPGSIADAND AAQFGELETL GELTKIAWKH DLQTMIEGPG HVPMQLIKEN MDKQLECCDE APFYTLGPLT TDIAPGYDHI TSGIGAAMIG WFGCAMLCYV TPKEHLGLPN KDDVKTGIIT YKIAAHAADL AKGHPGAQIR DNALSKARFE FRWEDQFNLG LDPDTARSYH DETLPKDSAK VAHFCSMCGP KFCSMKITQE VREYAANQRI EAVDVDVARG LAEQAERFKQ EGSQLYKKV
|
| |