Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtpsy_2971 |
Symbol | |
ID | 7385416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidovorax ebreus TPSY |
Kingdom | Bacteria |
Replicon accession | NC_011992 |
Strand | - |
Start bp | 3162017 |
End bp | 3163861 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643656281 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_002554404 |
Protein GI | 222112140 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.103817 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGCCC CCGACAAGTT CGCCAGCCTG CTTGCGCTCA CGCGCGAACC CTTTCCCGCT TCCACCAAGT CCTACCTCGC CGGCAGCCAA CCGGGGCTGC GCGTGCCGGT GCGCGACATT CAGCTCACCA ACGGCGAAGT GGTGAGCGTG TACGACACGT CCGGCCCCTA TACCGATCCT GCCGTGCAGA TCGACGTGCG CAAGGGCCTT GCGAGCGTGC GGGGCGAATG GATTGCCGCG CGCGGCGACA CCGAGGGCTA TGAGGGTCGC GTACGCAAGG CGCTGGACGA CGGCCAGAAG GCCGAGGATG GCGACCGCCT GGCCCAGCTG CGCGCCGAGG CTGCGGCGCT GCAGCGCCAG CCGCTGCGCG CCAGGAGCGG CGCCAACGTC ACGCAGATGC ACTACGCGAA GAAGGGCATC GTCACTCCCG AGATGGAATA CGTGGCCTTG CGCGAGAACG GTCGCCGCGA GTGGATGCAG CAATACATGC AGGACGCCGC GCGCGAGCAG CGCCTGGCCG GCAACCCACT GGGTGCGAGC ATTCCGAAAA TCATCACGCC CGAGTTCGTG CGCGACGAGG TCGCCCGTGG CCGCGCCATC ATTCCCGCCA ACATCAACCA CCCCGAAGTG GAGCCCATGG CCATCGGGCG CAACTTCAAG GTGAAGATCA ACGCCAACAT CGGCAACTCC GCCGTCACGT CGAGCATCGA GGAAGAGGTG GAGAAGCTCG TCTGGGCCAT CCGCTGGGGC GCCGACAACG TGATGGACTT GTCCACCGGC AAGAACATCC ACACCACGCG CGACTGGATC GTGCGCAACT CGCCCGTGCC CATCGGCACG GTGCCTATCT ACCAGGCGCT GGAAAAGGTC GGCGGCATTG CCGAGGACCT GACCTGGGAG ATCTTCCGCG ACACGCTGAT CGAGCAGGCC GAGCAGGGCG TGGACTATTT CACCATCCAC GCGGGCGTGC GCCTGGCCTA CATCCAGCTC ACCGCCGCGC GCCGCACGGG CATCGTGTCC CGTGGCGGCT CCATCATGGC CAAGTGGTGC ATGGCGCACC ACAAGGAGAG CTTCCTCTAC ACACACTTCG AGGACATCTG CGACATCATG AAGGCGTACG ACGTGGCCTT CAGCCTGGGT GATGGCCTGC GTCCGGGCTG CGCCTCGGAC GCCAACGACG AAGCCCAGTT TGCCGAGCTG CACACGCTGG GCGAGCTGAC GCAGATTGCC TGGAAGCACG ACGTGCAGAC CATGATCGAA GGCCCCGGCC ACGTGCCCAT GCACATGATC CAGGCCAACA TGACGGAGCA GCTCAAGACC TGCCACGAGG CGCCGTTCTA CACCCTGGGC CCGCTGACCA TCGACATCGC CCCCGGCTAC GACCACATCG CCAGCGCCAT CGGTGCCGCC ATGATCGGCT GGATGGGCAC GGCCATGCTG TGCTACGTGA CGCCCAAGGA GCACCTGGGC CTGCCCGACC GCGACGATGT CAAGCAGGGC ATCATTGCCT ACAAGATCGC GGCCCACGCG GCCGACGTCG CCAAGGGGCA TCCGGGTGCC CGTGCGCGCG ACGACGCGCT GAGCCAGGCG CGGTTCGACT TCCGCTGGCA GGACCAGTTC AACCTGGGCC TGGACCCCGA TACGGCCAAG GAATACCACG ACGAGACCCT GCCCAAGGAC AGCGCCAAGG TGGCGCACTT CTGCTCCATG TGCGGGCCGA AGTTCTGCTC GATGAAGATC ACGCAGGAAG TGCGCGAATT CGCCCAACAG GGCCTGCAGT CCAAGGCCGA GGAGTTCAAC CGCACGGGCG GCGAGCTCTA CGTGCCCATC CACCGCGCCG ACTGA
|
Protein sequence | MNAPDKFASL LALTREPFPA STKSYLAGSQ PGLRVPVRDI QLTNGEVVSV YDTSGPYTDP AVQIDVRKGL ASVRGEWIAA RGDTEGYEGR VRKALDDGQK AEDGDRLAQL RAEAAALQRQ PLRARSGANV TQMHYAKKGI VTPEMEYVAL RENGRREWMQ QYMQDAAREQ RLAGNPLGAS IPKIITPEFV RDEVARGRAI IPANINHPEV EPMAIGRNFK VKINANIGNS AVTSSIEEEV EKLVWAIRWG ADNVMDLSTG KNIHTTRDWI VRNSPVPIGT VPIYQALEKV GGIAEDLTWE IFRDTLIEQA EQGVDYFTIH AGVRLAYIQL TAARRTGIVS RGGSIMAKWC MAHHKESFLY THFEDICDIM KAYDVAFSLG DGLRPGCASD ANDEAQFAEL HTLGELTQIA WKHDVQTMIE GPGHVPMHMI QANMTEQLKT CHEAPFYTLG PLTIDIAPGY DHIASAIGAA MIGWMGTAML CYVTPKEHLG LPDRDDVKQG IIAYKIAAHA ADVAKGHPGA RARDDALSQA RFDFRWQDQF NLGLDPDTAK EYHDETLPKD SAKVAHFCSM CGPKFCSMKI TQEVREFAQQ GLQSKAEEFN RTGGELYVPI HRAD
|
| |