Gene Dtpsy_2971 details

Gene Information

Locus tag: Dtpsy_2971
Symbol:
ID: 7385416
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidovorax ebreus TPSY
Kingdom: Bacteria
Replicon accession: NC_011992
Strand:
Start bp: 3162017
End bp: 3163861
Gene Length: 1845 bp
Protein Length: 614 aa
Translation table: 11
GC content: 66%
IMG OID: 643656281
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_002554404
Protein GI: 222112140
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
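
The coordinate and length fields above are mutually consistent, which a short Python sketch can check. It assumes the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; both assumptions are inferred from the numbers in this record rather than stated by it, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3162017
end_bp = 3163861

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # 1845 0 614
```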


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.103817
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGCCC CCGACAAGTT CGCCAGCCTG CTTGCGCTCA CGCGCGAACC CTTTCCCGCT 
TCCACCAAGT CCTACCTCGC CGGCAGCCAA CCGGGGCTGC GCGTGCCGGT GCGCGACATT
CAGCTCACCA ACGGCGAAGT GGTGAGCGTG TACGACACGT CCGGCCCCTA TACCGATCCT
GCCGTGCAGA TCGACGTGCG CAAGGGCCTT GCGAGCGTGC GGGGCGAATG GATTGCCGCG
CGCGGCGACA CCGAGGGCTA TGAGGGTCGC GTACGCAAGG CGCTGGACGA CGGCCAGAAG
GCCGAGGATG GCGACCGCCT GGCCCAGCTG CGCGCCGAGG CTGCGGCGCT GCAGCGCCAG
CCGCTGCGCG CCAGGAGCGG CGCCAACGTC ACGCAGATGC ACTACGCGAA GAAGGGCATC
GTCACTCCCG AGATGGAATA CGTGGCCTTG CGCGAGAACG GTCGCCGCGA GTGGATGCAG
CAATACATGC AGGACGCCGC GCGCGAGCAG CGCCTGGCCG GCAACCCACT GGGTGCGAGC
ATTCCGAAAA TCATCACGCC CGAGTTCGTG CGCGACGAGG TCGCCCGTGG CCGCGCCATC
ATTCCCGCCA ACATCAACCA CCCCGAAGTG GAGCCCATGG CCATCGGGCG CAACTTCAAG
GTGAAGATCA ACGCCAACAT CGGCAACTCC GCCGTCACGT CGAGCATCGA GGAAGAGGTG
GAGAAGCTCG TCTGGGCCAT CCGCTGGGGC GCCGACAACG TGATGGACTT GTCCACCGGC
AAGAACATCC ACACCACGCG CGACTGGATC GTGCGCAACT CGCCCGTGCC CATCGGCACG
GTGCCTATCT ACCAGGCGCT GGAAAAGGTC GGCGGCATTG CCGAGGACCT GACCTGGGAG
ATCTTCCGCG ACACGCTGAT CGAGCAGGCC GAGCAGGGCG TGGACTATTT CACCATCCAC
GCGGGCGTGC GCCTGGCCTA CATCCAGCTC ACCGCCGCGC GCCGCACGGG CATCGTGTCC
CGTGGCGGCT CCATCATGGC CAAGTGGTGC ATGGCGCACC ACAAGGAGAG CTTCCTCTAC
ACACACTTCG AGGACATCTG CGACATCATG AAGGCGTACG ACGTGGCCTT CAGCCTGGGT
GATGGCCTGC GTCCGGGCTG CGCCTCGGAC GCCAACGACG AAGCCCAGTT TGCCGAGCTG
CACACGCTGG GCGAGCTGAC GCAGATTGCC TGGAAGCACG ACGTGCAGAC CATGATCGAA
GGCCCCGGCC ACGTGCCCAT GCACATGATC CAGGCCAACA TGACGGAGCA GCTCAAGACC
TGCCACGAGG CGCCGTTCTA CACCCTGGGC CCGCTGACCA TCGACATCGC CCCCGGCTAC
GACCACATCG CCAGCGCCAT CGGTGCCGCC ATGATCGGCT GGATGGGCAC GGCCATGCTG
TGCTACGTGA CGCCCAAGGA GCACCTGGGC CTGCCCGACC GCGACGATGT CAAGCAGGGC
ATCATTGCCT ACAAGATCGC GGCCCACGCG GCCGACGTCG CCAAGGGGCA TCCGGGTGCC
CGTGCGCGCG ACGACGCGCT GAGCCAGGCG CGGTTCGACT TCCGCTGGCA GGACCAGTTC
AACCTGGGCC TGGACCCCGA TACGGCCAAG GAATACCACG ACGAGACCCT GCCCAAGGAC
AGCGCCAAGG TGGCGCACTT CTGCTCCATG TGCGGGCCGA AGTTCTGCTC GATGAAGATC
ACGCAGGAAG TGCGCGAATT CGCCCAACAG GGCCTGCAGT CCAAGGCCGA GGAGTTCAAC
CGCACGGGCG GCGAGCTCTA CGTGCCCATC CACCGCGCCG ACTGA
 
Protein sequence
MNAPDKFASL LALTREPFPA STKSYLAGSQ PGLRVPVRDI QLTNGEVVSV YDTSGPYTDP 
AVQIDVRKGL ASVRGEWIAA RGDTEGYEGR VRKALDDGQK AEDGDRLAQL RAEAAALQRQ
PLRARSGANV TQMHYAKKGI VTPEMEYVAL RENGRREWMQ QYMQDAAREQ RLAGNPLGAS
IPKIITPEFV RDEVARGRAI IPANINHPEV EPMAIGRNFK VKINANIGNS AVTSSIEEEV
EKLVWAIRWG ADNVMDLSTG KNIHTTRDWI VRNSPVPIGT VPIYQALEKV GGIAEDLTWE
IFRDTLIEQA EQGVDYFTIH AGVRLAYIQL TAARRTGIVS RGGSIMAKWC MAHHKESFLY
THFEDICDIM KAYDVAFSLG DGLRPGCASD ANDEAQFAEL HTLGELTQIA WKHDVQTMIE
GPGHVPMHMI QANMTEQLKT CHEAPFYTLG PLTIDIAPGY DHIASAIGAA MIGWMGTAML
CYVTPKEHLG LPDRDDVKQG IIAYKIAAHA ADVAKGHPGA RARDDALSQA RFDFRWQDQF
NLGLDPDTAK EYHDETLPKD SAKVAHFCSM CGPKFCSMKI TQEVREFAQQ GLQSKAEEFN
RTGGELYVPI HRAD
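
The sequences above are printed in blocks of ten characters. As a minimal sketch, the Python below removes the block spacing and translates the first row of the gene sequence. The helper names `unblock`, `translate`, and `gc_fraction` are hypothetical. The codon table is the standard one, whose codon assignments translation table 11 shares (table 11 differs in its permitted start codons).

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def unblock(text):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()

def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)

# First row of the gene sequence, copied from the listing above.
first_row = "ATGAATGCCC CCGACAAGTT CGCCAGCCTG CTTGCGCTCA CGCGCGAACC CTTTCCCGCT"
dna = unblock(first_row)
print(translate(dna))  # MNAPDKFASLLALTREPFPA
```

The output matches the first twenty residues of the protein sequence. Passing the full gene sequence through the same helpers should reproduce the listed protein, with translation stopping at the terminal TGA codon.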