Gene Ajs_3661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAjs_3661 
Symbol 
ID4674184 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax sp. JS42 
KingdomBacteria 
Replicon accessionNC_008782 
Strand
Start bp3871696 
End bp3873540 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content66% 
IMG OID639840693 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_987848 
Protein GI121595952 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGCCC CCGACAAGTT CGCCAGCCTG CTTGCGCTCA CGCGCGAACC CTTTCCCGCT 
TCCACCAAGT CCTACCTCGC CGGCAGCCAG CCCGGGCTGC GCGTGCCGGT GCGCGACATT
CAGCTCACCA ACGGCGAAGT GGTGAGCGTG TACGACACGT CCGGCCCCTA TACCGATCCT
GCCGTGCAGA TCGACGTGCG CAAGGGCCTT GCGAGCGTGC GGGGCGAATG GATTGCCGCG
CGCGGTGACA CCGAGGGCTA TGAGGGTCGC GTGCGCAAGG CGCTGGACGA CGGCCAGAAG
GCCGAGGATG GCGACCGCCT GGCCCAGCTG CGCGCCGAGG CTGCGGCGCT GCAGCGCCAG
CCGCTGCGCG CCAGGAGCGG CGCCAACGTC ACGCAGATGC ACTACGCGAA GAAGGGCATC
GTCACTCCCG AGATGGAATA CGTGGCCTTG CGCGAGAACG GTCGCCGTGA GTGGATGCAG
CAATACATGC AGGACACCGC GCGCGAGCAG CGCCTGGCCG GCAACCCGCT GGGTGCGAGC
ATCCCGAAAA TCATCACGCC CGAGTTCGTG CGCGACGAGG TCGCCCGTGG CCGCGCCATC
ATTCCCGCCA ACATCAACCA CCCCGAAGTG GAGCCCATGG CCATCGGGCG CAACTTCAAG
GTGAAGATCA ACGCCAACAT CGGCAACTCC GCCGTCACGT CGAGCATCGA GGAAGAGGTG
GAGAAGCTCG TCTGGGCCAT CCGCTGGGGC GCCGACAACG TGATGGACCT GTCCACCGGC
AAGAACATCC ACACCACGCG CGACTGGATC GTGCGCAACT CGCCCGTGCC CATCGGCACG
GTGCCTATCT ACCAGGCGCT GGAAAAGGTC GGCGGCATTG CCGAGGACCT GACCTGGGAG
ATCTTCCGCG ACACGCTGAT CGAGCAGGCC GAGCAGGGCG TGGACTATTT CACCATCCAC
GCAGGCGTGC GCCTGGCCTA CATCCAGCTC ACCGCCGCGC GCCGCACGGG CATCGTGTCC
CGTGGCGGCT CCATCATGGC CAAGTGGTGC ATGGCGCACC ACAAGGAGAG CTTCCTCTAC
ACGCACTTCG AGGACATCTG CGACATCATG AAGGCGTACG ACGTGTCGTT CAGCCTGGGT
GATGGCCTGC GTCCGGGCTG CGCCTCGGAC GCCAACGACG AAGCCCAGTT TGCCGAGCTG
CACACGCTGG GCGAGCTGAC GCAGATTGCC TGGAAGCACG ACGTGCAGAC CATGATCGAA
GGCCCCGGCC ACGTGCCCAT GCACATGATC CAGGCCAACA TGACGGAGCA GCTCAAGACC
TGCCACGAGG CGCCGTTCTA CACCCTGGGC CCGCTGACCA TCGACATCGC CCCCGGCTAC
GACCACATCG CCAGCGCCAT CGGTGCCGCC ATGATCGGCT GGATGGGCAC GGCCATGCTG
TGCTACGTGA CGCCCAAGGA GCACCTGGGC CTGCCCGACC GCGACGATGT CAAGCAGGGC
ATCATTGCCT ACAAGATTGC CGCGCACGCG GCCGACGTGG CCAAGGGCCA CCCGGGCGCC
CGCGCGCGTG ACGATGCGCT GAGCCAGGCG CGGTTCGACT TCCGCTGGCA GGACCAGTTC
AACCTGGGCC TGGACCCCGA TACGGCCAAG GAGTACCACG ACGAGACCCT GCCCAAGGAC
AGCGCCAAGG TGGCGCACTT CTGCTCCATG TGCGGGCCGA AGTTCTGCTC GATGAAGATC
ACGCAGGAAG TGCGCGAATT CGCCCAACAG GGCCTGCAGT CCAAGGCCGA GGAGTTCAAC
CGCACGGGCG GCGAGCTCTA CGTGCCCATC CACCGCGCCG ACTGA
 
Protein sequence
MNAPDKFASL LALTREPFPA STKSYLAGSQ PGLRVPVRDI QLTNGEVVSV YDTSGPYTDP 
AVQIDVRKGL ASVRGEWIAA RGDTEGYEGR VRKALDDGQK AEDGDRLAQL RAEAAALQRQ
PLRARSGANV TQMHYAKKGI VTPEMEYVAL RENGRREWMQ QYMQDTAREQ RLAGNPLGAS
IPKIITPEFV RDEVARGRAI IPANINHPEV EPMAIGRNFK VKINANIGNS AVTSSIEEEV
EKLVWAIRWG ADNVMDLSTG KNIHTTRDWI VRNSPVPIGT VPIYQALEKV GGIAEDLTWE
IFRDTLIEQA EQGVDYFTIH AGVRLAYIQL TAARRTGIVS RGGSIMAKWC MAHHKESFLY
THFEDICDIM KAYDVSFSLG DGLRPGCASD ANDEAQFAEL HTLGELTQIA WKHDVQTMIE
GPGHVPMHMI QANMTEQLKT CHEAPFYTLG PLTIDIAPGY DHIASAIGAA MIGWMGTAML
CYVTPKEHLG LPDRDDVKQG IIAYKIAAHA ADVAKGHPGA RARDDALSQA RFDFRWQDQF
NLGLDPDTAK EYHDETLPKD SAKVAHFCSM CGPKFCSMKI TQEVREFAQQ GLQSKAEEFN
RTGGELYVPI HRAD