Gene Information |
Locus tag | Achl_2227 |
Symbol | |
ID | 7293695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 2496367 |
End bp | 2498184 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643590629 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_002488281 |
Protein GI | 220912972 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
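Note: the Start bp, End bp, Gene Length, and Protein Length fields above agree with one another. A minimal Python sketch of that check follows. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2496367
end_bp = 2498184

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the Gene Length field
print(protein_length, "aa")   # compare with the Protein Length field
```

Under these assumptions the computed values match the 1818 bp and 605 aa reported in the record.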
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.000000000308788 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGAATACAC CCAATGCAGT GCTGCCCCCT GCCCAAAACC ACCCTGCCGG AGCCACGCCT GAAGCACCGG TCACCCAGTC CCTGAAGTCC CACTCGCTGG CCTACATCGA TGATCCGCAG CACGGCATCC GTGTCCCGGT AACGGAGATC GCGCTGGAGC CCTCGCCCAA CGGCCAGCCG AACGCCCCTT TCCGCACGTA CCGGACAGCG GGCCCGGGAA GCGACCCCGT CCGGGGCCTC AGCCCTTTCC GGTCGGGATG GATTGAAGGG CGGGATGACA CGGAAGAGTA CAGCGGAAGG GCACGGAACC TGCTCGACGA CGGCCGCTCG GCTGTGCGCC GCGGCGCCGC CTCTGCGGAA TGGAAAGGCG GGCGCCCGGT GCCCCGCCGC GCCGTCGACG GCCGGACAGT CACCCAGATG CACTACGCCC GGAAGGGCGT GGTGACGCCG GAAATGCGGT TCGTGGCGCT GCGGGAAAAC TGTGATCCGG AACTGGTCCG GAGTGAGGTT GCGGCTGGAC GGGCCATTAT CCCCGCCAAT ATCAACCATC CGGAGTCCGA ACCGATGATC ATCGGCAAGG CTTTCCTGGT GAAGATCAAC GCCAACATCG GCAACTCCGC CGTCACCAGT TCCATCAGGG AGGAGGTGGA CAAGCTGCAG TGGGCCACCC GGTGGGGTGC CGACACGGTG ATGGACCTTT CCACGGGCGA CGACATCCAC ACCACCAGGG AATGGCTCAT CCGCAATTCC CCCGTGCCGA TCGGCACCGT GCCCATCTAC CAGGCCCTGG AAAAGGTCAA CGGCGAGGCG AACGCACTGA CGTGGGAAAT CTTCCGCGAC ACCGTCATAG AACAATGTGA ACAGGGCGTG GACTATATGA CGGTCCACGC CGGCGTCCTG CTCCGCTACG TGCCGCTGAC CGCCAACAGG GTCACCGGCA TCGTGTCCCG GGGCGGGTCC ATCATGGCGG GATGGTGCCT GGCGCACCAC CAGGAGAACT TCCTGTACAC GCATTTCGAT GAGCTGTGTG AGATCTTCGC CAAGTACGAT GTCGCGTTCT CGCTGGGTGA CGGCCTGCGC CCCGGTGCCA CCGCCGATGC CAACGACGCT GCGCAGTTCG CGGAACTCGA CACGCTGGCT GAGCTTACGG ACCGCGCCTG GAAACATGAC GTGCAGGTCA TGGTGGAAGG GCCGGGCCAC GTCCCGTTTC ATCTGGTGCG GGAGAACGTG GAGCGCCAGC AGCAACTGTG CAAGGGCGCG CCGTTCTATA CGCTGGGGCC GCTGGTCACC GATGTGGCCC CAGGCTACGA CCACATCACC TCAGCCATCG GCGCCACCGA GATCGCGCGC TACGGCACCG CCATGCTCTG CTACGTCACA CCCAAGGAGC ATCTGGGACT GCCCGACAGG GACGATGTGA AGACCGGAGT CATCACCTAC AAAATCGCCG CGCACGCCGC CGACCTGGCC AAGGGCCACC CCGGGGCGCA CGAACGGGAC GACGCCCTGT CCAAGGCCAG GTTCGAGTTC CGCTGGCGGG ACCAGTTTGC CCTTTCACTT GACCCCGAGA CGGCCGAAGC CTTCCATGAT GAAACCCTTC CGGCCGAGCC CGCCAAGACG GCACACTTCT GCTCCATGTG CGGGCCGAAG TTCTGCTCGA TGCGGATCAG CCAGGACATC CGCAATGAGT ACGGATCCGC GGAAGCCCAG GCCGCGATAG CCGAAGCGGC ATCCGGGATG CGGGAGAAGA GCCAGGAGTT CCTGGAATCC GGCGGCAAGG TGTACCTTCC CGAGCTGAAA 
GTCCCGGCCG GCAGCTAA
|
Protein sequence | MNTPNAVLPP AQNHPAGATP EAPVTQSLKS HSLAYIDDPQ HGIRVPVTEI ALEPSPNGQP NAPFRTYRTA GPGSDPVRGL SPFRSGWIEG RDDTEEYSGR ARNLLDDGRS AVRRGAASAE WKGGRPVPRR AVDGRTVTQM HYARKGVVTP EMRFVALREN CDPELVRSEV AAGRAIIPAN INHPESEPMI IGKAFLVKIN ANIGNSAVTS SIREEVDKLQ WATRWGADTV MDLSTGDDIH TTREWLIRNS PVPIGTVPIY QALEKVNGEA NALTWEIFRD TVIEQCEQGV DYMTVHAGVL LRYVPLTANR VTGIVSRGGS IMAGWCLAHH QENFLYTHFD ELCEIFAKYD VAFSLGDGLR PGATADANDA AQFAELDTLA ELTDRAWKHD VQVMVEGPGH VPFHLVRENV ERQQQLCKGA PFYTLGPLVT DVAPGYDHIT SAIGATEIAR YGTAMLCYVT PKEHLGLPDR DDVKTGVITY KIAAHAADLA KGHPGAHERD DALSKARFEF RWRDQFALSL DPETAEAFHD ETLPAEPAKT AHFCSMCGPK FCSMRISQDI RNEYGSAEAQ AAIAEAASGM REKSQEFLES GGKVYLPELK VPAGS
|
| |
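Note: the gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, in which TTG can serve as an initiation codon and is then read as methionine. The sketch below is a hand-rolled illustration, not the method used to produce this record. The `translate` function and the `START_CODONS` set are hypothetical helpers, and the set lists only three commonly used bacterial start codons rather than the full table 11 list.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional T, C, A, G ordering.
# Table 11 uses the same assignments as the standard code for elongation.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a subset of table 11 initiation codons, enough for this gene.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading a recognized first codon as Met."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence in this record.
head = translate("TTGAATACAC CCAATGCAGT GCTGCCCCCT")
tail = translate("GTCCCGGCCG GCAGCTAA")
print(head, tail)
```

The two fragments correspond to the first ten and last five residues of the protein sequence listed above (MNTPNAVLPP and VPAGS).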