Gene Information |
Locus tag | Arth_2487 |
Symbol | |
ID | 4445076 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 2781177 |
End bp | 2783057 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639690302 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_831966 |
Protein GI | 116671033 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
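The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene ends with a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, but both are consistent with the values shown. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2781177
END_BP = 2783057
GENE_LENGTH_BP = 1881
PROTEIN_LENGTH_AA = 626


def span_length(start: int, end: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive (assumed)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Both checks pass for this locus: 2783057 - 2781177 + 1 = 1881 bp, and 1881 / 3 - 1 = 626 aa.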
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.320621 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATACAC AAGAAACACA GCTGATCCCT GCCCAAAACG AGGCCGCCCC CGGCAATTCG GCACCTGCCG AAGCGCAGTC CCTGAAGTCG CACTCGCTGG CATTCATCAC CGATGAAGCC ACCGGGATCC GGGTGCCGGT GACCGAAATC GCCCTTGAGG ACTCACCGGG CGGGGCAGCC AACCCGCCGT TCCGCGTGTA CCGGACTGCC GGGCCCGGCA GCGATCCCGT GGTGGGCCTG GAACCGTTCA GGACCCGGTG GATCGAGTCG CGGGCCGACA CCGAGCCTTA TGGCGGCCGG GAACGGAACC TGCTCGACGA CGGCCGGTCG GCCGTGCGCC GCGGCGCCGC ATCCGCGGAG TGGAAGGGCG CGCAGCCCGT GCCCCGCCGC GCCGTCGAAG GCAGGACTGT CACGCAGATG CACTACGCCC GGCAGGGTGT CGTGACGCCG GAGATGCAGT TCGTTGCCCT CCGCGAAAAC TGCGACGTGG AGCTGGTCCG CAGCGAAGTG GCTGCCGGCC GCGCCATCAT CCCCAACAAC ATCAACCACC CGGAGTCCGA ACCGATGATC ATCGGCAAGG CCTTCCTGGT GAAGATCAAC GCCAACATCG GCAACTCGGC CGTCACGAGC TCCATCGCGG AGGAGGTCGA CAAGCTGCAG TGGGCCACAC TGTGGGGCGC CGACACCGTG ATGGACCTGT CCACCGGCGA TGACATCCAC ACCACCCGTG AGTGGATCAT CCGCAACTCC CCCGTGCCGA TCGGCACCGT TCCCATCTAC CAGGCACTGG AAAAGGTCAA CGGCGAGGCC AACAAACTGA CGTGGGAAAT TTTCCGCGAC ACCGTGATCG AGCAGTGCGA GCAGGGCGTG GACTACATGA CCATCCACGC CGGCGTGCTG CTGCGGTATG TGCCCCTGAC CGCCAACCGG GTGACCGGCA TCGTCTCCCG CGGCGGCTCC ATCATGGCCG GCTGGTGCCT TGCCCACCAC CAGGAGAACT TCCTGTACAC GCACTTCGAC GAGCTGTGCG AAATTTTCGC CAAGTACGAC GTCGCGTTTT CGCTGGGCGA CGGGCTGCGG CCCGGTGCGA CGGCGGACGC GAACGACGCC GCCCAGTTCG CCGAGCTGGA TACCCTGGCC GAACTGACGC AGCGCGCCTG GGAGTTCGAC GTGCAGGTGA TGGTGGAAGG ACCCGGCCAC GTGCCGTTCC ACCTGGTCCG TGAAAACGTG GAACGCCAGC AGGAACTCTG CAAGGGAGCA CCGTTCTACA CGCTGGGGCC GCTGGTCACG GACATAGCCC CGGGCTACGA CCACATCACC TCCGCCATCG GCGCCACGGA AATCGCCCGC TACGGCACGG CCATGCTCTG CTACGTCACG CCCAAGGAAC ACCTGGGGCT GCCAAACAAG GACGATGTCA AGACAGGCGT CATCACCTAC AAGATCGCCG CCCACGCCGC CGACCTCGCC AAGGGCCACC CCGGCGCGCA CCAACGCGAC GACGCCCTGT CCAAGGCCCG GTTCGAATTC CGCTGGCGGG ACCAGTTCGC CCTCTCGCTG GACCCGGTCA CCGCCGAATC CTTCCATGAC GAGACGCTGC CCGCGGAGCC AGCCAAGACC GCGCACTTCT GCTCCATGTG CGGGCCCAAG TTCTGCTCAA TGCGCATCAG CCAGGACATC AGGGACGAGT ACGGTTCCGC CGAGGCACAG TCGGCACTCG CCGAGATGGC GGCAGGCATG CGTGAAAAGA GCAACGAATT CCTCGCAGCC GGCGGCAAGG TCTACCTACC CGAGCTGCAG 
CTTCCAGACC CGGAACGACC GGGCCGGCAC GGTGCAGCGA CGGGCGACGC TACGACGCCC GTGAGTGCTG ACGCCTGCTG A
|
Protein sequence | MNTQETQLIP AQNEAAPGNS APAEAQSLKS HSLAFITDEA TGIRVPVTEI ALEDSPGGAA NPPFRVYRTA GPGSDPVVGL EPFRTRWIES RADTEPYGGR ERNLLDDGRS AVRRGAASAE WKGAQPVPRR AVEGRTVTQM HYARQGVVTP EMQFVALREN CDVELVRSEV AAGRAIIPNN INHPESEPMI IGKAFLVKIN ANIGNSAVTS SIAEEVDKLQ WATLWGADTV MDLSTGDDIH TTREWIIRNS PVPIGTVPIY QALEKVNGEA NKLTWEIFRD TVIEQCEQGV DYMTIHAGVL LRYVPLTANR VTGIVSRGGS IMAGWCLAHH QENFLYTHFD ELCEIFAKYD VAFSLGDGLR PGATADANDA AQFAELDTLA ELTQRAWEFD VQVMVEGPGH VPFHLVRENV ERQQELCKGA PFYTLGPLVT DIAPGYDHIT SAIGATEIAR YGTAMLCYVT PKEHLGLPNK DDVKTGVITY KIAAHAADLA KGHPGAHQRD DALSKARFEF RWRDQFALSL DPVTAESFHD ETLPAEPAKT AHFCSMCGPK FCSMRISQDI RDEYGSAEAQ SALAEMAAGM REKSNEFLAA GGKVYLPELQ LPDPERPGRH GAATGDATTP VSADAC
|
| |
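The gene sequence begins with GTG while the protein sequence begins with M. This is expected under translation table 11, where GTG can act as an initiation codon and is read as methionine at the first position. The following is a minimal sketch of that behaviour using only the first 30 bases of the gene sequence. The codon table is the standard one, which table 11 shares for amino-acid assignments, and the start-codon set is a deliberately partial list. The helper name `translate` is hypothetical and does not refer to any database tool.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Partial set of initiation codons accepted by table 11 (assumption: a subset, not the full list).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading a recognized start codon at position 0 as M."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # the initiator codon is read as methionine
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    prefix = "GTGAATACAC AAGAAACACA GCTGATCCCT"
    print(translate(prefix))  # MNTQETQLIP
```

The output matches the first ten residues of the protein sequence listed in the record.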