Gene Information |
Locus tag | Avin_44600 |
Symbol | thiC |
ID | 7763331 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4510561 |
End bp | 4512447 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643807312 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_002801553 |
Protein GI | 226946480 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
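The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the values listed here but are not stated by the record. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Avin_44600 (thiC)
length = gene_length(4510561, 4512447)
print(length)                  # 1887
print(protein_length(length))  # 628
```

Both results agree with the Gene Length (1887 bp) and Protein Length (628 aa) fields. The strand does not affect this arithmetic, because the start and end coordinates are given on the replicon regardless of orientation.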
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.168262 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGTGA CAGAACAACA GAAGAACCTG AGCGAGTCCG CCCAGGTCGA CCAGCAGTCC GTACAGCCCT TCCCGCGTTC GCGGAAGATC TACGTCACCG GCTCGCGCCC GGACATCCGC GTGCCGATGC GCGAGATCGG CCTGGACGTG ACACCCACCG CGTTCGGCGG CGAGATCAAC CCGCCGGTCA CCGTCTACGA CACCTCCGGC CCCTACACCG ACCCGAACGT CGTCATCGAC GTGCGCAAGG GGCTGGCCGA CGTGCGCAGC GCCTGGATCG AGGATCGCGG CGACACCGAG AAGCTGCCGG GCCTGACCTC GGAGTTCGGC CAGCGCCGCC TCTCCGACCC GGAACTGGCC GCCATGCGCT TCGCCCACGT ACGCAACCCG CGCCGCGCCA AGGCCGGGCA CAACGTCACG CAGATGCACT ATGCCAGGAA AGGCATCGTC ACCCCGGAGA TGGAATACGT CGCCATCCGC GAGAACATGA AGCTCGCCGA GGCGCGCGAG GCCGGCCTGC TCGCCGCGCA GCATCCGGGG CAGAGCTTCG GCGCCAGCAT TCCGAAGGAA ATCACCCCCG AGTTCGTCCG AGCCGAAGTC GCCCGCGGCC GCGCCATCAT CCCGGCCAAC ATCAACCACG TGGAGCTGGA GCCGATGATC ATCGGCCGCA ACTTCCTGGT GAAGATCAAC GGCAATATCG GCAACTCGGC GCTGGGATCG AGCATCGAAG AGGAAGTGGC CAAGCTGACC TGGGGCATCC GCTGGGGCTC GGACACCGTG ATGGACCTGT CCACCGGCAA GCACATCCAC GAGACCCGCG AGTGGATCAT CCGCAACTCG CCGGTGCCGA TCGGCACGGT GCCGATCTAC CAGGCGCTGG AAAAGGTCGG CGGCATCGCC GAGGACCTGA CCTGGGAGCT GTTCCGCGAC ACCCTGATCG AGCAGGCCGA GCAGGGCGTG GACTACTTCA CCATTCATGC CGGCGTGCTG CTGCGCTACG TGCCGCTGAC CGCGAAAAGG GTCACCGGCA TCGTCTCCCG CGGCGGTTCG ATCATGGCCA AATGGTGTCT GGCGCATCAC CAGGAAAACT TCCTCTACAC CCACTTCGAA GACATCTGCG ACATCATGAA GGCCTACGAT GTCAGCTTCT CCCTGGGCGA CGGCCTGCGC CCCGGCTCGG TGGCCGATGC CAACGACGCC GCCCAGTTCG GCGAACTGGA AACCCTCGGC GAACTGACCA GGGTCGCCTG GAAGCACGAG GTGCAGACCA TCATCGAAGG CCCCGGCCAC GTGCCGATGC ACATGATCAA GGAGAACATG GACAAGCAGC TCGAATGCTG CGACGAGGCG CCGTTCTACA CCCTCGGCCC GTTGGTCACC GACATCGCCC CCGGTTACGA CCACATCACC TCCGGCATCG GCGCGGCGAT GATCGGCTGG TTCGGCTGCG CCATGCTCTG CTACGTGACG CCCAAGGAGC ACCTCGGCCT GCCGAACAAG GACGATGTGA AGACCGGCAT CATCACCTAC AAGATCGCCG CGCACGCCGC CGATCTGGCC AAGGGCCATC CGGGCGCGCA GATCCGCGAC AACGCGCTGA GCAAGGCGCG CTTCGAGTTC CGCTGGGAGG ACCAGTTCAA CCTCGGCCTC GATCCGGACA CCGCGCGTGC CTTCCACGAC GAGACCCTGC CCAAGGACTC GGCCAAGGTG GCGCACTTCT GCTCCATGTG CGGGCCGAAG TTCTGTTCGA TGAAGATCAC CCAGGAAGTG CGCGACTACG CCGCCGAGCA CGGCCTGTCC 
GAGGAAAGCC AGGCCGTCGA GGCCGGTTTC CGGGAGCAGG CCGAGCGCTT CCGCGAAGAG GGCTCGGTGA TCTACAAGCA GGTCTGA
|
Protein sequence | MNVTEQQKNL SESAQVDQQS VQPFPRSRKI YVTGSRPDIR VPMREIGLDV TPTAFGGEIN PPVTVYDTSG PYTDPNVVID VRKGLADVRS AWIEDRGDTE KLPGLTSEFG QRRLSDPELA AMRFAHVRNP RRAKAGHNVT QMHYARKGIV TPEMEYVAIR ENMKLAEARE AGLLAAQHPG QSFGASIPKE ITPEFVRAEV ARGRAIIPAN INHVELEPMI IGRNFLVKIN GNIGNSALGS SIEEEVAKLT WGIRWGSDTV MDLSTGKHIH ETREWIIRNS PVPIGTVPIY QALEKVGGIA EDLTWELFRD TLIEQAEQGV DYFTIHAGVL LRYVPLTAKR VTGIVSRGGS IMAKWCLAHH QENFLYTHFE DICDIMKAYD VSFSLGDGLR PGSVADANDA AQFGELETLG ELTRVAWKHE VQTIIEGPGH VPMHMIKENM DKQLECCDEA PFYTLGPLVT DIAPGYDHIT SGIGAAMIGW FGCAMLCYVT PKEHLGLPNK DDVKTGIITY KIAAHAADLA KGHPGAQIRD NALSKARFEF RWEDQFNLGL DPDTARAFHD ETLPKDSAKV AHFCSMCGPK FCSMKITQEV RDYAAEHGLS EESQAVEAGF REQAERFREE GSVIYKQV
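The gene sequence is listed in coding orientation (it begins with ATG and ends with TGA) even though the gene lies on the minus strand, so it can be translated directly without reverse-complementing. The sketch below translates the first and last few codons and compares them with the protein sequence. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons. The names `CODON_TABLE` and `translate` are hypothetical and were introduced for this illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). The amino-acid assignments
# are the same for the standard code and for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases of the gene sequence, copied from the record
print(translate("ATGAACGTGA CAGAACAACA GAAGAACCTG"))  # MNVTEQQKNL

# Last 9 bases of the gene sequence, copied from the record
print(translate("CAGGTCTGA"))  # QV*
```

The first ten residues match the start of the listed protein sequence (MNVTEQQKNL), and the final codons give the terminal residues QV followed by the stop codon. Passing the full gene sequence to the same function would be the natural way to check all 628 residues.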