Gene Information |
Locus tag | Avin_45840 |
Symbol | thiI |
ID | 7763451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4655448 |
End bp | 4656902 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643807429 |
Product | thiamine biosynthesis protein ThiI |
Protein accession | YP_002801670 |
Protein GI | 226946597 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
|
|
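The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes inclusive 1-based coordinates and a complete CDS ending in a single stop codon; the record does not state either convention explicitly, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming inclusive 1-based coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming a complete CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length_bp = gene_length(4655448, 4656902)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1455 484
```

Both results match the Gene Length (1455 bp) and Protein Length (484 aa) fields.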
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTCA TCGTCAAGGT CTTCCCGGAA ATCACCATCA AGAGCCCGCC GGTGCGCAAG CGCTTCATTC GCCAACTGGC GAAGAACATC CGCACCGTGC TGCGCGACCT GGATCCGGAG CTGGCGGTGA CGGGCGTGTG GGACAACCTC GAGGTGGAGA CCGCGGTCGA GAAGCCGCGC CTGTTGCGCG AGATGATCGA GCGGCTGTGC TGCACGCCCG GTATCGCCCA CTTCCTCGAG GTGCACGAGT ATCCCCTGGG CGACCTCGAC GACATCCTGG AGAAGTGCAA GCGCCACTAC GCCGGGCAAC TGCCCGGCAA GATCTTCGCC GTGCGCTGCA AGCGCGCCGG CAGGCATCCC TTCACCTCGA TGGAGGTGGA GCGCTACGTC GGCGGCTGCC TGCGCCAGCA GTGCGGTGCC GCGGGCGTCT CGCTGAGCGC TCCGGAAGTG GAAGTGCGCA TGGAGATCCG CGACCAGCGC CTGTTCGTCG TCCACCGTCA GCACGACAGC ATCGGCGGTT ACCCGCTGGG CGCCCTGGAG CAGACCCTGG TGCTGATGTC CGGCGGCTTC GACTCCACCG TGGCGGCTTT CCAGACGATG CGCCGCGGGC TGATGACCCA TTTCTGCTTC TTCAACCTGG GCGGGCGCGC CCACGAGCTG GGGGTGATGG AAGTCGCCCA CTACCTCTGG CAGAAGTACG GCAGCTCGCA GCGCGTGCTG TTCGTCAGCG TGCCCTTCGA GGAGGTGGTC GGCGAGATCC TCGGCAAGGT CGACAACAGC CAGATGGGCG TGATCCTGAA GCGCATGATG CTGCGCGCCG CCACGCGCGT CGCCGAGCGC CTGAAGATCG ACGCGCTGGT CACCGGCGAA GCCATCTCCC AGGTGTCCAG CCAGACCCTG CCGAACCTCT CGGTGATCGA TTCGGCCACC GACATGCTGG TGCTGCGCCC ACTGATCGCC AGCCACAAGC AGGACATCAT CGACACCGCG ACGCGCATCG GCACCGCCGA GTTCGCCCGG CACATGCCCG AATACTGCGG GGTGATCTCG GTGAATCCGA CCACCCGCGC CAGGCGTGAC CGCATCGATT ACGAGGAGCG CCAGTTCGAC ATGGCGATCC TCGAGCGCGC GCTGGAGCGC GCCCACCTGG TGCCGATCGA CCGGGTGATC GACGAACTGG GCGAGGAGAT TTGCGTGGAG GAGGTCCGCG AGGCTCTGGC CGGACAGATC GTCCTCGACA TCCGCCATCC GGAGGCGGTC GAGGACGGGC CGCTGGAGTT GCCCGGCATC GAGGTGCGGG CGCTGCCTTT CTATGCGCTG AACAACCGCT TCAAGGAACT GGACAGCAAC CGCCAATACC TTCTTTATTG CGACAAGGGT GTCATGAGCC GCCTGCATGC TCATCACCTG CTGAGCGAGG GACATGCCAA TGTGCGCGTT TATCGTCCGG CATAG
|
Protein sequence | MKLIVKVFPE ITIKSPPVRK RFIRQLAKNI RTVLRDLDPE LAVTGVWDNL EVETAVEKPR LLREMIERLC CTPGIAHFLE VHEYPLGDLD DILEKCKRHY AGQLPGKIFA VRCKRAGRHP FTSMEVERYV GGCLRQQCGA AGVSLSAPEV EVRMEIRDQR LFVVHRQHDS IGGYPLGALE QTLVLMSGGF DSTVAAFQTM RRGLMTHFCF FNLGGRAHEL GVMEVAHYLW QKYGSSQRVL FVSVPFEEVV GEILGKVDNS QMGVILKRMM LRAATRVAER LKIDALVTGE AISQVSSQTL PNLSVIDSAT DMLVLRPLIA SHKQDIIDTA TRIGTAEFAR HMPEYCGVIS VNPTTRARRD RIDYEERQFD MAILERALER AHLVPIDRVI DELGEEICVE EVREALAGQI VLDIRHPEAV EDGPLELPGI EVRALPFYAL NNRFKELDSN RQYLLYCDKG VMSRLHAHHL LSEGHANVRV YRPA
|
| |