Gene Avin_45840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_45840 
SymbolthiI 
ID7763451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp4655448 
End bp4656902 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content65% 
IMG OID643807429 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_002801670 
Protein GI226946597 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTCA TCGTCAAGGT CTTCCCGGAA ATCACCATCA AGAGCCCGCC GGTGCGCAAG 
CGCTTCATTC GCCAACTGGC GAAGAACATC CGCACCGTGC TGCGCGACCT GGATCCGGAG
CTGGCGGTGA CGGGCGTGTG GGACAACCTC GAGGTGGAGA CCGCGGTCGA GAAGCCGCGC
CTGTTGCGCG AGATGATCGA GCGGCTGTGC TGCACGCCCG GTATCGCCCA CTTCCTCGAG
GTGCACGAGT ATCCCCTGGG CGACCTCGAC GACATCCTGG AGAAGTGCAA GCGCCACTAC
GCCGGGCAAC TGCCCGGCAA GATCTTCGCC GTGCGCTGCA AGCGCGCCGG CAGGCATCCC
TTCACCTCGA TGGAGGTGGA GCGCTACGTC GGCGGCTGCC TGCGCCAGCA GTGCGGTGCC
GCGGGCGTCT CGCTGAGCGC TCCGGAAGTG GAAGTGCGCA TGGAGATCCG CGACCAGCGC
CTGTTCGTCG TCCACCGTCA GCACGACAGC ATCGGCGGTT ACCCGCTGGG CGCCCTGGAG
CAGACCCTGG TGCTGATGTC CGGCGGCTTC GACTCCACCG TGGCGGCTTT CCAGACGATG
CGCCGCGGGC TGATGACCCA TTTCTGCTTC TTCAACCTGG GCGGGCGCGC CCACGAGCTG
GGGGTGATGG AAGTCGCCCA CTACCTCTGG CAGAAGTACG GCAGCTCGCA GCGCGTGCTG
TTCGTCAGCG TGCCCTTCGA GGAGGTGGTC GGCGAGATCC TCGGCAAGGT CGACAACAGC
CAGATGGGCG TGATCCTGAA GCGCATGATG CTGCGCGCCG CCACGCGCGT CGCCGAGCGC
CTGAAGATCG ACGCGCTGGT CACCGGCGAA GCCATCTCCC AGGTGTCCAG CCAGACCCTG
CCGAACCTCT CGGTGATCGA TTCGGCCACC GACATGCTGG TGCTGCGCCC ACTGATCGCC
AGCCACAAGC AGGACATCAT CGACACCGCG ACGCGCATCG GCACCGCCGA GTTCGCCCGG
CACATGCCCG AATACTGCGG GGTGATCTCG GTGAATCCGA CCACCCGCGC CAGGCGTGAC
CGCATCGATT ACGAGGAGCG CCAGTTCGAC ATGGCGATCC TCGAGCGCGC GCTGGAGCGC
GCCCACCTGG TGCCGATCGA CCGGGTGATC GACGAACTGG GCGAGGAGAT TTGCGTGGAG
GAGGTCCGCG AGGCTCTGGC CGGACAGATC GTCCTCGACA TCCGCCATCC GGAGGCGGTC
GAGGACGGGC CGCTGGAGTT GCCCGGCATC GAGGTGCGGG CGCTGCCTTT CTATGCGCTG
AACAACCGCT TCAAGGAACT GGACAGCAAC CGCCAATACC TTCTTTATTG CGACAAGGGT
GTCATGAGCC GCCTGCATGC TCATCACCTG CTGAGCGAGG GACATGCCAA TGTGCGCGTT
TATCGTCCGG CATAG
 
Protein sequence
MKLIVKVFPE ITIKSPPVRK RFIRQLAKNI RTVLRDLDPE LAVTGVWDNL EVETAVEKPR 
LLREMIERLC CTPGIAHFLE VHEYPLGDLD DILEKCKRHY AGQLPGKIFA VRCKRAGRHP
FTSMEVERYV GGCLRQQCGA AGVSLSAPEV EVRMEIRDQR LFVVHRQHDS IGGYPLGALE
QTLVLMSGGF DSTVAAFQTM RRGLMTHFCF FNLGGRAHEL GVMEVAHYLW QKYGSSQRVL
FVSVPFEEVV GEILGKVDNS QMGVILKRMM LRAATRVAER LKIDALVTGE AISQVSSQTL
PNLSVIDSAT DMLVLRPLIA SHKQDIIDTA TRIGTAEFAR HMPEYCGVIS VNPTTRARRD
RIDYEERQFD MAILERALER AHLVPIDRVI DELGEEICVE EVREALAGQI VLDIRHPEAV
EDGPLELPGI EVRALPFYAL NNRFKELDSN RQYLLYCDKG VMSRLHAHHL LSEGHANVRV
YRPA