Gene PA14_67580 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPA14_67580 
SymbolthiI 
ID4383430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas aeruginosa UCBPP-PA14 
KingdomBacteria 
Replicon accessionNC_008463 
Strand
Start bp6036946 
End bp6038400 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content65% 
IMG OID639328052 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_793588 
Protein GI116053265 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTCA TCGTCAAGAC CTTCCAGGAA ATCACCATCA AGAGCCGGCC GGTGCGCAAG 
CGCTTCATCC GGCAGTTGGC GAAGAACATC CGCGCGGTGC TGCGCGACCT GGACCCGGAG
CTGAAGGTCG AGGGCGAGTG GGACAACCTT GAGGTAGAGA CCGCCGTGGT CGACGCCAAG
GTCAGGCGCG AAATGATCGA GCGCCTGACC TGCACCCCGG GCATCGGTCA TTTCCTCGAG
GTCCACGAGT ATCCGCTGGG CGATTTCGAC GACATCCTCG CCAAGTGCAA GGCGCATTTC
GGCGACCAGT TGGCCGGCAA GACCTTCGCC GTGCGCTGCA AGCGCGCCGG CAAGCACGCG
TTCACCTCGA TGGAGGTGGA GCGTTACGTC GGCAGCGGCC TGCGCCGCGA ATGCGGGGCT
GCCGGGATCG ACCTGAAGCA GCCGGAAGTC GAGGTGCGGA TGGAAATCCG CCTCGACCGC
CTGTTCGTCA TCCATCGCCA GCATCCGGGC CTGGGCGGCT ATCCGCTGGG CGCGCTGGAA
CAGGTGCTGG TGCTGATGTC CGGCGGCTTC GACTCGACCG TGGCGGCCTA CCAGATGATG
CGCCGCGGGA TGATCAGCCA CTTCGTGTTC TTCAACCTCG GCGGGCGCGC CCACGAACTG
GGGGTGATGG AAGTCGCCCA CTACCTGTGG GAGAAGTACG GCCGCTCGCA GCGCGTGCTG
TTCATCAGCG TGCCGTTCGA GGAAGTGGTC GGCGAGATCC TCACCAAGGT CGACGACAGC
TATATGGGCG TGACCCTCAA GCGCATGATG CTGCGCGCCG CCAGCCGCGT GGCCGAGCGC
CTGGAGCTGG ACGCGCTGGT GACCGGCGAG GCGATCTCCC AGGTGTCCAG CCAGACCCTG
CCGAACCTCT CGGTGATCGA CCGGGTTACC GACACCCTGG TGCTGCGCCC GCTGATCGTC
AGCCACAAGC AGGACATCAT CGACACCGCC CGGCAGATCG GCACCGCCGA GTTCGCCCGG
CACATGCCGG AATACTGCGG GGTGATCTCG GTGAACCCGA CCACCCAGGC CAAGCCCTAC
CGCGTCGAGC ACGAAGAGTC GAAATTCGAC ATGGCGGTGC TCGAGCGCGC CCTGGAGCGC
GCCACCCAGC GCACCGTCGA CCGGGTGATC GACGAACTCG GCCAGGACCT GCAGGTGGAA
GAGGTCGGCG AGGTGCTGCC CGGTCAGATT GTCATCGATA TCCGCCATCC TGATGCCCAG
GAAGACGAAC CCCTGGCCCT GGAAGGCGTC GAAGTCCAGG CGCTGCCGTT CTACGCGATC
AACAGCCGCT TCAAGGAACT GGACGCCAAC CGCCAGTACC TCCTGTATTG CGACAAAGGG
GTGATGAGCC GCCTGCATGC CCATCATCTG CTCAACGAGG GGCACACCAA TGTGCGTGTT
TATCGTCCGG CTTAA
 
Protein sequence
MKLIVKTFQE ITIKSRPVRK RFIRQLAKNI RAVLRDLDPE LKVEGEWDNL EVETAVVDAK 
VRREMIERLT CTPGIGHFLE VHEYPLGDFD DILAKCKAHF GDQLAGKTFA VRCKRAGKHA
FTSMEVERYV GSGLRRECGA AGIDLKQPEV EVRMEIRLDR LFVIHRQHPG LGGYPLGALE
QVLVLMSGGF DSTVAAYQMM RRGMISHFVF FNLGGRAHEL GVMEVAHYLW EKYGRSQRVL
FISVPFEEVV GEILTKVDDS YMGVTLKRMM LRAASRVAER LELDALVTGE AISQVSSQTL
PNLSVIDRVT DTLVLRPLIV SHKQDIIDTA RQIGTAEFAR HMPEYCGVIS VNPTTQAKPY
RVEHEESKFD MAVLERALER ATQRTVDRVI DELGQDLQVE EVGEVLPGQI VIDIRHPDAQ
EDEPLALEGV EVQALPFYAI NSRFKELDAN RQYLLYCDKG VMSRLHAHHL LNEGHTNVRV
YRPA