Gene LGAS_1223 details


Gene Information

Locus tag: LGAS_1223
Symbol:
ID: 4439581
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Lactobacillus gasseri ATCC 33323
Kingdom: Bacteria
Replicon accession: NC_008530
Strand:
Start bp: 1225549
End bp: 1226766
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 36%
IMG OID: 639673060
Product: thiamine biosynthesis protein ThiI
Protein accession: YP_815030
Protein GI: 116629858
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0301] Thiamine biosynthesis ATP pyrophosphatase
TIGRFAM ID: [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI
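
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are inclusive, and that the protein length excludes the stop codon. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1225549
end_bp = 1226766
gene_length_bp = 1218
protein_length_aa = 405

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the final stop codon adds no residue.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, codons, remainder, expected_protein_aa)
# prints: 1218 406 0 405
```

Under these assumptions the reported gene length (1218 bp) and protein length (405 aa) agree with the coordinates.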


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000164675
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 98
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAATATA CAGAAGTAAT GGTTCGCTAT GGCGAGCTAT CCACTAAAGG AAAAAATCGT 
AAAGATTTTA TTGGAAGATT AGCTGGTAAC GTAACTAGAG CTTTACAAGA TTTCCCAGAA
ATTGAAATAC ATCCTAAACA TGATCGAATG CACATTGTTT TAAATGGAGC TCCATTTGAT
AAAATCGATC AGCGATTAAA GCTTGTTTTT GGTATTCAAA CTTATTCGCC AACCATAAAA
GTTGAAAAGA ATCTTGATGC TATCAAAAAA GCTTCACTTG AATTAATGCA AGCGACTTTT
AAGGATGGAA TGACTTTTAA AGTTAATACT AGACGTAGTG ACCATGAATT TGAATATGAC
ACTAATCAAT TAAACACTAT GATTGGTGAT TACTTATTTG ATAATATGGA TAACTTAAAG
GTAAAAATGA AGAAGCCTGA TTTAGTCTTG AGAATTGAAG TTCGCCAAGA TGCTATCTAT
ATTTCAAATC AACTTCTTCA TGGTGCAGGT GGGATGCCAG TTGGTACGGC AGGAAGAGCA
GTGATGATGC TTTCAGGTGG AATTGATTCA CCAGTAGCTT CTTATCTCGC AATGAAGCGT
GGAGTTGAAA TTGATATGGT TCACTTCTTT AGTCCACCAT ATACTACAGA AAAAGCGCTA
GCTAAAGCAA AGGAACTTAC TGGAATTTTA GCTAACTATT CCGGAAAGAT TAATTTTATT
GCAGTACCTT TTACTGAAAT TCAAGAACAA ATTAAAGAAA AATTGCCAGA AGGTTATTTG
ATGACCATTC AGCGTCGCTT TATGCTTCAA CTAGCAGATC GTATTCGTGC AAAGCGTGGT
GGTTTAGCAA TTTTTAATGG AGAGTCAGTT GGTCAAGTAG CTTCACAAAC CTTAGAGTCA
ATGGTAGCGA TTAATGATGT TACTTCGACA CCTGTCCTTC GTCCTGTAGC CACAATGGAT
AAAACTGAAA TCATTAAGCT AGCTGAACAA ATTGGTACTT TTGATCTTTC TATTGAACCA
TTTGAAGATT GTTGTACTAT TTTTGCGCCA CCTCGTCCAA AGACTAAGCC TAAGCTAGAT
GAGGCTCGTA AGTTAGAAAA TAGACTTGAT GCCGAGAAAA TGATTCAACG CGCAATTGAT
GGAATGAAAA TTACACCAAT TTATCCAAAT CAAAAATTCT TGGATGATAA GGCTCAAGAA
GATGCAGACT TATTGTAA
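
The GC content field reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted as shown above (blocks of 10 bases separated by spaces and line breaks). The function name `gc_percent` is hypothetical.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    # Drop the spaces and line breaks used to group bases in blocks of 10.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence above: 2 of 10 bases are G or C.
print(gc_percent("ATGCAATATA"))
# prints: 20.0
```

Passing the full gene sequence to the same function gives the gene-wide value, which can then be compared with the GC content listed in the record.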
 
Protein sequence
MQYTEVMVRY GELSTKGKNR KDFIGRLAGN VTRALQDFPE IEIHPKHDRM HIVLNGAPFD 
KIDQRLKLVF GIQTYSPTIK VEKNLDAIKK ASLELMQATF KDGMTFKVNT RRSDHEFEYD
TNQLNTMIGD YLFDNMDNLK VKMKKPDLVL RIEVRQDAIY ISNQLLHGAG GMPVGTAGRA
VMMLSGGIDS PVASYLAMKR GVEIDMVHFF SPPYTTEKAL AKAKELTGIL ANYSGKINFI
AVPFTEIQEQ IKEKLPEGYL MTIQRRFMLQ LADRIRAKRG GLAIFNGESV GQVASQTLES
MVAINDVTST PVLRPVATMD KTEIIKLAEQ IGTFDLSIEP FEDCCTIFAP PRPKTKPKLD
EARKLENRLD AEKMIQRAID GMKITPIYPN QKFLDDKAQE DADLL
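
The protein sequence can be checked against the gene sequence by translating it codon by codon. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts. This gene begins with ATG, so the sketch below ignores alternative start codons. It is an illustration only: `translate` and `CODON_TABLE` are hypothetical names, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Codons in T, C, A, G order, paired with their amino acids ('*' = stop).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group bases in blocks of 10.
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCAATATA CAGAAGTAAT GGTTCGCTAT"))
# prints: MQYTEVMVRY
```

The output matches the first block of the protein sequence. The last line of the gene sequence translates to DADLL followed by the TAA stop codon, which matches the end of the protein.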