Gene Strop_1914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_1914 
Symbol 
ID5058376 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp2186619 
End bp2187797 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content68% 
IMG OID640474187 
Productthiamin pyrophosphokinase, catalytic region 
Protein accessionYP_001158754 
Protein GI145594457 
COG category[S] Function unknown 
COG ID[COG4825] Uncharacterized membrane-anchored protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0967076 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.164651 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCTAC CTACGTTGCG CTGGACTCGA CCCGCCGAGC CGGGTCGGGC GGGCGGCACC 
GCCCGCCTGG ACCGTCGGAC CAAACGTCTG GTTGGCCGGC TCCGTCCCGG TGACGTCGCC
GTGATCGACC ACGTCGACCT GGACCGGGTC GCCGCCGATT CGCTGGTCGC GGTCGGTGTC
GGGGCTGTCC TCAACGCCAA GCCGTCGGTC TCGGGCCGCT ATCCCAATCT CGGCCCGGAA
GTGCTGATCG AGGCTGGTAT CCCGCTCCTG GACGACCTGG GCGAGGATGT CTTCGAACGG
ATCCAGGAGG GCGACACCGT CCGGATCGAG GGCAACACGG TCTATCTCGG CGAAGAGCCG
GTGGCCCACG GTGATCTGCA GGACGCGGAG ACCATCGGCA AGGCGATGGC CGATGCCCGG
GAGGGGCTAT CGGTCCAGCT GGAGGCGTTC GCAGCGAACA CCATGGTCTA CCTGAAGCAG
GAGCGGGACC TGCTGCTGTA CGGGGTGGGC GTTCCGGACA TCCGTACCGA GATTCAGGGC
CGGCACTGCC TGATCGTGGT GCGCGGCTAC GACTACAAGG CCGACCTGGA TGTGCTGCGC
CCGTACATCC GGGAGTTCAA GCCGGTGCTC ATCGGCGTCG ACGGCGGGGC GGACGCCCTG
GTCGAGGCCG GCTATCCACC CGACCTGATC ATCGGTGACA TGGACTCGGT GACCGACGAC
GTGCTGCGTT GCGGCGCCGA GGTCGTGGTA CACGCCTACC CAGACGGTCG TGCGCCCGGG
CTGGCCCGGG TCAATGGTCT CGGCGTTCCG GCGGTCACCT TTCCCGCCGC CGCCACCAGC
GAGGACCTGG CGATGCTGCT CGCCGACGAG AAGGGGGCCT CGCTCCTGGT GGCGGTCGGC
ACACACGCCA CGCTCGTCGA GTTCCTGGAC AAGGGACGGG GCGGGATGGC GTCGACCTTC
CTCACCCGGC TGAAGGTCGG CGGCAAGCTG GTTGACGCCA AGGGCGTAAG CCGGCTCTAC
CGGCAGAGCA TCTCCGGATC CTCACTGCTG CTGCTGGTGC TGTCCGCGAT TGCCGCGATG
GCCTCGGCTG TTGCGGTCTC CACCGTCGGC AAGGCGTACC TGGGTGTGGT CTCCGAGTGG
TGGAGCAATT TTGTGTTCCA GCTGGAACGG CTCTTCTGA
 
Protein sequence
MRLPTLRWTR PAEPGRAGGT ARLDRRTKRL VGRLRPGDVA VIDHVDLDRV AADSLVAVGV 
GAVLNAKPSV SGRYPNLGPE VLIEAGIPLL DDLGEDVFER IQEGDTVRIE GNTVYLGEEP
VAHGDLQDAE TIGKAMADAR EGLSVQLEAF AANTMVYLKQ ERDLLLYGVG VPDIRTEIQG
RHCLIVVRGY DYKADLDVLR PYIREFKPVL IGVDGGADAL VEAGYPPDLI IGDMDSVTDD
VLRCGAEVVV HAYPDGRAPG LARVNGLGVP AVTFPAAATS EDLAMLLADE KGASLLVAVG
THATLVEFLD KGRGGMASTF LTRLKVGGKL VDAKGVSRLY RQSISGSSLL LLVLSAIAAM
ASAVAVSTVG KAYLGVVSEW WSNFVFQLER LF